First, checking the distribution of the log-transformed values:
# Distribution of the log-transformed feature 'kilometers_driven'
sns.distplot(np.log(data["kilometers_driven"]), axlabel="Log(kilometers_driven)");
Then doing the log transformation:
data["kilometers_driven_log"] = np.log(data["kilometers_driven"])
Lastly, dropping the original column:
data.drop(['kilometers_driven'], axis=1, inplace=True)
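The steps above can be tried end to end on a small stand-in dataset. This is only a sketch: the sample values are made up for illustration and are not from the real data, and the skewness check is an extra added here to show the effect of the transformation. Note that np.log needs strictly positive values; if the column could contain zeros, np.log1p would be one alternative.

```python
import numpy as np
import pandas as pd

# Hypothetical sample values standing in for the real dataset.
data = pd.DataFrame(
    {"kilometers_driven": [1_000, 25_000, 60_000, 120_000, 650_000]}
)

# Skewness of the raw column (long right tail from the largest value).
skew_before = data["kilometers_driven"].skew()

# Add the log-transformed feature, then drop the original column.
data["kilometers_driven_log"] = np.log(data["kilometers_driven"])
data.drop(["kilometers_driven"], axis=1, inplace=True)

# Skewness of the transformed column, for comparison.
skew_after = data["kilometers_driven_log"].skew()
print(f"skew before: {skew_before:.2f}, skew after: {skew_after:.2f}")
```

With this sample, the transformed column ends up less skewed than the raw one, which is the reason for preferring the log feature.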