Common feature engineering techniques for numerical features
In the previous section, you learned how to handle missing data. In this section, you’ll learn about feature engineering techniques that are commonly applied to numeric variables. These techniques do things such as transform data so that it’s more conducive to modeling. Let’s get started.
Performing a log transform
The log transformation is a powerful technique, particularly for transforming a variable into a normal distribution. In this section, you’ll apply log transformation to positive numerical variables that exhibit non-normal distributions within the house prices dataset. You can refer to the data description provided on Kaggle for a comprehensive understanding of the variables that are subjected to log transformation. Let’s get started:
- To get started, plot the variables you’ll transform to see how they don’t follow a normal distribution initially. You...