Technical requirements
To effectively follow along with the code examples and exercises in this chapter on categorical encoding techniques, you will need to have the following Python libraries. The code provided in this chapter consists of Python functions designed to implement various encoding techniques for categorical data. All code examples and datasets used in this chapter are available on our GitHub repository at https://github.com/PacktPublishing/XGBoost-for-Regression-Predictive-Modeling-and-Time-Series-Analysis/blob/main/ch8/categorical_encoding.ipynb.
You will need the following Python packages:
- pandas 1.4.2
- NumPy 1.21.5
- Scikit-learn 1.2.2
- Matplotlib
As in Chapter 7, you will be using the Housing Price dataset. Access is available through the Kaggle platform at the following link: https://www.kaggle.com/competitions/house-prices-advanced-regression-techniques/data. To gain an understanding of the variables within the dataset, you can refer to...