Switching the quick start code to a different dataset
Before you build a model, you will want to get an understanding of the housing value data and make some graphs to visualize various parameters in the dataset, just as you did with the Iris dataset. You will also want to check for problems such as missing values, values that don’t make sense, and so on. In the case of the housing dataset, we have a number of parameters about the houses in a census block: average age, average number of bedrooms, average number of rooms, location, number of occupants, and a column with the median house value for that block. In the US census, a block is the smallest geographical unit for which the US Census Bureau publishes data. We will train a model to predict the value of a house given the values for the other columns.
Let’s get started.
Downloading the housing dataset
Perform the following steps to load the dataset:
- Setting up the Python environment: You can start...