Loading and examining the Diabetes dataset
For this exemplar project, principles similar to that of loading the data are used as before: loading the .csv
file in the Python environment (Spyder IDE). Before starting with loading the data or any other further procedures, all the required libraries are loaded.
We will be using the pandas
library to load, process, and describe the data, sklearn
to get the dataset, and other libraries such as matplotlib
and seaborn
to make different data visualizations. Make sure all of them are installed (i.e., run pip install packagename
in Command Prompt on Microsoft Windows or Terminal on Linux/macOS, as explained in Chapter 2 and Chapter 3); you can visit pypi.com for more information on installing the packages.
Here are the required libraries:
#load the libraries needed to perform the exemplar project 1 import pandas as pd from sklearn import datasets import matplotlib import matplotlib.pyplot as plot import seaborn as sns
The dataset used...