Impact of Numerical Features on the Outcome
In this section, we will analyze the relationship between the numerical features (already identified in Exercise 3.01, Analyzing Distributions of Numerical Features in the Banking Dataset) and the outcome of a marketing campaign, which is recorded in the y column of the banking dataset.
We will start our analysis by addressing the following question: Is there a statistically significant difference in the numerical features between successful and unsuccessful marketing campaigns? To answer it, we will first create violin plots (as shown in the previous chapters) that compare the distribution of each numerical feature for the two types of outcomes ("yes" for a successful marketing campaign, "no" for an unsuccessful one):
""" create violin plots for successful and non-successful marketing campaigns """ plt.figure(figsize=(10,18)) for index, col in enumerate(numerical_features): &...