CS628 – Data Science
Project # 5: Visualization Assignment
Monroe College
For the following questions (1 to 3), work with the bank_marketing_training data set. Use Python.
Attached is the file. Start with the following code.
1. # import required package 2. import pandas as pd 3. 4. # read the csv bank_train data using the pandas package 5. bank_train = pd.read_csv(“/Users/edeki/Desktop/Website Data Sets/bank_marketing_training”)
1.
Create a bar graph of the previous_outcome variable, with response overlay.
2.
Create a normalized bar graph of previous_outcome variable with response overlay. Describe the
relationship between previous_outcome and response.
3.
Examine the non‐normalized and normalized histograms of duration, with overlay of response.
Identify cutoff point(s) for duration, which separate low values of response from high values.
Define a new categorical variable, duration_binned, using the cutoff points you identified.
For the following questions (4 to 5), work with the adult_ch3_training data set
1. # read the csv adult_ch3_training data using the pandas package 2. adult_ch3_train = pd.read_csv(“/Users/edeki/Desktop/Website Data Sets/adult_ch3_training”)
4.
Consider capital‐loss. Identify the outliers in capital‐loss using the Z‐score method. How many outliers are there?
5.
Construct a bar graph of Income for these outlier records.
For this short paper activity, you will learn about the three delays model, which explains…
Topic : Hospital adult medical surgical collaboration area a. Current Menu Analysis (5 points/5%) Analyze…
As a sales manager, you will use statistical methods to support actionable business decisions for Pastas R Us,…
Read the business intelligence articles: Getting to Know the World of Business Intelligence Business intelligence…
The behaviors of a population can put it at risk for specific health conditions. Studies…