For the dataset, adultsData.csv (in the assignment folder), it is required to analyze the dataset to answer the following questions. Follow the data analysis process and highlight the different phases you follow throughout your analysis. Incorporate visualisation in your analysis and add comments that conclude your findings. Then, upload your final program as a single Jupyter Notebook file.

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
icon
Concept explainers
Question

adultsData.csv file: https://onq.queensu.ca/d2l/common/viewFile.d2lfile/Database/MzA1MzUwMzA/adultsData.csv?ou=748849

For the dataset, adultsData.csv (in the assignment folder), it is required to analyze the dataset to answer the
following questions. Follow the data analysis process and highlight the different phases you follow
throughout your analysis. Incorporate visualisation in your analysis and add comments that conclude your
findings. Then, upload your final program as a single Jupyter Notebook file.
1. How many men and women are represented in this dataset?
2. What is the average age of women?
3. What is the percentage of German citizens and Canadian citizens?
4. What are the mean and standard deviation of the age for those who earn more than 50K per year and
those who earn less than 50K per year?
5. What is the education level of people who earn more than 50K? Is it true that they have at least high
school education?
6. Display age statistics for each race and each gender. Use groupby() and describe(). Find the
maximum age of men and women in each race group.
7. Among whom is the proportion of those who earn greater than 50K: married or single? Consider as
married those who have a marital-status starting with Married (e.g., Married-civ-spouse, Married-
spouse-absent, Married-AF-spouse, etc.).
8. What is the maximum number of hours a person works per week? How many people work such a
number of hours, and what is the percentage of those who earn more than 50K among them?
9. Count the average time of work (hours-per-week) for those who earn less than 50K and more than
50K for each country. Compare these averages for Japan and Canada.
Transcribed Image Text:For the dataset, adultsData.csv (in the assignment folder), it is required to analyze the dataset to answer the following questions. Follow the data analysis process and highlight the different phases you follow throughout your analysis. Incorporate visualisation in your analysis and add comments that conclude your findings. Then, upload your final program as a single Jupyter Notebook file. 1. How many men and women are represented in this dataset? 2. What is the average age of women? 3. What is the percentage of German citizens and Canadian citizens? 4. What are the mean and standard deviation of the age for those who earn more than 50K per year and those who earn less than 50K per year? 5. What is the education level of people who earn more than 50K? Is it true that they have at least high school education? 6. Display age statistics for each race and each gender. Use groupby() and describe(). Find the maximum age of men and women in each race group. 7. Among whom is the proportion of those who earn greater than 50K: married or single? Consider as married those who have a marital-status starting with Married (e.g., Married-civ-spouse, Married- spouse-absent, Married-AF-spouse, etc.). 8. What is the maximum number of hours a person works per week? How many people work such a number of hours, and what is the percentage of those who earn more than 50K among them? 9. Count the average time of work (hours-per-week) for those who earn less than 50K and more than 50K for each country. Compare these averages for Japan and Canada.
Expert Solution
steps

Step by step

Solved in 2 steps

Blurred answer
Knowledge Booster
Query Syntax
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education