Traffic Violations Dataset contains around 65k+ traffic-related violation records. The attributes are Date of violation, Time of violation, Country name, Gender of violators (Male-M, Female-F), Age of violators, Race of violators, Category of violation, Search conducted, Result of violation, Arrest information, Detained time taken by the violators to stop and Involvement in drugs etc. You can download the dataset traffic_violaions.csv from the following link: https://www.kaggle.com/shubamsumbria/traffic-violations-dataset Write a summary of your understanding of the purpose and contents of this dataset and your assessment of the quality of the data. To do this, you must develop code to explore the data programmatically in a notebook and provide it as part of your answer. Write a summary (~ 300 words) in word processing document which includes the following: The contents of the above dataset with detailed description The quality of the data with respect to validity, accuracy, completeness, consistency and uniformity Estimate the amount of dirtiness of the data of each type and discuss its potential impact of the goal of the analysis
Traffic Violations Dataset contains around 65k+ traffic-related violation records. The attributes are Date of violation, Time of violation, Country name, Gender of violators (Male-M, Female-F), Age of violators, Race of violators, Category of violation, Search conducted, Result of violation, Arrest information, Detained time taken by the violators to stop and Involvement in drugs etc.
You can download the dataset traffic_violaions.csv from the following link:
https://www.kaggle.com/shubamsumbria/traffic-violations-dataset
Write a summary of your understanding of the purpose and contents of this dataset and your assessment of the quality of the data. To do this, you must develop code to explore the data programmatically in a notebook and provide it as part of your answer.
Write a summary (~ 300 words) in word processing document which includes the following:
- The contents of the above dataset with detailed description
- The quality of the data with respect to validity, accuracy, completeness, consistency and uniformity
- Estimate the amount of dirtiness of the data of each type and discuss its potential impact of the goal of the analysis
Step by step
Solved in 3 steps with 15 images