Airplane Crashes and Fatalities
Contents
This project entails analyzing a detailed dataset chronicling airplane crashes and fatalities globally from 1908 to 2023. The aim is to uncover insights into trends, patterns, and contributing factors over time. Check it out on GitHub.
Airplane Incidents Analysis
Overview
This project involves the analysis of a comprehensive dataset detailing the history of airplane crashes and fatalities worldwide from 1908 to 2023. The goal is to derive insights into trends, patterns, and factors contributing to airplane incidents over time.
Dataset
- Dataset Source: Airplane_Crashes_and_Fatalities_Since_1908_t0_2023.csv
- Columns: Date, Time, Location, Operator, Flight #, Route, AC Type, Registration, Aboard, Aboard Passengers, Aboard Crew, Fatalities, Fatalities Passengers, Fatalities Crew, Ground, Summary
Analysis Process
-
Initial Exploration
- Loaded the dataset and reviewed its structure.
- Checked for missing values and handled them appropriately.
- Identified and dropped unnecessary columns for analysis.
-
Temporal Analysis
- Yearly Trends
- Converted the ‘Date’ column to datetime format.
- Created a ‘Year’ column by extracting the year from the ‘Date’.
- Grouped the data by year and analyzed the trends in the number of airplane incidents.
- Seasonal Patterns
- Extracted the month from the ‘Date’ column and analyzed the seasonal patterns in airplane incidents.
- Yearly Trends
-
Geographical Analysis
- Global Distribution
- Grouped the data by location to identify the location with the highest number of airplane incidents.
- Location-specific Factors
- Extracted city information from the ‘Location’ column and identified cities with a higher incidence of airplane incidents.
- Global Distribution
-
Aircraft and Operator Analysis
- Common Aircraft Types
- Identified the most common aircraft types involved in incidents.
- Operator Patterns
- Analyzed operators with the highest and lowest incident rates.
- Explored trends related to the operator’s region or country.
- Common Aircraft Types
-
Fatality Analysis
- Fatalities Over Time
- Examined trends in the number of fatalities over the years.
- Periods with Higher Fatality Rate
- Calculated and analyzed the fatality rate over time.
- Identified specific periods with a higher fatality rate.
- Fatalities Over Time
-
Potential Causes and Factors
- Probable Causes
- Processed and tokenized the ‘Summary’ column.
- Analyzed the most common probable causes of airplane incidents.
- Temporal Analysis of Probable Causes
- Extended the analysis to identify shifts in predominant causes over time.
- Visualized changes in the most common probable causes over the years.
- Probable Causes
Challenges Faced
- Dealing with missing values and deciding on appropriate handling strategies.
- Ensuring proper tokenization and text processing for the ‘Summary’ column.
- Addressing potential conflicts with duplicate column names during the analysis.
Conclusions
- Yearly trends suggest fluctuations in the number of airplane incidents, indicating potential factors influencing aviation safety.
- Geographical analysis reveals insights into regions and cities with higher incident rates.
- Common aircraft types and operators associated with incidents are identified.
- Fatality analysis indicates periods with higher fatality rates, prompting further investigation into contributing factors.
- Analysis of probable causes provides valuable insights into recurrent issues, helping inform safety measures.
Author
- Name: Mohammed Mebarek Mecheter
- Email: mohammedmecheter@gmail.com