Contents

Airplane Crashes and Fatalities

This project entails analyzing a detailed dataset chronicling airplane crashes and fatalities globally from 1908 to 2023. The aim is to uncover insights into trends, patterns, and contributing factors over time. Check it out on GitHub.

Airplane Incidents Analysis

Overview

This project involves the analysis of a comprehensive dataset detailing the history of airplane crashes and fatalities worldwide from 1908 to 2023. The goal is to derive insights into trends, patterns, and factors contributing to airplane incidents over time.

Dataset

Analysis Process

  1. Initial Exploration

    • Loaded the dataset and reviewed its structure.
    • Checked for missing values and handled them appropriately.
    • Identified and dropped unnecessary columns for analysis.
  2. Temporal Analysis

    • Yearly Trends
      • Converted the ‘Date’ column to datetime format.
      • Created a ‘Year’ column by extracting the year from the ‘Date’.
      • Grouped the data by year and analyzed the trends in the number of airplane incidents.
    • Seasonal Patterns
      • Extracted the month from the ‘Date’ column and analyzed the seasonal patterns in airplane incidents.
  3. Geographical Analysis

    • Global Distribution
      • Grouped the data by location to identify the location with the highest number of airplane incidents.
    • Location-specific Factors
      • Extracted city information from the ‘Location’ column and identified cities with a higher incidence of airplane incidents.
  4. Aircraft and Operator Analysis

    • Common Aircraft Types
      • Identified the most common aircraft types involved in incidents.
    • Operator Patterns
      • Analyzed operators with the highest and lowest incident rates.
      • Explored trends related to the operator’s region or country.
  5. Fatality Analysis

    • Fatalities Over Time
      • Examined trends in the number of fatalities over the years.
    • Periods with Higher Fatality Rate
      • Calculated and analyzed the fatality rate over time.
      • Identified specific periods with a higher fatality rate.
  6. Potential Causes and Factors

    • Probable Causes
      • Processed and tokenized the ‘Summary’ column.
      • Analyzed the most common probable causes of airplane incidents.
    • Temporal Analysis of Probable Causes
      • Extended the analysis to identify shifts in predominant causes over time.
      • Visualized changes in the most common probable causes over the years.

Challenges Faced

  • Dealing with missing values and deciding on appropriate handling strategies.
  • Ensuring proper tokenization and text processing for the ‘Summary’ column.
  • Addressing potential conflicts with duplicate column names during the analysis.

Conclusions

  • Yearly trends suggest fluctuations in the number of airplane incidents, indicating potential factors influencing aviation safety.
  • Geographical analysis reveals insights into regions and cities with higher incident rates.
  • Common aircraft types and operators associated with incidents are identified.
  • Fatality analysis indicates periods with higher fatality rates, prompting further investigation into contributing factors.
  • Analysis of probable causes provides valuable insights into recurrent issues, helping inform safety measures.

Author