Dr. Semmelweis Handwashing
We will reanalyze the data that made Semmelweis discover the importance of handwashing and its impact on the hospital. For further details, please refer to the GitHub repository linked here.
Dr. Semmelweis and the Importance of Handwashing
Project Description
In 1847, the Hungarian physician Ignaz Semmelweis made a breakthrough discovery: he discovered handwashing and enforced it at his hospital to save hundreds of lives. This project aims to reanalyze the data behind one of the most important discoveries in modern medicine: handwashing.
Hungarian physician Dr. Ignaz Semmelweis worked at the Vienna General Hospital with childbed fever patients. Childbed fever is a deadly disease affecting women who have just given birth, and in the early 1840s, as many as 10% of the women giving birth died from it at the Vienna General Hospital. Dr. Semmelweis discovered that it was the contaminated hands of the doctors delivering the babies, and on June 1st, 1847, he decreed that everyone should wash their hands, an unorthodox and controversial request; nobody in Vienna knew about bacteria.
The data used for this analysis is stored as two CSV files:
-
yearly_deaths_by_clinic.csv
: contains the number of women giving birth at the two clinics at the Vienna General Hospital between the years 1841 and 1846.year
: Years (1841-1846)births
: Number of birthsdeaths
: Number of deathsclinic
: Clinic 1 or clinic 2
-
monthly_deaths.csv
: contains data from ‘Clinic 1’ of the hospital where most deaths occurred.date
: Date (YYYY-MM-DD)births
: Number of birthsdeaths
: Number of deaths
How much did handwashing reduce monthly death rates on average?
Project Tasks:
- Load the CSV files into yearly and monthly data frames and check the data.
- Add a
proportion_deaths
column to each dataframe, calculating the proportion of deaths per number of births for each year in yearly and each month in monthly. - Create two ggplot line plots: one for the yearly proportion of deaths and another for the monthly proportion of deaths. For the yearly plot, create a different colored line for each clinic.
- Add a
handwashing_started
boolean column to themonthly
dataframe using June 1st, 1847, as the threshold;TRUE
should mean that handwashing has started at the clinic. Plot the new dataframe with different colored lines depending onhandwashing_started
. - Calculate the mean proportion of deaths before and after handwashing from the
monthly
data, and store the result as a 2x2 dataframe namedmonthly_summary
with the first column containing thehandwashing_started
groups and the second column having the mean proportion of deaths.
Required Libraries:
- tidyverse
Author
- Name: Mohammed Mebarek Mecheter
- Email: mohammedmecheter@gmail.com
- GitHub: Mohammed Mebarek Mecheter
Feel free to contact me for any questions or additional information about this project.