Skip to content

This project explores tumor volume trends in mice treated with different drug regimens for squamous cell carcinoma. Using Python, we clean the dataset, perform basic statistical analysis, and create visualizations to summarize findings.

Notifications You must be signed in to change notification settings

kittychew/Pymaceuticals-challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Pymaceuticals Inc.

Investigating the Effectiveness of Cancer Treatments in Mice

Overview

Pymaceuticals Inc. is a pharmaceutical company developing treatments for squamous cell carcinoma. This project analyzes tumor volume measurements from mice treated with different drug regimens to determine the effectiveness of various treatments. Using Python and data visualization techniques, we clean the dataset, perform statistical analysis, and visualize key findings.

Analysis

  • The dataset includes tumor volume measurements from mice treated with different drug regimens. The analysis focuses on summarizing key statistical properties, identifying potential data anomalies, and visualizing treatment effectiveness.

  • Data cleaning revealed duplicate mouse IDs with repeated timepoints, which were removed to ensure data integrity. The final dataset contains 248 unique mice after deduplication.

  • Summary statistics of tumor volume for each drug regimen indicate that Capomulin and Ramicane had the lowest average tumor volumes, suggesting potential effectiveness in reducing tumor size.

  • Bar and pie charts visualize the number of observations per drug regimen and the gender distribution of the mice. The study had a nearly even split between male and female mice.

  • A box plot analysis of tumor volumes for four key drug regimens revealed potential outliers, particularly in the Infubinol and Ceftamin groups, highlighting variability in response to treatment.

  • A line plot of tumor volume over time for an individual mouse treated with Capomulin demonstrates a steady decline, suggesting that the drug may be effective.

  • A scatter plot of mouse weight versus average tumor volume for the Capomulin regimen shows a strong positive correlation, which was quantified using Pearson's correlation coefficient.

  • A linear regression model further supports the correlation, with the regression line demonstrating a significant relationship between mouse weight and tumor volume.

Conclusion: The analysis suggests that Capomulin and Ramicane are the most promising treatments, with lower average tumor volumes and a consistent reduction in tumor size over time. However, further testing and statistical validation are necessary to confirm these findings.

Visualizations

Screenshot-2025-01-30-at-11-11-45-PM.png

Screenshot-2025-01-30-at-11-16-39-PM.png

Screenshot-2025-01-30-at-11-17-11-PM.png

Screenshot-2025-01-30-at-11-17-44-PM.png

Screenshot-2025-01-30-at-11-18-58-PM.png

Skills Used

  • Data Cleaning: Handling missing data, removing duplicates, and ensuring data integrity.
  • Exploratory Data Analysis (EDA): Generating summary statistics, identifying anomalies, and creating visualizations.
  • Statistical Analysis: Using statistical tests to assess data and establish correlations (e.g., Pearson's correlation).
  • Data Visualization: Creating effective plots (e.g., bar charts, pie charts, box plots, scatter plots, line plots) using Matplotlib.
  • Regression Analysis: Building linear regression models to quantify relationships in the data.
  • Hypothesis Testing: Conducting tests and analyzing data to validate the effectiveness of treatments.

Technologies Used

  • Python
  • Pandas
  • Matplotlib
  • scipy.stats
  • Jupyter Notebook

Resources

  • Chat GPT

About

This project explores tumor volume trends in mice treated with different drug regimens for squamous cell carcinoma. Using Python, we clean the dataset, perform basic statistical analysis, and create visualizations to summarize findings.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published