Skip to content

JacqueLeeMeyer/Matplotlib-Pymaceuticals

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 

Repository files navigation

Matplotlib-Pymaceuticals

Module 5 Challenge

This challenge/assignment was using data from drug trials on mice.

After cleaning the data I created a dataframe for Summary Statistics.

image

I then created the following graphs:

Bar graph of Total timepoints for each Drug

image

Pie chart of # of Male vs Female Mice in the study

image

Box and Whisker plot showing any outliers for the Drugs - Capomulin, Ramicane, Infubinol, and Ceftamin

image

Line graph of the Tumor Volume over the 40 days of Mouse y793 (Capomulin)

image

Scatter plot of the Average Tumor Volume vs Weights of all mice in the Capomulin Drug Trial

image

Then calculated correlation and linear regression and added it to copy of the scatter plot

image

Note on Pie Charts:

I used .unique as opposed to the .value-counts because I felt that using the 248 unique Mouse IDs to get the percentages of Males vs Females made more sense than counting all 1880 timepoints with a male/female attribute. This caused my percentages to be a little different than the example (which I am assuming used the .value-counts method).

Sources I used:

https://saturncloud.io/blog/python-pandas-conditionally-delete-rows/#:~:text=We%20can%20aslo%20use%20the,met%2C%20similar%20to%20boolean%20indexing.
    To drop Mouse ID g989 (Input 5)

https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.aggregate.html
    To use aggregation method to create summary statistics (Input 8)

https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.isin.html
    TO create DataFrame with only the values in a certain column (Input 13)

About

Module 5 Challenge

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published