Question 1
Please submit an R Markdown report (knitted to Word format) covering the following. Use the attached Iris dataset: iris_exams.csv. Provide at least the following in the report for full credit:
(1) Understanding the Data:
- The structure of the data and a preview of the data.
- Frequency distribution: frequency tables and plots (bar plots or histograms) for each variable in the dataset. Make sure to capture the skewness and kurtosis. Provide an interpretation in one paragraph (no more than 300 words) explaining the distribution of the data.
- Summary statistics of the data, including at least the mean, quartiles, min/max, and standard deviation.
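One possible starting point for the items above, sketched in base R. It assumes iris_exams.csv has the same columns as R's built-in iris data and falls back to that built-in data when the file is not in the working directory. The helper names `skewness` and `excess_kurtosis` are illustrative only; they use the simple moment-based formulas without bias correction, so packages such as e1071 or moments may report slightly different values.

```r
# Load the exam file if present; otherwise use the built-in iris data
# (assumption: the CSV has the same layout as the built-in dataset).
iris_df <- if (file.exists("iris_exams.csv")) {
  read.csv("iris_exams.csv", stringsAsFactors = TRUE)
} else {
  iris
}

str(iris_df)   # structure of the data
head(iris_df)  # preview of the first rows

# Moment-based skewness and excess kurtosis (illustrative helpers)
skewness <- function(x) {
  x <- x[!is.na(x)]
  m <- mean(x)
  mean((x - m)^3) / mean((x - m)^2)^1.5
}
excess_kurtosis <- function(x) {
  x <- x[!is.na(x)]
  m <- mean(x)
  mean((x - m)^4) / mean((x - m)^2)^2 - 3
}

is_num <- sapply(iris_df, is.numeric)

# Frequency table plus histogram (numeric) or bar plot (categorical)
for (col in names(iris_df)) {
  x <- iris_df[[col]]
  if (is_num[[col]]) {
    print(table(cut(x, breaks = pretty(x), include.lowest = TRUE)))
    hist(x, main = paste("Histogram of", col), xlab = col)
  } else {
    print(table(x))
    barplot(table(x), main = paste("Frequency of", col),
            xlab = col, ylab = "Count")
  }
}

# Summary statistics: summary() gives min, quartiles, mean, and max;
# standard deviation, skewness, and kurtosis are added separately.
summary(iris_df)
stats <- sapply(iris_df[is_num], function(x) {
  c(sd = sd(x, na.rm = TRUE),
    skewness = skewness(x),
    excess_kurtosis = excess_kurtosis(x))
})
print(round(stats, 3))
```

Placed in an R Markdown chunk, this prints the tables and draws the plots in the knitted document; the written interpretation still has to be added by hand.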
Question 2
Using the mtcars dataset, demonstrate the skills you have learned so far in class and submit an R Markdown report (Word document) including the following:
- Develop a hypothesis
  - What is your hypothesis?
  - Which columns are IVs?
  - Which columns are DVs?
  - Which columns are ignorable (and why)?
- Check for errors and missing data
- Clean the data
  - How did you deal with NAs?
  - How did you deal with outliers?
- Check the assumptions for parametric tests
  - Additivity
  - Linearity
  - Normality
  - Homogeneity and homoscedasticity
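The screening steps above could be sketched as follows. This is only one way to approach them, not a required method. The hypothesis (that `mpg` depends on `wt` and `hp`), the Mahalanobis cutoff at p < .001, and the name `cars_df` are all assumptions made for illustration; substitute whatever your class has used.

```r
data(mtcars)

# Hypothetical choice: DV = mpg, IVs = wt and hp; other columns are ignored
cars_df <- mtcars[, c("mpg", "wt", "hp")]

# Errors and missing data
summary(cars_df)            # look for impossible values
colSums(is.na(cars_df))     # count NAs per column
cars_df <- na.omit(cars_df) # mtcars ships without NAs, so nothing is dropped

# Outliers: Mahalanobis distance with a chi-square cutoff (assumed p < .001)
mahal  <- mahalanobis(cars_df, colMeans(cars_df), cov(cars_df))
cutoff <- qchisq(1 - 0.001, df = ncol(cars_df))
clean  <- cars_df[mahal < cutoff, ]

# Additivity: very high correlations between IVs would signal a problem
print(cor(clean[, c("wt", "hp")]))

# Fit the model and extract standardized values for the remaining checks
model     <- lm(mpg ~ wt + hp, data = clean)
std_resid <- rstandard(model)
std_fit   <- as.numeric(scale(fitted(model)))

# Linearity: Q-Q plot of standardized residuals against a reference line
qqnorm(std_resid)
abline(0, 1)

# Normality: histogram of residuals plus a Shapiro-Wilk test
hist(std_resid, main = "Standardized residuals", xlab = "Residual")
print(shapiro.test(std_resid))

# Homogeneity and homoscedasticity: residuals vs. fitted values should
# form an even band around zero with no funnel shape
plot(std_fit, std_resid, xlab = "Standardized fitted values",
     ylab = "Standardized residuals")
abline(h = 0)
abline(v = 0)
```

Each plot still needs a sentence or two in the report saying what it shows and whether the assumption appears to hold.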
Question 3
Create a bar graph using the attached Iris dataset: iris_exams.csv. Compare the Sepal Length across the flower Species. Include the following:
- Main Title
- X and Y-Axis Labels
- Colors by Species
- Provide an interpretation in one paragraph (no more than 300 words) explaining the distribution of the data.
Which Species has the greatest Sepal.Length?
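A minimal base R sketch of such a bar graph, assuming iris_exams.csv has `Sepal.Length` and `Species` columns like R's built-in iris data (the code falls back to the built-in data if the file is not found). Comparing the species by their mean sepal length is a choice made here for illustration; the title, labels, and colors are placeholders.

```r
# Load the exam file if present; otherwise use the built-in iris data
iris_df <- if (file.exists("iris_exams.csv")) {
  read.csv("iris_exams.csv", stringsAsFactors = TRUE)
} else {
  iris
}

# Mean sepal length for each species
means <- tapply(iris_df$Sepal.Length, iris_df$Species, mean, na.rm = TRUE)

# Bar graph with a main title, axis labels, and one color per species
barplot(means,
        main = "Mean Sepal Length by Species",
        xlab = "Species",
        ylab = "Mean Sepal Length",
        col  = rainbow(length(means)))

# Species with the largest mean sepal length
print(names(which.max(means)))
```

The same comparison can be drawn with ggplot2 if that is what the class uses; the base R version is shown only because it needs no extra packages.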