“Exploring Data Analysis and Probability Distributions with Excel and R”

This is a graduate-level course, so it requires independent research skills. I am here to guide you, but for some of the steps you will need to work out the procedure yourself.
Should you have questions, feel free to ask. 
1. Open the Excel file attached. 
2. Save each worksheet as a separate CSV file.
3. Read the CSV files into R using RStudio.
4. Using Excel:
For Worksheet 1: Compute the average (mean) and the median of the data. Create a histogram and a box plot (box-and-whisker plot). Using the box plot, identify the 25th and 75th percentiles.
For Worksheet 2: Prepare a scatter diagram of the two variables. Using the statistical toolkit, find the following for each column: mean, standard error, median, mode, standard deviation, sample variance, kurtosis, skewness, range, minimum, maximum, sum, and count.
5. Repeat Step 4 using R instead of Excel. 
6. Using Excel (Excel's built-in formulas or your own typed formula), compute the values of a binomial distribution (probability mass function and cumulative probabilities) where the number of successes x varies from 0 to n, the number of attempts is n = 10, and the probability of success at every attempt is p = 0.2. Prepare a graph showing both sets of values.
7. Using Excel, compute the values of a hypergeometric distribution (probability mass function and cumulative distribution) where the number of successes x varies from 0 to n, the number sampled is n = 3, the lot size is N = 40, and the total number of successes in the lot is X = 4. Prepare a graph showing both the probability mass function and the cumulative distribution.
8. Compare the results of Step 7 with a binomial approximation that uses equivalent parameters.
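
As a starting point for Steps 3 and 5, here is a minimal R sketch of reading one exported CSV file and computing the descriptive statistics listed in Step 4. It is only one possible approach. The file name, the column names x and y, and the data values are placeholders invented for illustration, so substitute your own exported files. The helper functions stat_mode, excel_skew, excel_kurt, and describe are hypothetical names defined here; they are not part of base R. The skewness and kurtosis helpers use the small-sample corrections of Excel's SKEW and KURT functions so that the two tools can be compared directly.

```r
# Placeholder data standing in for one worksheet exported to CSV.
# The file name, the column names x and y, and the values are hypothetical.
csv_path <- file.path(tempdir(), "worksheet2.csv")
write.csv(data.frame(x = c(2, 4, 4, 5, 7, 9, 10, 12),
                     y = c(1, 3, 3, 5, 6, 8, 9, 11)),
          csv_path, row.names = FALSE)

# Step 3: read the CSV file into a data frame.
dat <- read.csv(csv_path)

# Most frequent value; returns the smallest one when several values tie.
stat_mode <- function(v) {
  counts <- table(v)
  as.numeric(names(counts)[which.max(counts)])
}

# Sample skewness with the small-sample correction used by Excel's SKEW.
excel_skew <- function(v) {
  n <- length(v)
  z <- (v - mean(v)) / sd(v)
  n / ((n - 1) * (n - 2)) * sum(z^3)
}

# Sample excess kurtosis with the correction used by Excel's KURT.
excel_kurt <- function(v) {
  n <- length(v)
  z <- (v - mean(v)) / sd(v)
  n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * sum(z^4) -
    3 * (n - 1)^2 / ((n - 2) * (n - 3))
}

# One column of descriptive statistics, in the order listed in Step 4.
describe <- function(v) {
  c(mean      = mean(v),
    std_error = sd(v) / sqrt(length(v)),
    median    = median(v),
    mode      = stat_mode(v),
    std_dev   = sd(v),
    variance  = var(v),
    kurtosis  = excel_kurt(v),
    skewness  = excel_skew(v),
    range     = max(v) - min(v),
    minimum   = min(v),
    maximum   = max(v),
    sum       = sum(v),
    count     = length(v))
}

# Step 5: apply the summary to every column and print the table.
summary_table <- sapply(dat, describe)
print(round(summary_table, 4))

# 25th and 75th percentiles of one column.
quartiles <- quantile(dat$x, probs = c(0.25, 0.75))
print(quartiles)

# Histogram, box plot, and scatter diagram.
hist(dat$x, main = "Histogram of x", xlab = "x")
boxplot(dat$x, horizontal = TRUE, main = "Box plot of x")
plot(dat$x, dat$y, main = "Scatter diagram", xlab = "x", ylab = "y")
```

R's quantile function defaults to the same interpolation rule as Excel's QUARTILE.INC, while the hinges drawn by boxplot can differ slightly from those values on small samples, so small discrepancies between the two tools are worth checking rather than assuming an error.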

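The distribution tables in Steps 6 to 8 are to be built in Excel, but a short R sketch can serve as an independent check of the numbers. The sketch below uses R's built-in dbinom, pbinom, dhyper, and phyper functions. For Step 8 it assumes that "equivalent parameters" means a binomial with the same sample size n = 3 and success probability p = X / N = 4 / 40 = 0.1; that reading is an assumption, so confirm it before relying on the comparison. All variable names are illustrative.

```r
# Step 6: binomial distribution with n = 10 attempts and p = 0.2.
n_binom <- 10
p_binom <- 0.2
x_binom <- 0:n_binom
binom_tab <- data.frame(
  x   = x_binom,
  pmf = dbinom(x_binom, size = n_binom, prob = p_binom),
  cdf = pbinom(x_binom, size = n_binom, prob = p_binom)
)
print(round(binom_tab, 6))

# Step 7: hypergeometric distribution with lot size N = 40,
# X = 4 successes in the lot, and a sample of n = 3.
lot_size      <- 40
lot_successes <- 4
n_sample      <- 3
x_hyper <- 0:n_sample
hyper_tab <- data.frame(
  x   = x_hyper,
  pmf = dhyper(x_hyper, m = lot_successes,
               n = lot_size - lot_successes, k = n_sample),
  cdf = phyper(x_hyper, m = lot_successes,
               n = lot_size - lot_successes, k = n_sample)
)

# Step 8 (assumed interpretation): binomial approximation with
# the same sample size and p = X / N.
p_approx <- lot_successes / lot_size
hyper_tab$binom_pmf    <- dbinom(x_hyper, size = n_sample, prob = p_approx)
hyper_tab$binom_cdf    <- pbinom(x_hyper, size = n_sample, prob = p_approx)
hyper_tab$abs_diff_pmf <- abs(hyper_tab$pmf - hyper_tab$binom_pmf)
print(round(hyper_tab, 6))

# Graph showing both the probability mass function (vertical bars)
# and the cumulative distribution (step line) for the binomial case.
plot(binom_tab$x, binom_tab$cdf, type = "s", ylim = c(0, 1),
     xlab = "Number of successes x", ylab = "Probability",
     main = "Binomial distribution, n = 10, p = 0.2")
points(binom_tab$x, binom_tab$pmf, type = "h", lwd = 3)
legend("right", legend = c("Cumulative", "Mass function"),
       lty = 1, lwd = c(1, 3))
```

In Excel, the corresponding built-in functions are BINOM.DIST and HYPGEOM.DIST, each of which takes a final TRUE or FALSE argument to switch between cumulative and point probabilities. Because the sample of 3 is small relative to the lot of 40, the binomial values should be close to, but not identical with, the hypergeometric ones.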