Assignment Deliverable
Complete the following on the Data tab of the Pastas R Us data file: The file is attached.
1) Calculate “Annual Sales” for each restaurant. Annual Sales is the result of multiplying a restaurant’s “SqFt.” by “Sales/SqFt.” The first value has been provided for you.
2) Calculate the mean, standard deviation, skew, 5-number summary, and interquartile range (IQR) for each of the variables. The formulas and the first results have been provided for you.
3) Create a boxplot (sometimes referred to as a box and whisker chart) for the “Annual Sales” variable.
4) Create a histogram for the “Sales/SqFt” variable.
Respond to the following questions on the Questions tab of the Pastas R Us data file:
1) Does the annual sales boxplot look symmetric?
2) Would you prefer the IQR instead of the standard deviation to describe the dispersion of the annual sales variable? If so, why?
3) Does the histogram show that the sales per square foot distribution is symmetric?
4) If the sales per square foot distribution is not symmetric, what is the skew?
5) If there are any outliers, which one(s)? What is the “SqFt” area of the outlier(s)?
6) Is the outlier(s) smaller or larger than the average restaurant in the data? What can you conclude from this observation?
7) What measure of central tendency may be more appropriate to describe “Sales/SqFt”? Why?
Category: Statistics
-
“Analyzing Restaurant Sales Data for Pastas R Us”
-
Title: “Predicting Medical Charges using Linear Regression: An Analysis of Age, BMI, Sex, Smoker Status, and Region”
Please create linear regression model to predict charges based on age,BMI,SEX,Smoker status, and region
Medical Cost Personal Datasets (kaggle.com) -
Title: “Analyzing the Frequency and Variability of Income Levels in a Sample Population”
Follow instructions on files attached, cannot use same variables as the sample paper. Any questions let me know! Please use 1. Frequency table to include measures of central tendencies (mean, median, mode) and 2. Frequency table to include measures of variability ( interquartile range, variance, standard deviation, range) if possible, and use another one to seem like more research was done.
-
Project Part 1: Systematic Sample of “COVID-19 Vaccination Rates by State” Data Set
You will be working on the Semester Project throughout the term in parts as Project Part Assignments. Additional information can be found on the Semester Project Information page. You will get feedback from your instructor on the parts of the project in the Project Parts as listed. Use that feedback to improve that portion of the project.
Project Part 1: Systematic Sample of your chosen Data Set
1) Choose 1 Data Set from the Data Sets for Project Parts and the Semester Project page.
2) Create a Systematic Sample with 35 values.
Use your birth month as the starting value in Row 1, then use your birthdate as your nth value. An example with additional details is the Data Sets.
3) Write at least 2 quality sentences explaining which Data Set you used, what your starting number was, your nth value, and how you did it, so that any other person would be able to obtain the same results.
4) List your 35 values in the order they were collected.
When working on each part of the Semester Project the Best Practice is to type the information onto the appropriate slide of the Template Download Template, remove the directions, and then copy and paste your work and results into the text submission area of the assignment.
You may submit the project part as a text submission, Word Document, or PowerPoint Slide from the Template (only that slide!) Do not submit your work as an embedded image in one of those files. Images cannot be accepted for these assignments. (File Types allowed: .doc,.docx, ppt., pptx)
my birth month 06 birth day 08 -
“Excel Mastery: Analyzing Data and Making Informed Decisions”
Please read the questions carefully. You will be required to use Microsoft Excel to complete the assignments. The grades will be based on your ability to calculate the correct answers, the methodology employed, and the interpretation of the results.
-
“Exploring Heart Rate Data with Excel Graphs” Title: Exploring Heart Rate Data with Excel Graphs Variable 1: Gender (Qualitative) Graph type: Pie chart Excel graph: Insert > Pie Chart Variable 2
Open the Heart Rate Data Set in Excel
Using the classification of variables from the Unit 1 assignment as
qualitative, quantitative discrete, or quantitative continuous, match
each of the 3 variables to the most appropriate graph type. (For
example, qualitative data can best be displayed with a pie chart or bar
graph; continuous numerical data can best be displayed using a
histogram)
Use the graphing functions in Excel to create an appropriate graph
of the data for each variable. Remember to properly label and title your
graphs to identify what the graph is about clearly. -
“Understanding Descriptive Statistics and Correlation in Research: A Case Study on Cannabis Use and Patient Views on Kidney Disease”
What is the standard deviation (s) of the following set of scores?
12
25
6
9
16
13
11
10
8
7
6
14
16
12
11
23
Group of answer choices
7
5.49
5
3.36
What is the range of the following set of scores?
13.7
53.2
4.1
9.3
52.1
32.5
22.9
41.5
23.0
15.5
1.9
33.2
Group of answer choices
39.6
50.2
51.3
9
What is the variance (s2) of the following set of scores?
12
25
6
9
16
13
11
10
8
7
6
14
16
12
11
23
Canadian Adults with kidney disease were selected to participate in a survey regarding their views on cannabis use. The survey asked participants to rank on a scale of 1-5 (1, definitely would not; 5, definitely would) whether they would try cannabis for various symptoms.
Collister, D., Herrington, G., Delgado, L., & Whitlock, R. (2023). Patient views regarding cannabis use in chronic kidney disease and kidney failure: a survey study. Nephrology Dialysis Transplantation, 38(4), 922–931. https://doi-org.ezproxy1.lib.asu.edu/10.1093/ndt/gfac226
What category/scale of measurement is this?
Group of answer choices
ratio
ordinal
nominal
interval
Which of the following are nominal data? (choose one or more)
Group of answer choices
two categories of exposure to a treatment (exposed and unexposed)
three categories (low, medium, high) of heart rate ranges
five categories of race
four categories of satisfaction (low, somewhat satisfied, satisfied, very satisfied)
three streets (Dove, Raven, Hawk) in a neighborhood
Given the following data, what is the correlation between income and education?
Income
Education
$36,577
11
$54,365
12
$33,542
10
$65,654
16
$45,765
11
$24,354
7
$43,233
12
$44,321
12
$23,216
9
$43,454
12
$64,543
14
$43,433
13
$34,644
12
$33,213
10
$55,654
15
$76,545
16
$21,324
10
$17,645
10
$23,432
9
$44,543
15
Group of answer choices
.87
.75
.90
.42
Researchers study the relationship between hours spent playing video games and GPA. They find that as the number of hours spent playing video games increases, GPA decreases. What type of relationship is this?
Group of answer choices
No relationship
Positive relationship
Negative relationship
In this scatterplot:
x axis = scale of income inequality
y axis = patents per million population
Which of statement(s) is(are) TRUE? (choose one or more)
Group of answer choices
This scatterplot shows an indirect relationship
Countries with greater income inequality are less innovative (have less patents per million population)
Countries with greater income inequality have greater innovation (have more patents per million population
This scatterplot shows a direct relationship
Which of the following correlations would be interpreted as a strong relationship based upon our textbook? (choose one or more)
Group of answer choices
.50
.70
.60
.80
What is the possible range of values for a correlation coefficient?
Group of answer choices
0 to 100
–.01 to .01
–1.0 to 1.0
0 to 1.0
Given the following data, what is the correlation between age and length of sentence?
Respondent
Age (x)
Length of Sentence (months)
1
14
80
2
15
65
3
15
155
4
20
192
Average
16
123
Group of answer choices
.65
.79
-.58
-.87
A perfect negative correlation would be represented by Pearson’s r of -1.
Group of answer choices
True
False -
Title: Levels of Data and Types of Variables in Statistics: Exploring the Concepts and Visualizing Data
In this lesson’s assignment, you will complete a problem set in which you address levels of data and types of variables. Answers to the problems must be complete and written in formal narrative language. In addition, you will write a short essay related to data privacy. You will also explore the different types of graphs used to visualize data. Results from both Excel and SPSS should be copied and pasted into a Word document for submission.
Explain the concept of a random variable. Explain what it means to say, “Variables must vary.” Why is the concept of variables important for learning statistics?
List and define the four levels of measurement (using examples) discussed in this lesson’s introduction and resources. In your opinion, which one or more is the most appropriate for statistical analysis? Explain.
Compare and contrast the characteristics of continuous and discrete variables. What is a common challenge of trying to calculate statistics using discrete variables?
Identify example variables from your professional and personal life at each level of measurement. Explain why you selected the level you did for each, relying on this lesson’s resources for support.
Identify at least 4 (two of each) discrete and continuous variables from your own professional or personal life and explain why you selected the category you did for each, relying on this lesson’s resources for support.
Use the provided datasets for building one of each of the four chart types below. For each chart, select a variable from the provided dataset with a measurement level that is best visualized by that chart type. Use APA style to label each chart. Each graph must contain a narrative description of what it represents and an interpretation of the image. Use this narrative and the graph to tell a story with your data.
Pie chart
Bar chart
Scatterplot
Histogram
Length: 7 to 10 pages not including title page or reference page
References: Include a minimum of 4 scholarly resources (This is only a minimum requirement. You should strive to include more than the minimum in all doctoral research). Be sure to reference Excel and SPSS as they are resources for this assignment, although not scholarly. -
“Exploring the Foundations of Organizational Behavior: A Comprehensive Analysis”
I have uploaded the Rubric/Guidelines, Module Overview, Reading and Resources, and chapters from the textbook.
-
“Exploring the Relationship Between Property Size and Selling Price in the Real Estate Market: A Regional Analysis”
You have been recently hired as a junior analyst by D.M. Pan Real Estate Company. The sales team has tasked you with preparing a report that examines the relationship between the selling price of properties and their size in square feet. You have been provided with a Real Estate Data Spreadsheet spreadsheet that includes properties sold nationwide in recent years. The team has asked you to select a region, complete an initial analysis, and provide the report to the team.
Note: In the report you prepare for the sales team, the response variable (y) should be the listing price and the predictor variable (x) should be the square feet.
Specifically you must address the following rubric criteria, using the Module Two Assignment Template:
Generate a Representative Sample of the Data
Select a region and generate a simple random sample of 30 from the data.
Report the mean, median, and standard deviation of the listing price and the square foot variables.
Analyze Your Sample
Discuss how the regional sample created is or is not reflective of the national market.
Compare and contrast your sample with the population using the National Summary Statistics and Graphs Real Estate Data PDF document.
Explain how you have made sure that the sample is random.
Explain your methods to get a truly random sample.
Generate Scatterplot
Create a scatterplot of the x and y variables noted above. Include a trend line and the regression equation. Label the axes.
Observe patterns
Answer the following questions based on the scatterplot:
Define x and y. Which variable is useful for making predictions?
Is there an association between x and y? Describe the association you see in the scatter plot.
What do you see as the shape (linear or nonlinear)?
If you had a 1,800 square foot house, based on the regression equation in the graph, what price would you choose to list at?
Do you see any potential outliers in the scatterplot?
Why do you think the outliers appeared in the scatterplot you generated?
What do they represent?
https://learn.snhu.edu/d2l/le/content/1612807/viewContent/33022711/View