Category: Statistics

  • “Analyzing Restaurant Sales Data for Pastas R Us”

    Assignment Deliverable
    Complete the following on the Data tab of the Pastas R Us data file: The file is attached.
    1) Calculate “Annual Sales” for each restaurant. Annual Sales is the result of multiplying a restaurant’s “SqFt.” by “Sales/SqFt.” The first value has been provided for you.
    2) Calculate the mean, standard deviation, skew, 5-number summary, and interquartile range (IQR) for each of the variables. The formulas and the first results have been provided for you.
    3) Create a boxplot (sometimes referred to as a box and whisker chart) for the “Annual Sales” variable.
    4) Create a histogram for the “Sales/SqFt” variable.
    Respond to the following questions on the Questions tab of the Pastas R Us data file:
    1) Does the annual sales boxplot look symmetric?
    2) Would you prefer the IQR instead of the standard deviation to describe the dispersion of the annual sales variable? If so, why?
    3) Does the histogram show that the sales per square foot distribution is symmetric?
    4) If the sales per square foot distribution is not symmetric, what is the skew?
    5) If there are any outliers, which one(s)? What is the “SqFt” area of the outlier(s)?
    6) Is the outlier(s) smaller or larger than the average restaurant in the data? What can you conclude from this observation?
    7) What measure of central tendency may be more appropriate to describe “Sales/SqFt”? Why?

  • Title: “Predicting Medical Charges using Linear Regression: An Analysis of Age, BMI, Sex, Smoker Status, and Region”

    Please create linear regression model to predict charges based on age,BMI,SEX,Smoker status, and region
    Medical Cost Personal Datasets (kaggle.com)

  • Title: “Analyzing the Frequency and Variability of Income Levels in a Sample Population”

    Follow instructions on files attached, cannot use same variables as the sample paper. Any questions let me know! Please use 1. Frequency table to include measures of central tendencies (mean, median, mode) and 2. Frequency table to include measures of variability ( interquartile range, variance, standard deviation, range) if possible, and use another one to seem like more research was done.

  • Project Part 1: Systematic Sample of “COVID-19 Vaccination Rates by State” Data Set

    You will be working on the Semester Project throughout the term in parts as Project Part Assignments. Additional information can be found on the Semester Project Information page. You will get feedback from your instructor on the parts of the project in the Project Parts as listed. Use that feedback to improve that portion of the project.
    Project Part 1: Systematic Sample of your chosen Data Set
    1) Choose 1 Data Set from the Data Sets for Project Parts and the Semester Project page.
    2) Create a Systematic Sample with 35 values.
    Use your birth month as the starting value in Row 1, then use your birthdate as your nth value. An example with additional details is the Data Sets.
    3) Write at least 2 quality sentences explaining which Data Set you used, what your starting number was, your nth value, and how you did it, so that any other person would be able to obtain the same results.
    4) List your 35 values in the order they were collected.
    When working on each part of the Semester Project the Best Practice is to type the information onto the appropriate slide of the Template Download Template, remove the directions, and then copy and paste your work and results into the text submission area of the assignment.
    You may submit the project part as a text submission, Word Document, or PowerPoint Slide from the Template (only that slide!) Do not submit your work as an embedded image in one of those files. Images cannot be accepted for these assignments. (File Types allowed: .doc,.docx, ppt., pptx)
    my birth month 06 birth day 08

  • “Excel Mastery: Analyzing Data and Making Informed Decisions”

    Please read the questions carefully. You will be required to use Microsoft Excel to complete the assignments. The grades will be based on your ability to calculate the correct answers, the methodology employed, and the interpretation of the results. 

  • “Exploring Heart Rate Data with Excel Graphs” Title: Exploring Heart Rate Data with Excel Graphs Variable 1: Gender (Qualitative) Graph type: Pie chart Excel graph: Insert > Pie Chart Variable 2

    Open the Heart Rate Data Set in Excel
    Using the classification of variables from the Unit 1 assignment as
    qualitative, quantitative discrete, or quantitative continuous, match
    each of the 3 variables to the most appropriate graph type. (For
    example, qualitative data can best be displayed with a pie chart or bar
    graph; continuous numerical data can best be displayed using a
    histogram)
    Use the graphing functions in Excel to create an appropriate graph
    of the data for each variable. Remember to properly label and title your
    graphs to identify what the graph is about clearly.

  • “Understanding Descriptive Statistics and Correlation in Research: A Case Study on Cannabis Use and Patient Views on Kidney Disease”

    What is the standard deviation (s) of the following set of scores?
    12
    25
    6
    9
    16
    13
    11
    10
    8
    7
    6
    14
    16
    12
    11
    23
    Group of answer choices
    7
    5.49
    5
    3.36
    What is the range of the following set of scores?
    13.7
    53.2
    4.1
    9.3
    52.1
    32.5
    22.9
    41.5
    23.0
    15.5
    1.9
    33.2
    Group of answer choices
    39.6
    50.2
    51.3
    9
    What is the variance (s2) of the following set of scores?
    12
    25
    6
    9
    16
    13
    11
    10
    8
    7
    6
    14
    16
    12
    11
    23
    Canadian Adults with kidney disease were selected to participate in a survey regarding their views on cannabis use. The survey asked participants to rank on a scale of 1-5 (1, definitely would not; 5, definitely would) whether they would try cannabis for various symptoms.
    Collister, D., Herrington, G., Delgado, L., & Whitlock, R. (2023). Patient views regarding cannabis use in chronic kidney disease and kidney failure: a survey study. Nephrology Dialysis Transplantation, 38(4), 922–931. https://doi-org.ezproxy1.lib.asu.edu/10.1093/ndt/gfac226
    What category/scale of measurement is this?
    Group of answer choices
    ratio
    ordinal
    nominal
    interval
    Which of the following are nominal data? (choose one or more)
    Group of answer choices
    two categories of exposure to a treatment (exposed and unexposed)
    three categories (low, medium, high) of heart rate ranges
    five categories of race
    four categories of satisfaction (low, somewhat satisfied, satisfied, very satisfied)
    three streets (Dove, Raven, Hawk) in a neighborhood
    Given the following data, what is the correlation between income and education?
    Income
    Education
    $36,577
    11
    $54,365
    12
    $33,542
    10
    $65,654
    16
    $45,765
    11
    $24,354
    7
    $43,233
    12
    $44,321
    12
    $23,216
    9
    $43,454
    12
    $64,543
    14
    $43,433
    13
    $34,644
    12
    $33,213
    10
    $55,654
    15
    $76,545
    16
    $21,324
    10
    $17,645
    10
    $23,432
    9
    $44,543
    15
    Group of answer choices
    .87
    .75
    .90
    .42
    Researchers study the relationship between hours spent playing video games and GPA. They find that as the number of hours spent playing video games increases, GPA decreases. What type of relationship is this?
    Group of answer choices
    No relationship
    Positive relationship
    Negative relationship
    In this scatterplot:
    x axis = scale of income inequality
    y axis = patents per million population
    Which of statement(s) is(are) TRUE? (choose one or more)
    Group of answer choices
    This scatterplot shows an indirect relationship
    Countries with greater income inequality are less innovative (have less patents per million population)
    Countries with greater income inequality have greater innovation (have more patents per million population
    This scatterplot shows a direct relationship
    Which of the following correlations would be interpreted as a strong relationship based upon our textbook? (choose one or more)
    Group of answer choices
    .50
    .70
    .60
    .80
    What is the possible range of values for a correlation coefficient?
    Group of answer choices
    0 to 100
    –.01 to .01
    –1.0 to 1.0
    0 to 1.0
    Given the following data, what is the correlation between age and length of sentence?
    Respondent
    Age (x)
    Length of Sentence (months)
    1
    14
    80
    2
    15
    65
    3
    15
    155
    4
    20
    192
    Average
    16
    123
    Group of answer choices
    .65
    .79
    -.58
    -.87
    A perfect negative correlation would be represented by Pearson’s r of -1.
    Group of answer choices
    True
    False

  • Title: Levels of Data and Types of Variables in Statistics: Exploring the Concepts and Visualizing Data

    In this lesson’s assignment, you will complete a problem set in which you address levels of data and types of variables. Answers to the problems must be complete and written in formal narrative language. In addition, you will write a short essay related to data privacy. You will also explore the different types of graphs used to visualize data. Results from both Excel and SPSS should be copied and pasted into a Word document for submission.
    Explain the concept of a random variable.  Explain what it means to say, “Variables must vary.”  Why is the concept of variables important for learning statistics?
    List and define the four levels of measurement (using examples) discussed in this lesson’s introduction and resources. In your opinion, which one or more is the most appropriate for statistical analysis? Explain. 
    Compare and contrast the characteristics of continuous and discrete variables. What is a common challenge of trying to calculate statistics using discrete variables?
    Identify example variables from your professional and personal life at each level of measurement.  Explain why you selected the level you did for each, relying on this lesson’s resources for support.
    Identify at least 4 (two of each) discrete and continuous variables from your own professional or personal life and explain why you selected the category you did for each, relying on this lesson’s resources for support.
    Use the provided datasets for building one of each of the four chart types below.  For each chart, select a variable from the provided dataset with a measurement level that is best visualized by that chart type. Use APA style to label each chart. Each graph must contain a narrative description of what it represents and an interpretation of the image.  Use this narrative and the graph to tell a story with your data.
    Pie chart
    Bar chart
    Scatterplot
    Histogram
    Length: 7 to 10 pages not including title page or reference page 
    References: Include a minimum of 4 scholarly resources (This is only a minimum requirement. You should strive to include more than the minimum in all doctoral research). Be sure to reference Excel and SPSS as they are resources for this assignment, although not scholarly.

  • “Exploring the Foundations of Organizational Behavior: A Comprehensive Analysis”

    I have uploaded the Rubric/Guidelines, Module Overview, Reading and Resources, and chapters from the textbook.

  • “Exploring the Relationship Between Property Size and Selling Price in the Real Estate Market: A Regional Analysis”

    You have been recently hired as a junior analyst by D.M. Pan Real Estate Company. The sales team has tasked you with preparing a report that examines the relationship between the selling price of properties and their size in square feet. You have been provided with a Real Estate Data Spreadsheet spreadsheet that includes properties sold nationwide in recent years. The team has asked you to select a region, complete an initial analysis, and provide the report to the team.
    Note: In the report you prepare for the sales team, the response variable (y) should be the listing price and the predictor variable (x) should be the square feet.
    Specifically you must address the following rubric criteria, using the Module Two Assignment Template:
    Generate a Representative Sample of the Data
    Select a region and generate a simple random sample of 30 from the data.
    Report the mean, median, and standard deviation of the listing price and the square foot variables.
    Analyze Your Sample
    Discuss how the regional sample created is or is not reflective of the national market.
    Compare and contrast your sample with the population using the National Summary Statistics and Graphs Real Estate Data PDF document.
    Explain how you have made sure that the sample is random.
    Explain your methods to get a truly random sample.
    Generate Scatterplot
    Create a scatterplot of the x and y variables noted above. Include a trend line and the regression equation. Label the axes.
    Observe patterns
    Answer the following questions based on the scatterplot:
    Define x and y. Which variable is useful for making predictions?
    Is there an association between x and y? Describe the association you see in the scatter plot.
    What do you see as the shape (linear or nonlinear)?
    If you had a 1,800 square foot house, based on the regression equation in the graph, what price would you choose to list at?
    Do you see any potential outliers in the scatterplot?
    Why do you think the outliers appeared in the scatterplot you generated?
    What do they represent?
    https://learn.snhu.edu/d2l/le/content/1612807/viewContent/33022711/View