Competency Statement: Apply the concepts of statistical reasoning, data analysis, modeling, and interpretation.
Final Assessment Title: Statistical Analyst Case Study
Competency Learning Outcomes (CLOs)
1. Analyze and interpret a statistical problem and determine the optimal solution.
2. Calculate and interpret descriptive and inferential statistical data.
3. Create and explain graphs and charts that contain statistical data.
4. Analyze and interpret real-world problems using basic probability theory.
5. Explain hypothesis testing and its use in statistical analyses.
6. Analyze and interpret correlation and regression in a real-world context.
7. Apply knowledge of statistics to a real-world project.
Purpose of the Assessment
The purpose of this assignment is for you to apply the course content on analyzing data, selecting statistical tests, and
interpreting the results to a real-world problem. You will work with a data set in Excel or Google Sheets to calculate statistics and
create charts, and you will write an essay in APA 7 style to present and discuss the results.
Submission Artifacts:
● A 2,000 to 3,000 word paper written from the point of view of a professional statistical analyst who has been hired to analyze a
data set.
● A reference list in APA 7 format containing the works cited in your paper.
For this essay, you will select ONLY ONE of the following data sets.
1) Music and Mental Health Survey Data: For this case, you will write as though you have been hired by the research team that
distributed this survey to analyze the results. The data in this survey includes self-reported information about anxiety,
depression and OCD symptoms and music listening preferences. Additional information about the variables and provenance of
the data is contained in the ‘dataset information’ tab of the spreadsheet. The team has hired you to answer the questions ‘What
genre of music is associated with the highest levels of anxiety? Is BPM preference related to the experience of anxiety?’
2) IMDB Information for US Movies: This dataset contains information about a sample of American-produced movies, including
box office income, genre, number of ratings, and how positive the ratings are. For this case study, you’ve been hired by a movie studio
as a data analyst to answer the questions ‘What kinds of movies are most successful? How do positive reviews relate to box
office income?’
In your essay, you will answer the research question for your case study as though you were a professional statistics consultant
writing a report for their client. The essay will have the following sections:
● Introduction
○ Briefly introduce the topic you’ll be discussing, and the questions you hope to answer.
○ Describe the dataset. How was it gathered? Are there any limitations or issues with this data?
○ Describe the sample size and its relationship to the population of interest.
○ Select and describe the variables you will be working with in this paper. You will need to select three variables, including
at least two continuous and one categorical variable. You will use these variables to answer the question you’ve stated
above, so choose carefully after re-reading the instructions for your data set.
■ What is the level of measurement for each variable?
■ Is this a dependent or independent variable?
● Descriptive Statistics
○ Calculate statistics related to measures of central tendency and measures of dispersion for your numeric variables. For
your categorical variable, describe the proportion of the group in each category.
■ Calculate these statistics using Excel or Google Sheets. You will be required to submit your work.
○ Discuss the meaning of these statistics for understanding the data.
■ Remember that you’re writing as a statistical consultant explaining the results to your client.
■ What does the relative location of mean and median tell you?
■ What do the standard deviation, range, and IQR tell you about the dispersion of the data?
■ Are there any outliers?
■ Is the data normally distributed? Why does this matter?
■ Include at least one graphical display for each variable which communicates the data’s distribution.
● The charts should be clearly labeled and chosen so that they’re appropriate for the type of variable.
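
The descriptive statistics above must be calculated in Excel or Google Sheets, but a short script can serve as a cross-check on spreadsheet results. The following is a minimal Python sketch and is not part of the required submission. The function names (describe, proportions) and all sample values are invented for illustration and do not come from either data set. The 1.5 × IQR rule for flagging outliers is a common convention, assumed here rather than required by this assignment, and quartile conventions differ between tools, so small differences from spreadsheet output are possible.

```python
import statistics
from collections import Counter


def describe(values):
    """Summarize central tendency, dispersion, and 1.5*IQR outliers."""
    # Quartiles using the "inclusive" method (min and max count as data points).
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "median": median,
        "stdev": statistics.stdev(values),  # sample standard deviation (n - 1)
        "range": max(values) - min(values),
        "iqr": iqr,
        "outliers": [v for v in values if v < low_fence or v > high_fence],
    }


def proportions(labels):
    """Return the share of observations falling in each category."""
    counts = Counter(labels)
    total = len(labels)
    return {label: count / total for label, count in counts.items()}


# Invented sample values, for illustration only.
sample_bpm = [60, 80, 90, 100, 110, 120, 120, 130, 140, 220]
sample_genre = ["Rock", "Pop", "Rock", "Jazz"]

summary = describe(sample_bpm)
shares = proportions(sample_genre)
print(summary)
print(shares)
```

In this invented sample the mean (117) sits above the median (115) because a single large value (220) pulls it upward, and that same value is the only one flagged by the 1.5 × IQR rule. This is the kind of observation the questions above ask you to explain to your client.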
● Inferential Statistics
○ Calculate confidence intervals for the proportion in each category of your categorical variable.
■ Explain the results to your clients, so they can understand how their sample relates to the population.
○ Translate your questions into two formal hypotheses to test.
■ One hypothesis should require you to compare the mean of a numerical variable across categories.
■ One hypothesis should ask about correlation or regression.
○ For each hypothesis:
■ Select the appropriate statistical test.
■ Identify your alpha level.
■ Identify whether this is a one-tailed or two-tailed test.
■ Calculate your test statistics.
● You will use Excel or Google Sheets for your calculations. You will be required to submit these with your
work.
■ Discuss your results. Can we reject the null hypothesis?
● In plain language, tell your clients what this means for the answer to their research questions.
■ Include one chart for each hypothesis test. For your correlational or regression test, include a scatter plot.
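
The inferential calculations above must also be done in Excel or Google Sheets. As a cross-check only, the sketch below shows one possible way to compute three of the quantities involved: a normal-approximation confidence interval for a proportion, a Welch two-sample t statistic with its approximate degrees of freedom, and a Pearson correlation coefficient. The function names and all numbers are invented for illustration. The choice of the normal approximation and of Welch's test is an assumption, not something the assignment prescribes, and the two-sample test only covers a comparison between two categories (comparing more than two groups would call for a different test, such as ANOVA). The sketch stops at the test statistic; the p-value would still come from your spreadsheet.

```python
import math
from statistics import NormalDist, mean, variance


def proportion_ci(successes, n, confidence=0.95):
    """Normal-approximation confidence interval for a sample proportion."""
    p_hat = successes / n
    # Two-sided critical value, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin


def welch_t(group_a, group_b):
    """Welch's t statistic and approximate degrees of freedom for two groups."""
    va = variance(group_a) / len(group_a)  # squared standard error, group A
    vb = variance(group_b) / len(group_b)  # squared standard error, group B
    t = (mean(group_a) - mean(group_b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (
        va ** 2 / (len(group_a) - 1) + vb ** 2 / (len(group_b) - 1)
    )
    return t, df


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


# Invented numbers, for illustration only.
low, high = proportion_ci(successes=40, n=100)
t_stat, dof = welch_t([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
r = pearson_r([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(f"95% CI for proportion: ({low:.3f}, {high:.3f})")
print(f"Welch t = {t_stat:.3f}, df = {dof:.2f}")
print(f"Pearson r = {r:.3f}")
```

If the values from a script like this disagree with your spreadsheet, check first whether the two tools are using the same variance assumption and the same definition of the sample standard deviation.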
● Conclusions
○ Conclude by summarizing your findings for your clients.
■ What is the answer to their research question?
Submission 2: Excel or Google Sheets
● You will include the Excel or Google Sheets file you used to calculate your results with your submission.
File Logistics:
● Submit your final paper (Word, Google Doc, or PDF) following APA 7 style.
● The essay must be 2,000 to 3,000 words in length.