Imagine that you have been hired as a data analyst for the health department. While cancer studies continue to try to solve the medical mysteries of cancer occurrences and death, the health department would like to see if there is a relationship between the death rates of cancer and various demographic qualities in counties across the United States. For this report, you will analyze the relationship between poverty rates and cancer-related death occurrence rates from a supplied data set of over 90% of counties in the United States. You will then use two methods to test a claim that the mean poverty rate is 16%.

Your calculations will be executed in Microsoft Excel. You will submit your Excel file for your instructor's reference. You will also submit a written report as a Word file. This written report will contain not only your analysis, but also images of supporting graphs, charts, and tables from your Excel file. Follow the instructions below to complete calculations and create visual representations of data within your Excel data file and complete the corresponding sections of your written report. Begin by reading through the directions for this assignment in full.

In your Excel file

1. Randomly select 200 counties from the DATA tab to use for your sample. Be sure to follow the process outlined in the video guide for this project to ensure that your sample is randomly selected.
2. Highlight all information for your random sample, including your rand() column, click Ctrl-C, then place it into cell A1 of the CALCULATIONS tab by selecting the Calculations tab, then A1, and clicking Ctrl-V. Ignore the original data tab sheet once you create your sample. Work only with your sampled data.
3. Create a histogram for each poverty level and death from cancer.
4. Calculate the mean and the five-number summary for each variable.

In your report

5. Beginning on the title page, replace bracketed text with your information and work. The brackets themselves should also be removed from your final report.
6. Under the "Introduction" header, introduce the purpose of your report. Replace the bracketed text and brackets with your work.
7. Under the "Histograms and Summary Statistics" header, paste your histograms as images.
8. Create a table of summary statistics for each of your two variables of interest.
9. Write a paragraph explaining the shape, center, and spread for each of these two variables.

In your Excel file

10. Create a scatter plot that displays the association between poverty levels and death from cancer. Label each axis of your scatterplot and give it an appropriate title.
11. Add a linear regression line to your scatterplot.
12. Display the regression equation and coefficient of determination (r²) on your chart. Arrange and increase font as needed so that these are easily visible to your reader.
13. Calculate your correlation coefficient.

In your report

14. Under the "Scatterplot with Regression" header, paste your scatterplot that also has your regression line, regression equation, and r² displayed.
15. State your independent and dependent variable.
16. State your regression equation.
17. State your slope and intercept; interpret the intercept in the context of this scenario.
18. State your correlation coefficient and explain what this tells you about your data.
19. State your coefficient of determination and explain what this tells you about your data and your equation.
20. Discuss whether you see any potential outliers and explain your reasoning.
21. If the poverty rate were to increase by 10%, say from 30% to 40%, by how much does the death rate change?
22. For what range of poverty rates is this regression equation best used?
    a. Give an example of interpolation stating the input and output for the regression equation.
    b. Give an example of extrapolation stating the input and output for the regression equation.
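
The assignment asks for these calculations in Excel, but the same arithmetic can be cross-checked with a short script. What follows is only a rough Python sketch under stated assumptions: the data are synthetic stand-ins generated on the spot (not the supplied county data), poverty rate is treated as the independent variable, and the helper names `five_number_summary`, `linear_fit`, and `predict` are made up for illustration. The quartiles use Python's "inclusive" method, which should line up with Excel's QUARTILE.INC, though that is worth verifying against your own spreadsheet.

```python
import random
import statistics


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using inclusive quartiles."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)


def linear_fit(x, y):
    """Return least-squares slope, intercept, and correlation coefficient r."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / (sxx * syy) ** 0.5
    return slope, intercept, r


# ASSUMPTION: made-up stand-in for the DATA tab, as
# (poverty rate in %, cancer deaths per 100,000) pairs.
# These values are synthetic and say nothing about real counties.
random.seed(1)
population = []
for _ in range(3000):
    poverty = random.uniform(5, 35)
    deaths = 150 + 1.5 * poverty + random.gauss(0, 20)
    population.append((poverty, deaths))

# Step 1: simple random sample of 200 counties
# (the job the rand() column and sort do in Excel).
sample = random.sample(population, 200)
poverty_rates = [row[0] for row in sample]
death_rates = [row[1] for row in sample]

# Step 4: mean and five-number summary for each variable.
for name, values in (("poverty", poverty_rates), ("death", death_rates)):
    print(name, "mean:", round(statistics.fmean(values), 2))
    print(name, "five-number summary:", five_number_summary(values))

# Steps 11-13: regression line, correlation coefficient, and r squared.
slope, intercept, r = linear_fit(poverty_rates, death_rates)
print("slope:", slope, "intercept:", intercept)
print("r:", r, "r squared:", r ** 2)


def predict(poverty_rate):
    """Predicted death rate from the fitted line."""
    return intercept + slope * poverty_rate


# Item 21: a rise of 10 percentage points changes the prediction by 10 * slope.
print("change from 30% to 40%:", predict(40) - predict(30))

# Item 22: inputs inside the sampled range are interpolation;
# inputs outside it are extrapolation and less trustworthy.
low, high = min(poverty_rates), max(poverty_rates)
print("sampled poverty range:", low, "to", high)
print("interpolation example:", predict((low + high) / 2))
print("extrapolation example:", predict(high + 20))
```

Because the fitted line is linear, the answer to item 21 does not depend on the starting value: moving from 30% to 40% gives the same predicted change as any other 10-point step, namely ten times the slope. Whether 30% or 40% falls inside your own sample's range decides if that prediction is interpolation or extrapolation.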