MATH 140 -Problem Set 4F
This chapter is from Introduction to Statistics for Community College Students, 1st Edition, by Matt Teachout, College of the Canyons, Santa Clarita, CA, USA, and is licensed
under a CC-By Creative Commons Attribution 4.0 International license 10/1/18
Practice Problems Section 4F
(#1-10) Use each of the following categorical association ????2-test statistics and the corresponding critical values to fill out the table.
????2-test stat
Sentence to explain ????2-test statistic.
Critical Value
Does the ????2-test statistic fall in a tail determined by the critical value?
(Yes or No)
Does sample data significantly
disagree with ????0? 1. +1.573 +4.117 2. +6.226 +5.118 3. +2.144 +4.121 4. +3.415 +5.091 5. +13.718 +7.189 6. +0.972 +4.812 7. +31.652 +12.557 8. +11.185 +5.181 9. +25.443 +7.008
10. +1.133 +8.336
(#11-20) Use each of the following P-values and corresponding significance levels to fill out the table.
P-value Proportion
P- value
%
Sentence to explain the
P-value
Significance Level %
Significance level
Proportion
If ????0 is true, could the sample data occur by random
chance or is it unlikely?
Reject ????0 or Fail to reject ????0?
11. 0.263 10% 12. 0.0042 1% 13. 5.22 × 10?4 5% 14. 0.0639 1% 15. 0 5% 16. 0.539 10% 17. 0.0419 5% 18. 0.0027 10% 19. 7.73 × 10?8 1% 20. 0.674 5%
21. If we have two raw categorical data sets, what must we click on in Statcato to perform a categorical association test?
22. If we have summary counts organized in a contingency table, what must we click on in Statcato to perform a categorical association test?
23. What are the assumptions for a categorical association test if the data was collected from on random sample?
24. What are the assumptions for a categorical association test if the data was collected from multiple random samples?
25. How are the expected counts calculated in a categorical association test?
26. If the expected counts from the null hypothesis are significantly different from the observed sample counts, describe the effect on the Chi-Squared test statistic.
27. If the expected counts from the null hypothesis are close to the observed sample counts, describe the effect on the Chi-Squared test statistic.
https://creativecommons.org/licenses/by/4.0/
This chapter is from Introduction to Statistics for Community College Students, 1st Edition, by Matt Teachout, College of the Canyons, Santa Clarita, CA, USA, and is licensed
under a CC-By Creative Commons Attribution 4.0 International license 10/1/18
(#28-31) Directions: For each of the following problems, use the Statcato printout provided to answer the following questions.
a) Write the null and alternative hypothesis. Make sure to label which one is the claim.
b) Check the assumptions for the categorical association test.
c) What is the Chi-squared test statistic? Write a sentence to explain the test statistic.
d) Does the test statistic fall in the tail determined by the critical value?
e) Does the sample data significantly disagree with the null hypothesis? Explain your answer.
f) Are the observed counts significantly different from the expected counts? Explain your answer.
g) What is the P-value? Write a sentence to explain the P-value.
h) Compare the P-value to the significance level. Should we reject the null hypothesis or fail to reject the null hypothesis? Explain your answer.
i) If the null hypothesis was true, could the sample data or more extreme have occurred by sampling variability or is it unlikely to be sampling variability? Explain your answer.
j) Write a conclusion for the test addressing evidence and the claim. Explain your conclusion in non-technical language.
k) Are the categories related or not? Explain your answer.
28. A random sample of male college students were asked their major. Later, a random sample of female college students were asked their major. The goal of the study was to show that gender is not related to major. Use a 5% significance level and the Statcato printout below to answer the questions given above.
https://creativecommons.org/licenses/by/4.0/
This chapter is from Introduction to Statistics for Community College Students, 1st Edition, by Matt Teachout, College of the Canyons, Santa Clarita, CA, USA, and is licensed
under a CC-By Creative Commons Attribution 4.0 International license 10/1/18
29. A random sample of adults were asked their blood type and Rh status. (Blood tests were provided for those that did not know their blood type and Rh status.) The goal of the study was to show that blood type is related to Rh status (dependent). Use a 10% significance level and the Statcato printout below to answer the questions given above.
30. A hospital wanted to determine if the age of a patient is not related to what part of the hospital they were in. They took a random sample of patients that have visited their hospital and determined both their age and the part of the hospital. The ages were broken up into age groups. Use a 1% significance level and the Statcato printout below to answer the questions given above.
https://creativecommons.org/licenses/by/4.0/
This chapter is from Introduction to Statistics for Community College Students, 1st Edition, by Matt Teachout, College of the Canyons, Santa Clarita, CA, USA, and is licensed
under a CC-By Creative Commons Attribution 4.0 International license 10/1/18
31. A random sample of American adults was taken and their health and education status obtained. Test to test the claim that health and education are related. Use a 5% significance level and the Statcato printout below to answer the questions given above.
(#32-35) Directions: Use StatKey at www.lock5stat.com to simulate the following chi-squared categorical association tests. Go to the More Advanced Randomization Tests menu at the bottom of the StatKey page. Click on the button that says, ????2 Test for Association. Click on Edit Data and type in the contingency table provided. Click on Generate 1000 Samples a few times to create the simulated sampling distribution and answer the following questions.
a) Write the null and alternative hypothesis. Make sure to label which one is the claim.
b) Check the assumptions for the categorical association test. Assume the data was collected randomly. Under Original Sample, click on Show Details to see the expected counts.
c) Use the formula df = (r 1)(c 1) to calculate the degrees of freedom. r is the number of rows and c is the number of columns not counting the totals.
d) What is the Chi-squared test statistic? Write a sentence to explain the test statistic.
e) Put the significance level proportion in the right tail proportion to calculate the critical value. What is the critical value? (Answers will vary slightly.) Does the original sample ????2 test statistic fall in the tail determined by the critical value?
f) Does the sample data significantly disagree with the null hypothesis? Explain your answer.
g) Are the observed counts significantly different from the expected counts? Explain your answer.
h) Put the original sample test ????2 test statistic in the bottom box in the simulation to calculate the P-value. What is the P-value? (Answers will vary slightly.) Write a sentence to explain the P-value.
https://creativecommons.org/licenses/by/4.0/
http://www.lock5stat.com/
This chapter is from Introduction to Statistics for Community College Students, 1st Edition, by Matt Teachout, College of the Canyons, Santa Clarita, CA, USA, and is licensed
under a CC-By Creative Commons Attribution 4.0 International license 10/1/18
i) Compare the P-value to the significance level. Should we reject the null hypothesis or fail to reject the null hypothesis? Explain your answer.
j) If the null hypothesis was true, could the sample data or more extreme have occurred by sampling variability or is it unlikely to be sampling variability? Explain your answer.
k) Write a conclusion for the test addressing evidence and the claim. Explain your conclusion in non-technical language.
l) Are the categories related or not? Explain your answer.
32. We want to know if the state a home is built in is related to the size of the home. A random sample of homes in the U.S was taken. Click on Edit Data in StatKey and type in the following contingency table. Do not forget to include a space after the commas. Use a 5% significance level and randomized simulation to test the claim that the state is not related to size of the home.
[blank], CA, NJ, NY, PA
Large, 7, 6, 7, 3
Small, 23, 24, 23, 27
33. Open the Car Data at www.matt-teachout.org. Copy and paste the Country and Cylinders columns next to each other in a new Excel spreadsheet. Then copy the two columns together. Click on Edit Data in StatKey and paste the two columns into StatKey. Use a 1% significance level to test the claim that the country a car is made in is related to the cylinders. Answer the questions above.
34. We want to show that gender is related to getting an award. A random sample of people that won famous awards in the Olympic, Academia, and Nobel was taken and their gender was noted. Click on Edit Data in StatKey and type in the following contingency table. Do not forget to include a space after the commas. Use a 10% significance level and randomized simulation to test the claim that awards are related to gender.
[blank], Olympic, Academy, Nobel
Male, 109, 11, 73
Female, 73, 20, 76
35. Open the Math 140 Fall 2015 Survey Data at www.matt-teachout.org. Copy and paste the Tattoo and Favorite Social Media columns next to each other in a new Excel spreadsheet. Then copy the two columns together. Click on Edit Data in StatKey and paste the two columns into StatKey. Use a 5% significance level to test the claim that having a tattoo or not is not related to social media. Answer the questions above.
——————————————————————————————————————————————————–
https://creativecommons.org/licenses/by/4.0/
http://www.matt-teachout.org/
http://www.matt-teachout.org/
Recent Comments