How are the smoking habits of students related to their parents' smoking? Here is a two-way table from a survey of student s in eight Arizona high sch

Clifland 2021-03-04 Answered
How are the smoking habits of students related to their parents' smoking? Here is a two-way table from a survey of student s in eight Arizona high schools:
\(\begin{array}{c|c}&\text{Student smokes}&\text{Student does not smoke}&\text{Total}\\\hline\text{Both parents smoke}&400&1380&400+1380=1780\\\hline\text{One parent smokes}&416&1823&416+1823=2239\\\hline\text{Neither parent smokes}&188&1168&188+1168=1356\\\hline\text{Total}&400+416+188=1004&1380+1823+1168=4371&1004+4371=5375\end{array}\)
(a) Write the null and alternative hypotheses for the question of interest.
(b) Find the expected cell counts. Write a sentence that explains in simple language what "expected counts" are.
(c) Find the chi-square statistic, its degrees of freedom, and the P-value.
(d) What is your conclusion about significance?

Want to know more about Two-way tables?

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

Solve your problem for the price of one coffee

  • Available 24/7
  • Math expert for every subject
  • Pay only if we can solve it
Ask Question

Expert Answer

saiyansruleA
Answered 2021-03-05 Author has 14325 answers

Let us assume:
\(\alpha=0.05=5\%\)
(a) The null hypothesis states that there is no association between the variables, while the alternative hypothesis states that there is an association between the variables.
\(H_0:\) There is no association between student smoking habit and parent smoking habit
\(H_{\alpha}:\) There is no association between student smoking habit and parent smoking habit
(b) Determine the row and column totals of the given table:
\(\begin{array}{c|c}&\text{Student smokes}&\text{Student does not smoke}&\text{Total}\\\hline\text{Both parents smoke}&400&1380&400+1380=1780\\\hline\text{One parent smokes}&416&1823&416+1823=2239\\\hline\text{Neither parent smokes}&188&1168&188+1168=1356\\\hline\text{Total}&400+416+188=1004&1380+1823+1168=4371&1004+4371=5375\end{array}\)
The expected frequencies E are the product of the column and row total, divided by the table total.
\(E_{11}=\frac{r_1\times c_1}{n}=\frac{1780\times 1004}{5375}\approx332.4874\)
\(E_{12}=\frac{r_1\times c_2}{n}=\frac{1780\times4371}{5375}\approx1447.5126\)
\(E_{21}=\frac{r_2\times c_1}{n}=\frac{2239\times1004}{5375}\approx418.2244\)
\(E_{22}=\frac{r_2\times c_2}{n}=\frac{2239\times4371}{5375}\approx1820.7756\)
\(E_{31}=\frac{r_3\times c_1}{n}=\frac{1356\times1004}{5375}\approx253.2882\)
\(E_{32}=\frac{r_3\times c_2}{n}=\frac{1356\times4371}{5375}\approx1102.7118\)
Expected counts are the counts that we expect based on the row and column totals, when there is no association between the variables.
(c) The chi-square subtotals are the squared differences between the observed abd expected frequencies, divivded by the expected frequency.
The value of the test-statistic is then the sum of the chi-square subtotals:
\(X^2=\sum\frac{(O-E)^2}{E}\)
\(=\frac{(400-322.4874)^2}{332.4874}+\frac{(1380-1447.5126)^2}{1447.5126}+\frac{(416-418.2244)^2}{418.2244}+\frac{(1823-1820.7756)^2}{1820.7756}+\frac{(188-253.2882)^2}{253.2882}+\frac{1168-1102.7119)^2}{1102.7118}\)
The degrees of freedom is the product od the number of row and the number of columns, both decreased by 1.
\(df=(r-1)(c-1)=(3-1)(2-1)=2\)
The P-value is the probability of obtaining the value of the test statistic, or a value more extreme. The P-value is the number (or interval) in the column title of the chi-square distribution table in the appendix containing the \(X^2\) -value in the row \(df=2:\)
\(P<0.001\)
(d) If the P-value is less than or equal to the significance level, then the null hypothesis is rejected:
\(P<0.05\Rightarrow\text{Reject }H_0\)
There is sufficient evidence to support the claim that there is an association between student smoking habit and parent smoking habit.
Result: (a)
\(H_0:\) There is no association between smoking habit and parent smoking habit.
\(H_{\alpha}:\) There is an association between student smoking habit and parent smoking habit.
(b) 332.4874, 1447.5126, 418.2244, 1820.7756, 253.2882, 1102.7118
Expected counts are the counts that we expect based on the row and column totals, when there is no association between the variables.
(c) \(X^2=37.5664\), degrees of freedom, P<0.01
(d) There is sufficient evidence to support the claim that there is an association between student smoking habit and parent smoking habit.

Not exactly what you’re looking for?
Ask My Question
44
 

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

Relevant Questions

asked 2021-01-17
A new thermostat has been engineered for the frozen food cases in large supermarkets. Both the old and new thermostats hold temperatures at an average of \(25^{\circ}F\). However, it is hoped that the new thermostat might be more dependable in the sense that it will hold temperatures closer to \(25^{\circ}F\). One frozen food case was equipped with the new thermostat, and a random sample of 21 temperature readings gave a sample variance of 5.1. Another similar frozen food case was equipped with the old thermostat, and a random sample of 19 temperature readings gave a sample variance of 12.8. Test the claim that the population variance of the old thermostat temperature readings is larger than that for the new thermostat. Use a \(5\%\) level of significance. How could your test conclusion relate to the question regarding the dependability of the temperature readings? (Let population 1 refer to data from the old thermostat.)
(a) What is the level of significance?
State the null and alternate hypotheses.
\(H0:?_{1}^{2}=?_{2}^{2},H1:?_{1}^{2}>?_{2}^{2}H0:?_{1}^{2}=?_{2}^{2},H1:?_{1}^{2}\neq?_{2}^{2}H0:?_{1}^{2}=?_{2}^{2},H1:?_{1}^{2}?_{2}^{2},H1:?_{1}^{2}=?_{2}^{2}\)
(b) Find the value of the sample F statistic. (Round your answer to two decimal places.)
What are the degrees of freedom?
\(df_{N} = ?\)
\(df_{D} = ?\)
What assumptions are you making about the original distribution?
The populations follow independent normal distributions. We have random samples from each population.The populations follow dependent normal distributions. We have random samples from each population.The populations follow independent normal distributions.The populations follow independent chi-square distributions. We have random samples from each population.
(c) Find or estimate the P-value of the sample test statistic. (Round your answer to four decimal places.)
(d) Based on your answers in parts (a) to (c), will you reject or fail to reject the null hypothesis?
At the ? = 0.05 level, we fail to reject the null hypothesis and conclude the data are not statistically significant.At the ? = 0.05 level, we fail to reject the null hypothesis and conclude the data are statistically significant. At the ? = 0.05 level, we reject the null hypothesis and conclude the data are not statistically significant.At the ? = 0.05 level, we reject the null hypothesis and conclude the data are statistically significant.
(e) Interpret your conclusion in the context of the application.
Reject the null hypothesis, there is sufficient evidence that the population variance is larger in the old thermostat temperature readings.Fail to reject the null hypothesis, there is sufficient evidence that the population variance is larger in the old thermostat temperature readings. Fail to reject the null hypothesis, there is insufficient evidence that the population variance is larger in the old thermostat temperature readings.Reject the null hypothesis, there is insufficient evidence that the population variance is larger in the old thermostat temperature readings.
asked 2021-09-06
Tell whether the expression is TRUE or FALSE.
“In a two-way table, if the expected counts are about the same as the observed counts, we fail to reject the null hypothesis”.
asked 2021-01-19
The following is a two-way table showing preferences for an award (A, B, C) by gender for the students sampled in survey. Test whether the data indicate there is some association between gender and preferred award.
\(\begin{array}{|c|c|c|}\hline &\text{A}&\text{B}&\text{C}&\text{Total}\\\hline \text{Female} &20&76&73&169\\ \hline \text{Male}&11&73&109&193 \\ \hline \text{Total}&31&149&182&360 \\ \hline \end{array}\\\)
Chi-square statistic=?
p-value=?
Conclusion: (reject or do not reject \(H_0\))
Does the test indicate an association between gender and preferred award? (yes/no)
asked 2020-12-14
Find the expected count and the contribution to the chi-square statistic for the (Group 1, Yes) cell in the two-way table below.
\(\begin{array}{|c|c|c|}\hline&\text{Yes}&\text{No}&\text{Total}\\\hline\text{Group 1} &710 & 277 & 987\\ \hline\text{Group 2}& 1175 & 323&1498\\\hline \ \text{Total}&1885&600&2485 \\ \hline \end{array}\)
Round your answer for the excepted count to one decimal place, and your answer for the contribution to the chi-square statistic to three decimal places.
Expected count=?
contribution to the chi-square statistic=?
asked 2020-11-26
Find the expected count and the contribution to the chi-square statistic for the (Control, Disagree) cell in the two-way table below.
\(\begin{array}{|c|c|c|}\hline&\text{Strongly Agree}&\text{Agree}&\text{Neutral}&\text{Disagree}&\text{Strongly Disagree}\\\hline\text{Control} &38&47&2&12&11\\ \hline \text{Treatment}&60&45&9&4&2 \\ \hline \end{array}\\\)
Round your answer for the excepted count to one decimal place, and your answer for the contribution to the chi-square statistic to three decimal places.
Expected count ?
Contribution to the chi-square statistic ?
asked 2021-06-13
1. Who seems to have more variability in their shoe sizes, men or women?
a) Men
b) Women
c) Neither group show variability
d) Flag this Question
2. In general, why use the estimate of \(n-1\) rather than n in the computation of the standard deviation and variance?
a) The estimate n-1 is better because it is used for calculating the population variance and standard deviation
b) The estimate n-1 is never used to calculate the sample variance and standard deviation
c) \(n-1\) provides an unbiased estimate of the population and allows more variability when using a sample and gives a better mathematical estimate of the population
d) The estimate n-1 is better because it is use for calculation of both the population and sample variance as well as standard deviation.
\(\begin{array}{|c|c|}\hline \text{Shoe Size (in cm)} & \text{Gender (M of F)} \\ \hline 25.7 & M \\ \hline 25.4 & F \\ \hline 23.8 & F \\ \hline 25.4 & F \\ \hline 26.7 & M \\ \hline 23.8 & F \\ \hline 25.4 & F \\ \hline 25.4 & F \\ \hline 25.7 & M \\ \hline 25.7 & F \\ \hline 23.5 & F \\ \hline 23.1 & F \\ \hline 26 & M \\ \hline 23.5 & F \\ \hline 26.7 & F \\ \hline 26 & M \\ \hline 23.1 & F \\ \hline 25.1 & F \\ \hline 27 & M \\ \hline 25.4 & F \\ \hline 23.5 & F \\ \hline 23.8 & F \\ \hline 27 & M \\ \hline 25.7 & F \\ \hline \end{array}\)
\(\begin{array}{|c|c|}\hline \text{Shoe Size (in cm)} & \text{Gender (M of F)} \\ \hline 27.6 & M \\ \hline 26.9 & F \\ \hline 26 & F \\ \hline 28.4 & M \\ \hline 23.5 & F \\ \hline 27 & F \\ \hline 25.1 & F \\ \hline 28.4 & M \\ \hline 23.1 & F \\ \hline 23.8 & F \\ \hline 26 & F \\ \hline 25.4 & M \\ \hline 23.8 & F \\ \hline 24.8 & M \\ \hline 25.1 & F \\ \hline 24.8 & F \\ \hline 26 & M \\ \hline 25.4 & F \\ \hline 26 & M \\ \hline 27 & M \\ \hline 25.7 & F \\ \hline 27 & M \\ \hline 23.5 & F \\ \hline 29 & F \\ \hline \end{array}\)
asked 2021-01-02
Find the expected count and the contribution to the chi-square statistic for the (Control, Disagree) cell in the two-way table below. \(\begin{array}{|c|c|c|}\hline&\text{Strongly Agree}&\text{Agree}&\text{Neutral}&\text{Disagree}&\text{Strongly Disagree}\\\hline\text{Control} &38&47&2&12&11\\ \hline \text{Treatment}&60&45&9&4&2 \\ \hline \end{array}\\\)
Round your answer for the excepted count to one decimal place, and your answer for the contribution to the chi-square statistic to three decimal places.
Expected count ?
Contribution to the chi-square statistic ?
...