0:00 / 0:00

Hypothesis Test for Two Independent Means: Pooled t-test for Equal Variances

Our approach to comparing the two means depends if we if can assume the population variances σ12 σ_1^2\ and σ22σ_2^2 are equal or not, even if we don't know their values.


If we can assume that the population variances are equal \rightarrow Pooled t-test (Equal Variances t-test)
σ1=σ2σ_1=σ_2

t=x1x2sp1n1+1n2\boxed{t=\frac{\overline{x}_1-\overline{x}_2}{s_p\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}}
Where the pooled standard deviation is:

sp=(n11)s12+(n21)s22n1+n22s_p=\sqrt{\frac{\left(n_1-1\right)s_1^2+\left(n_2-1\right)s_2^2}{n_1+n_2-2}}

Degrees of freedom:
df=n1+n22df=n_1+n_2-2


Wize Concept
If you cannot assume that σ1 = σ2σ_1\ =\ σ_2, then it is more appropriate to run an Unpooled t-test.

0:00 / 0:00

Example: Hypothesis Test for Differences in Population Means (Pooled t-test)

A company sells educational materials that should improve reading ability in elementary school pupils. A consultant arranges for 21 grade 3 students to use the educational materials for 8 weeks. A control classroom of 23 students follows the same curriculum without the company’s educational materials.

Using the pooled/equal variances method, test the hypothesis that the mean reading scores for the treatment group is higher than that of the control group. The populations are independent and the significance level is 5%. The following shows the results of reading ability scores:
  • xˉ1 =51.48, xˉ2=41.52x̄_1\ =51.48,\ x̄_2=41.52
  • s1=11.01 , s2=17.15s_1=11.01\ ,\ s_2=17.15
  • n1=21 , n2=23n_1=21\ ,\ n_2=23
(a) State the hypotheses

Based on the keyword "higher", this is a one-sided test.

Ho: μ1μ2=0H_o:\ \mu_1-\mu_2=0
Ha: μ1μ2>0H_a:\ \mu_1-\mu_2>0


PAGE BREAK
(b) Calculate the Pooled/Equal Variances test statistic and its degrees of freedom:

t=x1x2sp1n1+1n2\boxed{t=\frac{\overline{x}_1-\overline{x}_2}{s_p\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}}

sp=(n11)s12+(n21)s22n1+n22s_p=\sqrt{\frac{\left(n_1-1\right)s_1^2+\left(n_2-1\right)s_2^2}{n_1+n_2-2}}

sp=(211)(11.01)2 +(231)(17.15)221+232=14.5529s_p=\sqrt{\frac{\left(21-1\right)\left(11.01\right)^{2\ }+\left(23-1\right)\left(17.15\right)^2}{21+23-2}}=14.5529

Plug sps_p into the tt formula:
t=51.4841.52(14.5529)121+123=2.27t=\frac{51.48-41.52}{\left(14.5529\right)\sqrt{\frac{1}{21}+\frac{1}{23}}}=2.27

Degrees of freedom:
df=n1+n22df=n_1+n_2-2

df=n1+n22 =21+232=42df=n_1+n_2-2\ =21+23-2=42


PAGE BREAK
(c) Find the range of p-values for each method. (Use the closest available dfdf provided in the table.)


Pooled variance (t=2.27, Method #1 df=42 \rightarrow40) *More conservative to round down the df.*
  • From t-table, 0.01<0.01< p-value <0.025<0.025
PAGE BREAK
(d) Draw your conclusion. Are reading scores for the treatment group is higher than that of the control group?

Pooled Method
  • pvalue<0.05 =>pvalue αpvalue<0.05\ =>pvalue\ \leα therefore we reject HoH_o.
  • At the 5% level of significance, we have enough evidence to reject HoH_o.
  • The mean reading score of the treatment group is significantly greater than the mean reading score of the control group.
There are 10 students in Professor Bean's statistics class and 10 students in Professor Dutta's class. Professor Bean's students (Group 1) were not timed during their exam while Professor Dutta's students (Group 2) were timed. Here are the results:



Assumptions:
  • Both populations are normal
  • The population variances are assumed to be equal
At the 1% level of significance, we want to determine if timed tests reduce grades.


(i) State the hypotheses. (Hint: read the question carefully.)