0:00 / 0:00

Confidence Interval for Two Independent Means: Pooled t (Equal Variances)

Our approach to comparing the two means depends if we if can assume the population variances σ1 σ_1\ and σ2σ_2 are equal or not, even if we don't know their values.

If we can assume that the population variances are equal \rightarrow Pooled (Equal Variances) Confidence Interval
σ1=σ2σ_1=σ_2

A 100(1α)%100\left(1-\alpha\right)\% confidence interval for the mean difference μ1μ2\mu_1-\mu_2 of two independent populations with equal variances is:


(x1x2)±(t)(sp)1n1+1n2\boxed{\left(\overline{x}_1-\overline{x}_2\right)\pm (t^{\ast })(s_p)\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}
where,
  • tt^\star is the multiplier or critical value from the t-distribution with a given degrees of freedom.

The pooled standard deviation sp\colorFour {s_p} is:
sp=(n11)s12+(n21)s22n1+n22s_p=\sqrt{\frac{\left(n_1-1\right)s_1^2+\left(n_2-1\right)s_2^2}{n_1+n_2-2}}

PAGE BREAK

Degrees of freedom:
df=n1+n22df=n_1+n_2-2


Wize Concept
If you are not able to assume that σ1 = σ2σ_1\ =\ σ_2, then it is more appropriate to construct an Unpooled Confidence Interval,


PAGE BREAK

Solving for t\colorOne {t^*} :

Example
You want construct a 90% confidence interval and your sample sizes are n1=7 n_1=7\spaceand n2=9n_2=9 .

df=n1+n22df=n_1+n_2-2
df=7+92=14df=7+9-2=14

Use the t-table to find tt^*:


t=1.761 t^*=1.761

0:00 / 0:00

Example: Confidence Interval for Two Means (Pooled t)

17 students that don't have part-time jobs (Group 1) report studying on average 5 hours a day with a standard deviation of 1.40 hours. 10 other students that have part-time jobs (Group 2) report studying on average 4 hours a day with a standard deviation of 2.05 hours.

Assume both populations are normal and the population variances are equal.

(a) Compute a 95% confidence interval for the mean difference in hours of studying.

xˉ1 =5, xˉ2=4x̄_1\ =5,\ x̄_2=4
s1=1.4 , s2=2.05s_1=1.4\ ,\ s_2=2.05
n1=17 , n2=10n_1=17\ ,\ n_2=10

Find the degrees of freedom and then find the critical value tt^*:
df=n1+n22=17+102=25df=n_1+n_2-2=17+10-2=25

PAGE BREAK


Using the t-table, we get t=2.060t^* = 2.060

PAGE BREAK

(x1x2)±(t)(sp)1n1+1n2\boxed{\left(\overline{x}_1-\overline{x}_2\right)\pm (t^{\ast })(s_p)\sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}

sp=(n11)s12+(n21)s22n1+n22s_p=\sqrt{\frac{\left(n_1-1\right)s_1^2+\left(n_2-1\right)s_2^2}{n_1+n_2-2}}

sp=(171)(1.4)2+(101)(2.05)217+102=1.6635s_p=\sqrt{\frac{\left(17-1\right)\left(1.4\right)^2+\left(10-1\right)\left(2.05\right)^2}{17+10-2}}=1.6635

(54)±(2.060)(1.6635)117+110\left(5-4\right)\pm\left(2.060\right)\left(1.6635\right)\sqrt{\frac{1}{17}+\frac{1}{10}}

1±1.371\pm 1.37

[0.37 , 2.37]\left[-0.37\ ,\ 2.37\right]

(b) Interpret your results.

We are 95% confident that students without part-time jobs study on average between 0.37 hours less and 2.37 hours more than students that have part-time jobs.

(c) Does the confidence interval contain 0? What does that mean?

The confidence interval contains 0. There is no evidence that the means differ.


Two simple random samples were collected to compare the average spending of Blue Jay fans (Group 1) and Yankee fans (Group 2) at baseball games.



Assume that the two populations have equal variances. Construct a 95% confidence interval.
(i) What is the point estimate?