Confidence Interval [Two-Sample T-Procedure]

The Problem

Example #129: Let us estimate the difference in means between two populations using a confidence interval to indicate our uncertainty. To do this, we collect data from each of the two populations. The data from population 1 are:

36, 38, 61, 49, 37, 32, 40, 39, 28, 24, 35, 49, 32, 44, 29, 33, 38

The data from population 2 are:

44, 45, 26, 47, 41, 31, 29, 45, 30, 51, 29, 42, 0, 49, 55, 41, 40, 53, 44, 56, 31

In addition to these data, we also know one important thing: The data are generated from independent Normal processes; that is, the data in each sample come from independent Normal distributions. With this information, calculate the endpoints of the symmetric 99% confidence interval. Use the pooled variance.

Information given:

To summarize the above, the values of import are:

Summary statistics from the problem
$\bar{x}_1$	=	37.8824
$\bar{x}_2$	=	39.4762

$n_1$	=	17
$n_2$	=	21

$s_1$	=	9.013062
$s_2$	=	12.812568
$s_p$	=	11.28298

$\alpha$	=	0.01

Calculate these values yourself then hover your mouse over the grey spaces to see if you calculated them correctly.

Your Answer

In the box below, please enter the two endpoints of the 99% confidence interval for the population mean, then click on the “Check your answer!” button. Please round your answer to the ten-thousandths place.

99% Confidence Bounds: (, )

Make sure the lower limit is less than the upper limit.

Another Set of Data

Would you like to continue working on this topic? If so, click here for another data set.

Assistance

Show Formula

Hide Formula

Here is the formula you can use to calculate the confidence bounds.

$\begin{align} \text{Both confidence limits:} &= (\bar{x}_1 - \bar{x}_2) \pm t(\alpha/2, \nu)\ s_p\ \sqrt{\ \frac{1}{n_1} + \frac{1}{n_2}\ } \\[3em] \text{Lower confidence limit:} &= (\bar{x}_1 - \bar{x}_2) - t(\alpha/2, \nu)\ s_p\ \sqrt{\ \frac{1}{n_1} + \frac{1}{n_2}\ } \\[1em] \text{Upper confidence limit:} &= (\bar{x}_1 - \bar{x}_2) + t(\alpha/2, \nu)\ s_p\ \sqrt{\ \frac{1}{n_1} + \frac{1}{n_2}\ } \\ \end{align}$

In this formula, there are a few symbols to know:

$t(\alpha/2, \nu)$

the t-value corresponding to the probability $1-\alpha/2$ and number of degrees of freedom $\nu = n_1 + n_2 -2$
Here, $\alpha = 0.01$ , so $\alpha/2 = 0.005$ and $t(\alpha/2, 36) = 2.719$ .

$\bar{x}_1$

the mean of sample 1

$\bar{x}_2$

the mean of sample 2

$s^2_1$

the variance of sample 1

$s^2_2$

the variance of sample 2

$s_p^2$

the pooled variance

$n_1$

the size of sample 1

$n_2$

the size of sample 2

Show Solution

Hide Solution

$\begin{align} \text{Both confidence limits:} &= (\bar{x}_1 - \bar{x}_2) \pm t(\alpha/2, \nu)\ s_p\ \sqrt{\ \frac{1}{n_1} + \frac{1}{n_2}\ } \\[1em] &= (37.8824 - 39.4762) \pm t(0.01/2, 36)\ 11.28298\ \sqrt{\ \frac{1}{17} + \frac{1}{21}\ } \\[1em] &= (-1.5938) \pm 2.719 \times 11.28298\ \sqrt{\ \left( 0.058824\right) + \left(0.047619\right)\ } \\[1em] &= (-1.5938) \pm 2.719 \times 11.28298\ \left( 0.326255\right)\ \\[1em] &= (-1.5938) \pm 10.009 \\[1em] \end{align}$

Thus, we are 99% confident that the mean height for the population is between -11.6028 and 8.4152.

Note that the value following the plus/minus sign ± is known as the margin of error, $E = 10.009$ , and is always half of the width of the confidence interval. The margin of error is affected by the sample size, the level of confidence, and the variability of the population. If n increases, the margin of error shrinks. If the level of confidence increases, the margin of error expands. If the variability in the population increases, the margin of error expands.

Show the R Code

Show the Excel Code

Hide the Excel Code

The z-procedures are sensitive to knowing the population variance. Logic dictates that if we do not know the population mean, then we will not know the population variance. As such, the z-procedures are rarely used now. As such, there is no z-test in the base Excel program.

Copy and paste the following code into your Excel spreadsheet window, making sure the value sample1 ends up in A1 after pasting.

How to calculate the expected value in Excel.
sample1	sample2
36	44	s1:	=STDEV.S(A:A)
38	45	s2:	=STDEV.S(B:B)
61	26	sp:	=SQRT(((COUNT(A:A)-1)D2^2+(COUNT(B:B)-1)D3^2)/(COUNT(A:A)+COUNT(B:B)-2))
49	47
37	41	lower:	=AVERAGE(A:A)-AVERAGE(B:B)-ABS(T.INV((1-0.99)/2,COUNT(A:A)+COUNT(B:B)-2))D4SQRT(1/COUNT(A:A)+1/COUNT(B:B))
32	31	upper:	=AVERAGE(A:A)-AVERAGE(B:B)+ABS(T.INV((1-0.99)/2,COUNT(A:A)+COUNT(B:B)-2))D4SQRT(1/COUNT(A:A)+1/COUNT(B:B))
40	29
39	45
28	30
24	51
35	29
49	42
32	0
44	49
29	55
33	41
38	40
	53
	44
	56
	31

The endpoints of the 99% confidence interval are given in cells D6 and D7. Again, when you paste this code into Excel, make sure that you start the pasting in cell A1. To help with that, you may want to also copy this notice. It seems to help.

© Ole J. Forsberg, Ph.D. 2025. All rights reserved.		.

Calculating the Confidence Interval

The Problem

Information given:

Your Answer

Another Set of Data

Assistance