Confidence Interval [Two-Sample Proportions Procedures]

The Problem

Example # 422: Estimate the difference in success rates between two populations using a confidence interval to indicate uncertainty. To estimate this difference, we collect data. The data are a series of “Success” and “Failure” values. For sample 1, the data are

“Failure”, “Failure”, “Failure”, “Failure”, “Success”, “Failure”, “Success”, “Success”, “Success”, “Success”, “Success”, “Success”, “Success”, “Success”, “Failure”, “Success”, “Success”, “Success”, “Success”, “Failure”, “Failure”, “Failure”, “Failure”, “Success”, “Failure”, “Success”, “Success”, “Success”, “Failure”, “Success”, “Success”, “Failure”, “Success”, “Success”

For sample 2, the data are

“Success”, “Success”, “Success”, “Success”, “Failure”, “Failure”, “Success”, “Success”, “Failure”, “Failure”, “Success”, “Success”, “Failure”, “Failure”, “Success”, “Success”, “Success”, “Success”, “Failure”, “Failure”, “Success”, “Success”, “Success”, “Success”, “Success”, “Success”, “Failure”

With this information, calculate the endpoints of the symmetric 90% confidence interval.

Information given:

To summarize the above, the values of import are:

Summary statistics from the problem
$x_1$	=	21
$x_2$	=	18

$n_1$	=	34
$n_2$	=	27

$\hat{p}_1$	=	0.6176
$\hat{p}_2$	=	0.6667

$\alpha$	=	0.1

Note that there is no value given for the hypothesized difference p₀. This is because confidence intervals are based solely on the data, and not on any hypothesized values.

It may be helpful if you calculate these values yourself. Once you have, you can check your answers by hovering your mouse over the grey spaces to see if you calculated them correctly.

Your Answer

In the box below, please enter the two endpoints of the 90% confidence interval for the difference in population proportions, then click on the “Check your answer!” button. Please round your answer to the ten-thousandths place.

90% Confidence Bounds: (, )

Make sure the lower limit is less than the upper limit.

Another Set of Data

Would you like to continue working on this topic? If so, click here for another data set.

Assistance

Show Formula

Hide Formula

Here is the formula you can use to calculate the confidence bounds.

$\begin{align} \text{Both confidence limits:} &= \hat{p}_1 - \hat{p}_2 \pm Z(\alpha/2) \sqrt{ \frac{ \hat{p}_1 \left(1 - \hat{p}_1\right) }{n_1} + \frac{ \hat{p}_2 \left(1 - \hat{p}_2\right) }{n_2} } \\[3em] \text{Lower confidence limit:} &= \hat{p}_1 - \hat{p}_2 - Z(\alpha/2) \sqrt{ \frac{ \hat{p}_1 \left(1 - \hat{p}_1\right) }{n_1} + \frac{ \hat{p}_2 \left(1 - \hat{p}_2\right) }{n_2} } \\[1em] \text{Upper confidence limit:} &= \hat{p}_1 - \hat{p}_2 + Z(\alpha/2) \sqrt{ \frac{ \hat{p}_1 \left(1 - \hat{p}_1\right) }{n_1} + \frac{ \hat{p}_2 \left(1 - \hat{p}_2\right) }{n_2} } \\ \end{align}$

In this formula, there are a few symbols to know:

$Z(\alpha/2)$

the z-value corresponding to a right-hand probability of $\alpha/2$

$\hat{p}_1$

the sample proportion for sample 1, $\frac{x_1}{n_1}$

$x_1$

the number of successes in sample 1

$n_1$

the size of sample 1

$\hat{p}_2$

the sample proportion for sample 2, $\frac{x_2}{n_2}$

$x_2$

the number of successes in sample 2

$n_2$

the size of sample 2

Show Solution

Hide Solution

$\begin{align} \text{Confidence Limits} &= \hat{p}_1 - \hat{p}_2 \pm Z(\alpha/2) \sqrt{ \frac{ \hat{p}_1 \left(1 - \hat{p}_1\right) }{n_1} + \frac{ \hat{p}_2 \left(1 - \hat{p}_2\right) }{n_2} } \\[3em] &= 0.6176 - 0.6667 \pm Z(0.1/2) \sqrt{ \frac{ 0.6176 \left(1 - 0.6176\right) }{34} + \frac{ 0.6667 \left(1 - 0.6667 \right) }{27} } \\[1em] &= -0.049 \pm Z(0.05)\ \sqrt{ \frac{ 0.6176 \left(0.3824\right) }{34} + \frac{ 0.6667 \left(0.3333 \right) }{27} } \\[1em] &= -0.049 \pm 1.645\ \sqrt{ \frac{ 0.236159 }{34} + \frac{ 0.222222}{27} } \\[1em] &= -0.049 \pm 1.645\ \sqrt{ 0.006946\ +\ 0.00823 } \\[1em] &= -0.049 \pm 1.645\ \sqrt{ 0.015176 } \\[1em] &= -0.049 \pm 1.645\ \left( 0.123192 \right) \\[1em] &= -0.049 \pm 0.202651 \\[1em] \end{align}$

Thus, we are 90% confident that the difference in success rates between population 1 and population 2 is between -0.2517 and 0.1536.

Note that 0.202651 is the margin of error, which is usually symbolized as E. So, for sample sizes like these, polling companies would (should) report the results as “-4.9% plus or minus 20.3 points.” As you may expect, larger sample sizes will produce smaller margins of error.

Show the R Code

Show the Excel Code

Hide the Excel Code

This formulation of the confidence interval is pedagogically simple to understand. That is why it is used in introductory textbooks. It is actually based on the Normal approximation to the Binomial distribution. There are several improvements to the test. For such reasons, Excel does not have a built-in function to perform these calculations. The following code echoes the above calculations to provide the endpoints of the confidence interval.

Copy and paste the following code into your Excel window, making sure the value sample1 ends up in A1 after pasting.

How to calculate the test statistic in Excel.
sample1	sample2		alpha:	0.1
Failure	Success
Failure	Success		samp 1	samp 2
Failure	Success	x:	=COUNTIF(A:A,"Success")	=COUNTIF(B:B,"Success")
Failure	Success	n:	=COUNTIF(A:A,"Success")+COUNTIF(A:A,"Failure")	=COUNTIF(B:B,"Success")+COUNTIF(B:B,"Failure")
Success	Failure	p-hat:	=D4/D5	=E4/E5
Failure	Failure
Success	Success	lcl:	=(D6-E6)-ABS(NORM.S.INV(E1/2))SQRT(D6(1-D6)/D5+E6*(1-E6)/E5)
Success	Success	ucl:	=(D6-E6)+ABS(NORM.S.INV(E1/2))SQRT(D6(1-D6)/D5+E6*(1-E6)/E5)
Success	Failure
Success	Failure
Success	Success
Success	Success
Success	Failure
Success	Failure
Failure	Success
Success	Success
Success	Success
Success	Success
Success	Failure
Failure	Failure
Failure	Success
Failure	Success
Failure	Success
Success	Success
Failure	Success
Success	Success
Success	Failure
Success
Failure
Success
Success
Failure
Success
Success

The limits of the 90% confidence interval are the numbers calculated in cells D8 and D9. Again, when you paste this code into Excel, make sure that you start the pasting in cell A1. To help with that, you may want to also copy this notice. It seems to help.

© Ole J. Forsberg, Ph.D. 2025. All rights reserved.		.

Calculating the Confidence Interval

The Problem

Information given:

Your Answer

Another Set of Data

Assistance