Confidence Interval [Two-Sample Proportions Procedures]

The Problem

Example # 23: Estimate the difference in success rates between two populations using a confidence interval to indicate uncertainty. To estimate this difference, we collect data. The data are a series of “Success” and “Failure” values. For sample 1, the data are

“Success”, “Success”, “Success”, “Failure”, “Failure”, “Success”, “Success”, “Failure”, “Success”, “Success”, “Success”, “Failure”, “Success”, “Failure”, “Success”, “Failure”, “Failure”, “Failure”, “Success”, “Success”, “Success”, “Success”, “Failure”, “Failure”

For sample 2, the data are

“Success”, “Failure”, “Failure”, “Success”, “Success”, “Success”, “Failure”, “Success”, “Failure”, “Failure”, “Failure”, “Success”, “Success”, “Failure”, “Failure”, “Failure”, “Success”, “Success”, “Failure”, “Success”, “Failure”, “Failure”, “Failure”, “Success”, “Success”, “Failure”, “Success”, “Failure”, “Failure”, “Failure”

With this information, calculate the endpoints of the symmetric 90% confidence interval.

Information given:

To summarize the above, the values of import are:

Summary statistics from the problem
\( x_1 \)	=	14
\( x_2 \)	=	13

\( n_1 \)	=	24
\( n_2 \)	=	30

\( \hat{p}_1 \)	=	0.5833
\( \hat{p}_2 \)	=	0.4333

\( \alpha \)	=	0.1
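
The summary values above can be reproduced directly from the raw data. The following is a minimal Python sketch, not part of the original exercise; the names `sample1`, `sample2`, and `summarize` are hypothetical and chosen only for illustration.

```python
# Data copied from the problem statement; S and F are shorthand labels.
S, F = "Success", "Failure"
sample1 = [S, S, S, F, F, S, S, F, S, S, S, F,
           S, F, S, F, F, F, S, S, S, S, F, F]
sample2 = [S, F, F, S, S, S, F, S, F, F, F, S, S, F, F,
           F, S, S, F, S, F, F, F, S, S, F, S, F, F, F]


def summarize(sample):
    """Return (x, n, p_hat): number of successes, sample size, sample proportion."""
    x = sample.count("Success")
    n = len(sample)
    return x, n, x / n


x1, n1, p_hat1 = summarize(sample1)
x2, n2, p_hat2 = summarize(sample2)
alpha = 0.10  # corresponds to the 90% confidence level

print(x1, n1, round(p_hat1, 4))
print(x2, n2, round(p_hat2, 4))
```

Counting with `list.count` mirrors the `COUNTIF` approach used in the Excel version of this calculation.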

Note that there is no value given for the hypothesized difference p₀. This is because confidence intervals are based solely on the data, and not on any hypothesized values.

It may be helpful to calculate these values yourself first, and then check your results against the summary statistics listed above.

Your Answer

In the box below, please enter the two endpoints of the 90% confidence interval for the difference in population proportions, then click on the “Check your answer!” button. Please round your answer to the ten-thousandths place.

90% Confidence Bounds: (, )

Make sure the lower limit is less than the upper limit.

Assistance

Excel Code

This formulation of the confidence interval is pedagogically simple, which is why introductory textbooks use it. It is based on the Normal approximation to the Binomial distribution, and several improved versions of the interval exist. For these reasons, Excel does not have a built-in function to perform these calculations. The following code carries out the calculation step by step to provide the endpoints of the confidence interval.
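
For reference, the interval computed by the spreadsheet formulas in this section can be written as follows. This is read off the Excel cells rather than quoted from the page's own formula panel, and the symbol \( z_{\alpha/2} \) is notation assumed here for the standard Normal quantile that `NORM.S.INV(alpha/2)` returns.

\[
(\hat{p}_1 - \hat{p}_2) \;\pm\; \left| z_{\alpha/2} \right| \sqrt{ \frac{\hat{p}_1 (1 - \hat{p}_1)}{n_1} + \frac{\hat{p}_2 (1 - \hat{p}_2)}{n_2} }
\]

With \( \alpha = 0.1 \), the quantile \( z_{\alpha/2} \) is approximately \(-1.645\), so its absolute value is approximately \(1.645\).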

Copy and paste the following code into your Excel window, making sure the value sample1 ends up in A1 after pasting.

How to calculate the confidence interval in Excel.
sample1	sample2		alpha:	0.1
Success	Success
Success	Failure		samp 1	samp 2
Success	Failure	x:	=COUNTIF(A:A,"Success")	=COUNTIF(B:B,"Success")
Failure	Success	n:	=COUNTIF(A:A,"Success")+COUNTIF(A:A,"Failure")	=COUNTIF(B:B,"Success")+COUNTIF(B:B,"Failure")
Failure	Success	p-hat:	=D4/D5	=E4/E5
Success	Success
Success	Failure	lcl:	=(D6-E6)-ABS(NORM.S.INV(E1/2))*SQRT(D6*(1-D6)/D5+E6*(1-E6)/E5)
Failure	Success	ucl:	=(D6-E6)+ABS(NORM.S.INV(E1/2))*SQRT(D6*(1-D6)/D5+E6*(1-E6)/E5)
Success	Failure
Success	Failure
Success	Failure
Failure	Success
Success	Success
Failure	Failure
Success	Failure
Failure	Failure
Failure	Success
Failure	Success
Success	Failure
Success	Success
Success	Failure
Success	Failure
Failure	Failure
Failure	Success
	Success
	Failure
	Success
	Failure
	Failure
	Failure

The limits of the 90% confidence interval are the numbers calculated in cells D8 and D9. Again, when you paste this code into Excel, make sure that you start the pasting in cell A1. To help with that, you may want to also copy this notice. It seems to help.

© Ole J. Forsberg, Ph.D. 2024. All rights reserved.
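
For readers who prefer a script to a spreadsheet, the same calculation can be sketched in Python. This is an illustrative translation of the Excel cells, not code from the original page; the function name `wald_interval` is hypothetical, and `statistics.NormalDist` from the standard library (Python 3.8 or later) stands in for `NORM.S.INV`.

```python
from math import sqrt
from statistics import NormalDist


def wald_interval(x1, n1, x2, n2, alpha):
    """Normal-approximation interval for p1 - p2, mirroring cells D8 and D9."""
    p1, p2 = x1 / n1, x2 / n2                     # p-hat values (cells D6, E6)
    z = abs(NormalDist().inv_cdf(alpha / 2))      # ABS(NORM.S.INV(E1/2))
    se = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    diff = p1 - p2
    return diff - z * se, diff + z * se           # (lcl, ucl)


# Summary statistics from the problem: x1 = 14, n1 = 24, x2 = 13, n2 = 30.
lcl, ucl = wald_interval(14, 24, 13, 30, alpha=0.1)
print(f"({lcl:.4f}, {ucl:.4f})")
```

The printed endpoints are rounded to the ten-thousandths place, as the exercise requests, and should agree with the values Excel reports in cells D8 and D9.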
