The derivation of the Wald interval
up vote
2
down vote
favorite
I'm asking about the binomial proportion confidence interval, also known as the Wald interval.
Recall that
$$lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha, $$
with $sigma = sqrt{p(1-p)}$.
Starting from the expression above, and the fact that for $hat{p}=dfrac{sum X_i}{n}, hat{sigma}=sqrt{hat{p}(1-hat{p})}$ is consistent for $sigma$ ($X_i sim rm Bin(1,p) )$, what argument can I use to show that
$$left[-z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n} , z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n}right]$$
is a confidence interval?
statistics statistical-inference
add a comment |
up vote
2
down vote
favorite
I'm asking about the binomial proportion confidence interval, also known as the Wald interval.
Recall that
$$lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha, $$
with $sigma = sqrt{p(1-p)}$.
Starting from the expression above, and the fact that for $hat{p}=dfrac{sum X_i}{n}, hat{sigma}=sqrt{hat{p}(1-hat{p})}$ is consistent for $sigma$ ($X_i sim rm Bin(1,p) )$, what argument can I use to show that
$$left[-z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n} , z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n}right]$$
is a confidence interval?
statistics statistical-inference
thx for the edit
– kcesc04
Sep 23 '15 at 16:09
I've edited your question a bit more to tidy it up, but I'm not sure exactly what you're asking. Technically speaking, any interval is a confidence interval, some just have a higher confidence level than others. Are you asking how to show that the given interval matches the specified error level $alpha$?
– Ilmari Karonen
Sep 23 '15 at 17:29
no , by this definition of CI $lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha$
– kcesc04
Sep 23 '15 at 20:48
1
knowing the variace of the binomial is dependant on p
– kcesc04
Sep 23 '15 at 20:49
add a comment |
up vote
2
down vote
favorite
up vote
2
down vote
favorite
I'm asking about the binomial proportion confidence interval, also known as the Wald interval.
Recall that
$$lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha, $$
with $sigma = sqrt{p(1-p)}$.
Starting from the expression above, and the fact that for $hat{p}=dfrac{sum X_i}{n}, hat{sigma}=sqrt{hat{p}(1-hat{p})}$ is consistent for $sigma$ ($X_i sim rm Bin(1,p) )$, what argument can I use to show that
$$left[-z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n} , z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n}right]$$
is a confidence interval?
statistics statistical-inference
I'm asking about the binomial proportion confidence interval, also known as the Wald interval.
Recall that
$$lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha, $$
with $sigma = sqrt{p(1-p)}$.
Starting from the expression above, and the fact that for $hat{p}=dfrac{sum X_i}{n}, hat{sigma}=sqrt{hat{p}(1-hat{p})}$ is consistent for $sigma$ ($X_i sim rm Bin(1,p) )$, what argument can I use to show that
$$left[-z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n} , z_{1-frac{alpha}{2}}frac{hat{sigma}}{sqrt{n}}+bar{X_n}right]$$
is a confidence interval?
statistics statistical-inference
statistics statistical-inference
edited Sep 23 '15 at 17:31
Ilmari Karonen
19.2k25182
19.2k25182
asked Sep 23 '15 at 15:51
kcesc04
289
289
thx for the edit
– kcesc04
Sep 23 '15 at 16:09
I've edited your question a bit more to tidy it up, but I'm not sure exactly what you're asking. Technically speaking, any interval is a confidence interval, some just have a higher confidence level than others. Are you asking how to show that the given interval matches the specified error level $alpha$?
– Ilmari Karonen
Sep 23 '15 at 17:29
no , by this definition of CI $lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha$
– kcesc04
Sep 23 '15 at 20:48
1
knowing the variace of the binomial is dependant on p
– kcesc04
Sep 23 '15 at 20:49
add a comment |
thx for the edit
– kcesc04
Sep 23 '15 at 16:09
I've edited your question a bit more to tidy it up, but I'm not sure exactly what you're asking. Technically speaking, any interval is a confidence interval, some just have a higher confidence level than others. Are you asking how to show that the given interval matches the specified error level $alpha$?
– Ilmari Karonen
Sep 23 '15 at 17:29
no , by this definition of CI $lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha$
– kcesc04
Sep 23 '15 at 20:48
1
knowing the variace of the binomial is dependant on p
– kcesc04
Sep 23 '15 at 20:49
thx for the edit
– kcesc04
Sep 23 '15 at 16:09
thx for the edit
– kcesc04
Sep 23 '15 at 16:09
I've edited your question a bit more to tidy it up, but I'm not sure exactly what you're asking. Technically speaking, any interval is a confidence interval, some just have a higher confidence level than others. Are you asking how to show that the given interval matches the specified error level $alpha$?
– Ilmari Karonen
Sep 23 '15 at 17:29
I've edited your question a bit more to tidy it up, but I'm not sure exactly what you're asking. Technically speaking, any interval is a confidence interval, some just have a higher confidence level than others. Are you asking how to show that the given interval matches the specified error level $alpha$?
– Ilmari Karonen
Sep 23 '15 at 17:29
no , by this definition of CI $lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha$
– kcesc04
Sep 23 '15 at 20:48
no , by this definition of CI $lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha$
– kcesc04
Sep 23 '15 at 20:48
1
1
knowing the variace of the binomial is dependant on p
– kcesc04
Sep 23 '15 at 20:49
knowing the variace of the binomial is dependant on p
– kcesc04
Sep 23 '15 at 20:49
add a comment |
1 Answer
1
active
oldest
votes
up vote
0
down vote
The Wald confidence interval for binomial success probability $p$ depends on two approximations.
(1) That $Z = frac{hat p - p}{sqrt{p(1-p)/n}}$ is approximately standard normal, $Norm(0, 1)$. Thus one would have
$P(-1.96 < Z < 1.96) approx 0.95.$ This is a good approximation
if $n$ is large and $p$ is not too far from $1/2.$
[A common rule of thumb is that $np$ and $n(1-p)$ should both
exceed. 5.]
From there, simple algebra gives
$$Pleft(hat p - 1.96sqrt{p(1-p)/n} < p < hat p + 1.96sqrt{p(1-p)/n} right) approx .95.$$
This is promising because $p$ is 'isolated' (after a fashion) between two 'bounds', but not useful in practice for making a confidence interval because $sqrt{p(1-p)/n}$ is unknown.
(2) This leads to the second assumption, that if $n$ is sufficiently
large, then $hat p$ will be sufficiently close to $p$ that we can
write
$$Pleft(hat p - 1.96sqrt{hat p(1-hat p)/n} < p < hat p + 1.96sqrt{hat p(1- hat p)/n} right) approx .95.$$
So that an approximate 95% confidence interval for $p$ is
of the form $hat p pm 1.96sqrt{hat p(1- hat p)/n}.$
Similarly for other confidence levels with an appropriate
number from standard normal tables replacing 1.96. (For
example, 1.645 for a 90% CI and 2.576 for a 99% CI.)
$Notes:;$
Unfortunately, as shown by intensive computations for various values of
$n$ and $p,$ the actual 'coverage probability'
of the Wald interval can be far from 95% (and what is worse,
often far $below$ 95%) with 1.96. Similarly for other 'target'
confidence levels. (A key reference is Brown, Cai, and DasGupta, 2001.)
If $n$ is several hundred or thousand (as in a public opinion poll
or a large-scale
simulation) the Wald interval is tolerably accurate.
Otherwise, for a 95% CI with smaller $n$ a considerable
improvement is artificially to introduce two extra successes
and two extra failures into the data before finding $hat p$ and $n$.
This adjustment (due to Agresti and Coull, 1998) is now widely
used instead of the Wald interval. (See Wikipedia.)
The Wilson interval (again, Wikipedia) results from taking the square
and then solving a quadratic equation to (truly) isolate $p$ in $-1.96 < Z < 1.96$ without making assumption (2). Equating 1.96 and 2 in the
95% Wilson CI gives nearly the same result as the simpler Agresti-Coull interval.
The plots below show $actual$ coverage probabilities
of Wald and Agresti "95%" CIs for 2000 values of $p$
between 0 and 1 for $n = 100$. The rapid oscillation of
coverage probabilities for even small changes in $p$ is due
to the discreteness of the binomial distribution.
add a comment |
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
0
down vote
The Wald confidence interval for binomial success probability $p$ depends on two approximations.
(1) That $Z = frac{hat p - p}{sqrt{p(1-p)/n}}$ is approximately standard normal, $Norm(0, 1)$. Thus one would have
$P(-1.96 < Z < 1.96) approx 0.95.$ This is a good approximation
if $n$ is large and $p$ is not too far from $1/2.$
[A common rule of thumb is that $np$ and $n(1-p)$ should both
exceed. 5.]
From there, simple algebra gives
$$Pleft(hat p - 1.96sqrt{p(1-p)/n} < p < hat p + 1.96sqrt{p(1-p)/n} right) approx .95.$$
This is promising because $p$ is 'isolated' (after a fashion) between two 'bounds', but not useful in practice for making a confidence interval because $sqrt{p(1-p)/n}$ is unknown.
(2) This leads to the second assumption, that if $n$ is sufficiently
large, then $hat p$ will be sufficiently close to $p$ that we can
write
$$Pleft(hat p - 1.96sqrt{hat p(1-hat p)/n} < p < hat p + 1.96sqrt{hat p(1- hat p)/n} right) approx .95.$$
So that an approximate 95% confidence interval for $p$ is
of the form $hat p pm 1.96sqrt{hat p(1- hat p)/n}.$
Similarly for other confidence levels with an appropriate
number from standard normal tables replacing 1.96. (For
example, 1.645 for a 90% CI and 2.576 for a 99% CI.)
$Notes:;$
Unfortunately, as shown by intensive computations for various values of
$n$ and $p,$ the actual 'coverage probability'
of the Wald interval can be far from 95% (and what is worse,
often far $below$ 95%) with 1.96. Similarly for other 'target'
confidence levels. (A key reference is Brown, Cai, and DasGupta, 2001.)
If $n$ is several hundred or thousand (as in a public opinion poll
or a large-scale
simulation) the Wald interval is tolerably accurate.
Otherwise, for a 95% CI with smaller $n$ a considerable
improvement is artificially to introduce two extra successes
and two extra failures into the data before finding $hat p$ and $n$.
This adjustment (due to Agresti and Coull, 1998) is now widely
used instead of the Wald interval. (See Wikipedia.)
The Wilson interval (again, Wikipedia) results from taking the square
and then solving a quadratic equation to (truly) isolate $p$ in $-1.96 < Z < 1.96$ without making assumption (2). Equating 1.96 and 2 in the
95% Wilson CI gives nearly the same result as the simpler Agresti-Coull interval.
The plots below show $actual$ coverage probabilities
of Wald and Agresti "95%" CIs for 2000 values of $p$
between 0 and 1 for $n = 100$. The rapid oscillation of
coverage probabilities for even small changes in $p$ is due
to the discreteness of the binomial distribution.
add a comment |
up vote
0
down vote
The Wald confidence interval for binomial success probability $p$ depends on two approximations.
(1) That $Z = frac{hat p - p}{sqrt{p(1-p)/n}}$ is approximately standard normal, $Norm(0, 1)$. Thus one would have
$P(-1.96 < Z < 1.96) approx 0.95.$ This is a good approximation
if $n$ is large and $p$ is not too far from $1/2.$
[A common rule of thumb is that $np$ and $n(1-p)$ should both
exceed. 5.]
From there, simple algebra gives
$$Pleft(hat p - 1.96sqrt{p(1-p)/n} < p < hat p + 1.96sqrt{p(1-p)/n} right) approx .95.$$
This is promising because $p$ is 'isolated' (after a fashion) between two 'bounds', but not useful in practice for making a confidence interval because $sqrt{p(1-p)/n}$ is unknown.
(2) This leads to the second assumption, that if $n$ is sufficiently
large, then $hat p$ will be sufficiently close to $p$ that we can
write
$$Pleft(hat p - 1.96sqrt{hat p(1-hat p)/n} < p < hat p + 1.96sqrt{hat p(1- hat p)/n} right) approx .95.$$
So that an approximate 95% confidence interval for $p$ is
of the form $hat p pm 1.96sqrt{hat p(1- hat p)/n}.$
Similarly for other confidence levels with an appropriate
number from standard normal tables replacing 1.96. (For
example, 1.645 for a 90% CI and 2.576 for a 99% CI.)
$Notes:;$
Unfortunately, as shown by intensive computations for various values of
$n$ and $p,$ the actual 'coverage probability'
of the Wald interval can be far from 95% (and what is worse,
often far $below$ 95%) with 1.96. Similarly for other 'target'
confidence levels. (A key reference is Brown, Cai, and DasGupta, 2001.)
If $n$ is several hundred or thousand (as in a public opinion poll
or a large-scale
simulation) the Wald interval is tolerably accurate.
Otherwise, for a 95% CI with smaller $n$ a considerable
improvement is artificially to introduce two extra successes
and two extra failures into the data before finding $hat p$ and $n$.
This adjustment (due to Agresti and Coull, 1998) is now widely
used instead of the Wald interval. (See Wikipedia.)
The Wilson interval (again, Wikipedia) results from taking the square
and then solving a quadratic equation to (truly) isolate $p$ in $-1.96 < Z < 1.96$ without making assumption (2). Equating 1.96 and 2 in the
95% Wilson CI gives nearly the same result as the simpler Agresti-Coull interval.
The plots below show $actual$ coverage probabilities
of Wald and Agresti "95%" CIs for 2000 values of $p$
between 0 and 1 for $n = 100$. The rapid oscillation of
coverage probabilities for even small changes in $p$ is due
to the discreteness of the binomial distribution.
add a comment |
up vote
0
down vote
up vote
0
down vote
The Wald confidence interval for binomial success probability $p$ depends on two approximations.
(1) That $Z = frac{hat p - p}{sqrt{p(1-p)/n}}$ is approximately standard normal, $Norm(0, 1)$. Thus one would have
$P(-1.96 < Z < 1.96) approx 0.95.$ This is a good approximation
if $n$ is large and $p$ is not too far from $1/2.$
[A common rule of thumb is that $np$ and $n(1-p)$ should both
exceed. 5.]
From there, simple algebra gives
$$Pleft(hat p - 1.96sqrt{p(1-p)/n} < p < hat p + 1.96sqrt{p(1-p)/n} right) approx .95.$$
This is promising because $p$ is 'isolated' (after a fashion) between two 'bounds', but not useful in practice for making a confidence interval because $sqrt{p(1-p)/n}$ is unknown.
(2) This leads to the second assumption, that if $n$ is sufficiently
large, then $hat p$ will be sufficiently close to $p$ that we can
write
$$Pleft(hat p - 1.96sqrt{hat p(1-hat p)/n} < p < hat p + 1.96sqrt{hat p(1- hat p)/n} right) approx .95.$$
So that an approximate 95% confidence interval for $p$ is
of the form $hat p pm 1.96sqrt{hat p(1- hat p)/n}.$
Similarly for other confidence levels with an appropriate
number from standard normal tables replacing 1.96. (For
example, 1.645 for a 90% CI and 2.576 for a 99% CI.)
$Notes:;$
Unfortunately, as shown by intensive computations for various values of
$n$ and $p,$ the actual 'coverage probability'
of the Wald interval can be far from 95% (and what is worse,
often far $below$ 95%) with 1.96. Similarly for other 'target'
confidence levels. (A key reference is Brown, Cai, and DasGupta, 2001.)
If $n$ is several hundred or thousand (as in a public opinion poll
or a large-scale
simulation) the Wald interval is tolerably accurate.
Otherwise, for a 95% CI with smaller $n$ a considerable
improvement is artificially to introduce two extra successes
and two extra failures into the data before finding $hat p$ and $n$.
This adjustment (due to Agresti and Coull, 1998) is now widely
used instead of the Wald interval. (See Wikipedia.)
The Wilson interval (again, Wikipedia) results from taking the square
and then solving a quadratic equation to (truly) isolate $p$ in $-1.96 < Z < 1.96$ without making assumption (2). Equating 1.96 and 2 in the
95% Wilson CI gives nearly the same result as the simpler Agresti-Coull interval.
The plots below show $actual$ coverage probabilities
of Wald and Agresti "95%" CIs for 2000 values of $p$
between 0 and 1 for $n = 100$. The rapid oscillation of
coverage probabilities for even small changes in $p$ is due
to the discreteness of the binomial distribution.
The Wald confidence interval for binomial success probability $p$ depends on two approximations.
(1) That $Z = frac{hat p - p}{sqrt{p(1-p)/n}}$ is approximately standard normal, $Norm(0, 1)$. Thus one would have
$P(-1.96 < Z < 1.96) approx 0.95.$ This is a good approximation
if $n$ is large and $p$ is not too far from $1/2.$
[A common rule of thumb is that $np$ and $n(1-p)$ should both
exceed. 5.]
From there, simple algebra gives
$$Pleft(hat p - 1.96sqrt{p(1-p)/n} < p < hat p + 1.96sqrt{p(1-p)/n} right) approx .95.$$
This is promising because $p$ is 'isolated' (after a fashion) between two 'bounds', but not useful in practice for making a confidence interval because $sqrt{p(1-p)/n}$ is unknown.
(2) This leads to the second assumption, that if $n$ is sufficiently
large, then $hat p$ will be sufficiently close to $p$ that we can
write
$$Pleft(hat p - 1.96sqrt{hat p(1-hat p)/n} < p < hat p + 1.96sqrt{hat p(1- hat p)/n} right) approx .95.$$
So that an approximate 95% confidence interval for $p$ is
of the form $hat p pm 1.96sqrt{hat p(1- hat p)/n}.$
Similarly for other confidence levels with an appropriate
number from standard normal tables replacing 1.96. (For
example, 1.645 for a 90% CI and 2.576 for a 99% CI.)
$Notes:;$
Unfortunately, as shown by intensive computations for various values of
$n$ and $p,$ the actual 'coverage probability'
of the Wald interval can be far from 95% (and what is worse,
often far $below$ 95%) with 1.96. Similarly for other 'target'
confidence levels. (A key reference is Brown, Cai, and DasGupta, 2001.)
If $n$ is several hundred or thousand (as in a public opinion poll
or a large-scale
simulation) the Wald interval is tolerably accurate.
Otherwise, for a 95% CI with smaller $n$ a considerable
improvement is artificially to introduce two extra successes
and two extra failures into the data before finding $hat p$ and $n$.
This adjustment (due to Agresti and Coull, 1998) is now widely
used instead of the Wald interval. (See Wikipedia.)
The Wilson interval (again, Wikipedia) results from taking the square
and then solving a quadratic equation to (truly) isolate $p$ in $-1.96 < Z < 1.96$ without making assumption (2). Equating 1.96 and 2 in the
95% Wilson CI gives nearly the same result as the simpler Agresti-Coull interval.
The plots below show $actual$ coverage probabilities
of Wald and Agresti "95%" CIs for 2000 values of $p$
between 0 and 1 for $n = 100$. The rapid oscillation of
coverage probabilities for even small changes in $p$ is due
to the discreteness of the binomial distribution.
edited Oct 3 '15 at 6:45
answered Oct 3 '15 at 6:00
BruceET
34.7k71440
34.7k71440
add a comment |
add a comment |
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f1448233%2fthe-derivation-of-the-wald-interval%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
thx for the edit
– kcesc04
Sep 23 '15 at 16:09
I've edited your question a bit more to tidy it up, but I'm not sure exactly what you're asking. Technically speaking, any interval is a confidence interval, some just have a higher confidence level than others. Are you asking how to show that the given interval matches the specified error level $alpha$?
– Ilmari Karonen
Sep 23 '15 at 17:29
no , by this definition of CI $lim_{n to infty}{P_p left( -z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} leq p leq z_{1-frac{alpha}{2}}frac{sigma}{sqrt{n}}+bar{X_n} right)} = 1-alpha$
– kcesc04
Sep 23 '15 at 20:48
1
knowing the variace of the binomial is dependant on p
– kcesc04
Sep 23 '15 at 20:49