|
Interactive animation |
Confidence interval
We suggest that you first read the entry on "Interval Estimation".
-----
Let E = (x1, ..., xn) be a sample from an unknown distribution. Let also q be a parameter of this distribution, and q* an estimate of the value of this parameter. It is sometimes possible to identify an interval such that one can assert that this interval "covers" the (unknown) true value q0 of the parameter with a certain given probability P.
This interval is then called a confidence interval for the estimate q*. Its endpoints are random variables that depend only on the sample (and are therefore "statistics").
The width of the confidence interval is a measure of the uncertainty about the the position of the true value q0 of the estimated parameter.
The probability P is arbitrarily chosen by the analyst. It is called the confidence level for the confidence interval, and is denoted by (1 - a). The most frequently chosen values for a are 0.05 and 0.01, corresponding to 95% and 99% confidence level.
So, if one chooses a = 0.05, the corresponding confidence interval has a 0.95 probability to cover the true value q0 of the parameter.
-----
For a given sample, the width of the confidence interval depends on the chosen confidence level.
For a given confidence level, the endpoints of the confidence interval depend on the sample size : the larger the sample, the shorter the confidence interval. This is not surprising : the larger the sample, the more information we have about the distribution, and therefore about the parameter q, and the smaller is the uncertainty about its true value q0.
Given a sample, the only leeway available to the analyst is the tradeoff between confidence level and confidence interval width. If he tries to impose a higher confidence level, the confidence interval will become wider : the only way to increase the probability for q0 to be inside the interval is then to make the interval wider.
But if observations can be obtained at a reasonable cost, then it is possible to:
and then determine the minimal sample size needed to meet both constraints.
This question is addressed in the first Tutorial below.
For a given confidence level, there is an infinity of confidence intervals that share this confidence level (lower image of the illustration below).
Which one should we choose ?
Clearly, the best choice is that of the shortest interval, that is, the interval that most narrows down the region where the true value q0 is expected to lie. For the most classical pivots, (standard normal and t distributions), it can be shown that the shortest confidence intervals are symmetric with respect to the estimated value. So, the systematic (and often unexplained) use of these intervals is fully justified.
Asymmetric intervals
Pivots are not necessarily symmetric. For example,
the pivot relative to the variance of a normal distribution
is a
variable,
whose distribution is not symmetric. It is then customary to build "equal-tailed"
confidence intervals, that is, such that a/2 areas
are defined on either end of the distribution curve of the pivot. The
confidence interval is then asymetric (lower image of the illustration below).
There is no reason why these equal-tailed intervals should be the shortest confidence intervals, and in fact, they are not. Identifying the shortest confidence interval can then only be done by iterative optimization techniques.
Few exact confidence intervals are known. So, most of the time, it is necessary to seek formulas that yield approximate confidence intervals with a reasonable level of accuracy. There are two main ways of obtaining approximate confidence intervals :
Even when no exact confidence interval is known, it is often possible to identify an asymptotic confidence interval, that is, an approximate confidence interval whose accuracy gets better and better when sample size gets larger and larger. For very large samples an asymptotic interval is almost an exact confidence interval.
This approach relies on identifying a quantity that is not a pivot for finite size samples, but whose distribution tends towards a limit distribution that does not depend on the value of the estimated parameter as the sample size grows without limit.
This limit distribution is then used for building a confidence interval, even (and incorrectly so) for finite samples.
Approximate formulas
For moderate size samples, an asymptotic confidence intervals may be grossly wrong. A better solution is then to forego the limit pivotal quantity, and rather to find a genuine pivot whose distribution will be a good approximation of the distribution of the statistic used for defining the asymptotic pivot as in the previous paragraph.
The most classical example of this approach is Welch's approximation. It is used for building approximate confidence intervals on the difference of the means of two independent normal distributions of unknown and different variances.
Welch's approximation is addressed below.
Multiple confidence intervals
It is sometimes possible to define not just one confidence interval for one parameter, but a 2-D confidence region pertaining to a couple of parameters considered simultaneously.
For exemple, one can define a confidence region for the pair (m, s²) of sample drawn from a normal distribution N(µ, s²).
A major application of interval estimation is to calculate confidence intervals for the estimated values of the parameters of a model (for example, see here).
A large confidence interval on the estimated value of a parameter means that a there is a large uncertainty about the true value of this parameter. The two main causes for this large uncertainty are :
The concept of confidence interval may be extended to modeling : it is sometimes possible to associate a confidence interval (for a given confidence level) to each and every prediction of the model. For example, in Simple Linear Regression, to each value of the independent variable corresponds :
* A prediction of the value of the response variable, and
* A confidence interval covering
this prediction
.
This exceptionally favorable situation is made possible by the linearity of the model and the simplistic assumptions about the measurement errors. In general, no exact confidence interval can be found for model predictions, but it is sometimes possible to find approximate confidence intervals by linearizing the model in the parameter space.
The endpoints of the confidence interval then define a "confidence strip" (lower image in the illustration below). The model predictions are more reliable in regions where the strip is narrow than in regions where the strip is wide.
Local data density plays a major role in defining the strip width : the uncertainty about the model predictions is larger in low density areas than in of high density areas.
This notion generalizes to models with more than one independent variable, but there is then no graphic representation.
Confidence intervals and tests
When a confidence interval is available for the estimated value of a parameter, it is possible to devise a test about the value of this parameter. A confidence interval materializes a formula like :
Pr{a < q0 < b}= 1 - a
where a and b are the endpoints of the interval.
Let's now consider the null hypothesis H0 : q = q0 that we wish to test at the a significance level.
Recall that such a tests relies on our ability to identify a statistic S whose value on the sample will be S0, and such that, if H0 is true, we can find an interval such that :
But a confidence interval has all the ingredients we need for the test :
Conversely, an by the same argument taken the other way around, a test on the value of a distribution parameter leads to a confidence interval on the estimated value of the parameter.
So, as long as we consider parameter estimation, "confidence interval" and "test" are identical concepts.
But the domain of tests extends far beyond that of confidence intervals. For example, a normality test bears on the hypothesis :
H0 : "The sample was generated by a normal distribution"
and cannot be reduced to building a confidence interval.
|
Interactive animation |
_____________________________________________________________________
|
These Tutorials are the same as those you'll find on the page about interval estimation |
|
Tutorial 1 |
The first Tutorial describes the methods used to obtain exact confidence intervals for the means of normal distributions under various circumstances.
EXACT CONFIDENCE INTERVALS
FOR THE MEANS OF NORMAL DISTRIBUTIONS
|
Confidence interval of a mean Variance is known Variance is unknow Difference of a mean and a reference value Comparing two means Paired samples Independent samples Variances are known Variances are unknown but known to be equal Variances are unknown, and NOT assumed to be equal : a failure |
||
|
TUTORIAL |
||
_________________________________________________________________
|
Tutorial 2 |
In the general case (variances unknown and not assumed to be equal), no exact confidence interval is known for the difference of the means of two independent normal distributions. But is is possible to calculate two types of approximate intervals :
ASYMPTOTIC CONFIDENCE INTERVALS
AND WELCH'S APPROXIMATION
|
Asymptotic confidence interval (no demonstration) Welch's approximation |
||
|
TUTORIAL |
||
This table of contents is deceivingly short :
Warning : these Tutorials are the same
as those mentioned on the "Interval
estimation" page.
_________________________________________________
Related readings
|
Want to contribute to this site ? |