OPTIMAL LINEAR COMBINATION
OF INDEPENDENT UNBIASED ESTIMATORS
We denote :
* Q the quantity to be estimated,
* Q1 an unbiased estimator of Q, with known variance σ1².
* Q*1 a realization of Q1.
* Q2 a second unbiased estimator of Q, with known variance σ2².
* Q*2 a realization of Q2.
Q1 and Q2
are assumed to be independent (or rather, uncorrelated).
We seek values for λ1 and λ2 such that :
Qc = λ1.Q1 + λ2.Q2
* is unbiased,
* and has the lowest possible variance.
1) Estimator Qc must be unbiased.
We want Qc's expectation to be Q (the common mean of the unbiased estimators Q1 and Q2). So, if E denotes the expectation, we want :
E[λ1.Q1 + λ2.Q2 ] = Q
But :
E[λ1.Q1 + λ2.Q2] = λ1.E[Q1] + λ2.E[Q2 ] = λ1.Q + λ2.Q = ( λ1 + λ2).Q
So we must have :
( λ1 + λ2).Q = Q or λ1 + λ2 = 1
We will further denote the two coefficients λ1 = λ and λ2 = (1 - λ).
2) Estimator Qc must have minimal variance
The variance of Qc is :
Var(Qc) = Var(λ.Q1 + (1 -λ).Q2)
Because Q1 and Q2 are uncorrelated, we have :
Var(λ.Q1 + (1 -λ).Q2) = Var(λ.Q1) + Var((1 -λ).Q2)
So :
Var(Qc) = σ² = λ².Var(Q1) + (1 -λ)².Var(Q2) = λ².σ1² + (1 -λ)².σ2²
We want σ² to be minimal, so we take it's derivative with respect to λ, and make it equal to 0 :
dσ² / dλ = 2.(λ.σ1² - (1 -λ).σ2²) = 0
or :
|
λ1
= σ2²
/ (σ1²
+ σ2²) |
The second derivative of σ² is :
d²σ²/dλ² = 2.(σ1² + 2²) > 0
so the extremum of σ2² is indeed a minimum.
The minimal value of σ² is therefore :
σ² = λ1².σ1² + λ2². σ2² = σ24 .σ1²/ (σ1² + σ2²)² + σ2² .σ14 /(σ1² + σ2²)²
or :
|
σ² = σ1².σ2² / (σ1² + σ2²) |
This value is smaller than both σ1² and σ2². For example, the ratio of σ² to σ1² is :
σ² / σ1² = σ2² / (σ1² + σ2²) < 1
__________________________________
Imagine that we are trying to estimate the mean of a distribution, for example the true position of a star on the celestial sphere.
Suppose that both Q*1 and Q*2 were obtained with the same telescope, whose measurement uncertainty is characterized by the single variance σi². But :
* Q*1 was obtained as the average of n1 measurements,
* while Q*2 was obtained as the average of n2 measurements the next night.
The variance of the distribution of Q1 is σ1² = σi² / n1, while that of Q2 is σ2² = σi² / n2. So the minimal variance of Qc is :
σ² = (σi² / n1).(σi² / n2) / (σi² / n1 + σi² / n2)
or
σ² = σi² /( n1 + n2)
Now imagine that the two samples are merged into one single large sample with n = (n1 + n2) observations. It is a known fact that the average of these n observations is the very best estimator of the true mean (i.e., no other unbiased estimator of the mean has a lower variance than the sample average). The variance of the distribution of the average of the large sample is :
σi² /( n1 + n2)
which is just what we found for the minimal variance
of Qc. So, at least in this case, we know for sure that Qc is
the very best estimator possible.