Jackknife resampling
In statistics, the jackknife is a resampling technique especially useful for variance and bias estimation. The jackknife pre-dates other common resampling methods such as the bootstrap. The jackknife estimator of a parameter is found by systematically leaving out each observation from a dataset and calculating the estimate and then finding the average of these calculations. Given a sample of size , the jackknife estimate is found by aggregating the estimates of each -sized sub-sample.
The jackknife technique was developed by Maurice Quenouille (1924–1973) from 1949 and refined in 1956. John Tukey expanded on the technique in 1958 and proposed the name "jackknife" because, like a physical jack-knife (a compact folding knife), it is a rough-and-ready tool that can improvise a solution for a variety of problems even though specific problems may be more efficiently solved with a purpose-designed tool.[1]
The jackknife is a linear approximation of the bootstrap.[1]
Estimation
The jackknife estimate of a parameter can be found by estimating the parameter for each subsample omitting the i-th observation.[2] For example, if the parameter to be estimated is the population mean of x, we compute the mean for each subsample consisting of all but the i-th data point:
These n estimates form an estimate of the distribution of the sample statistic if it were computed over a large number of samples. In particular, the mean of this sampling distribution is the average of these n estimates:
One can show explicitly that this equals the usual estimate , so the real point emerges for higher moments than the mean. A jackknife estimate of the variance of the estimator can be calculated from the variance of this distribution of :[3][4]
Bias estimation and correction
The jackknife technique can be used to estimate the bias of an estimator calculated over the entire sample. Say is the calculated estimator of the parameter of interest based on all observations. Let
where is the estimate of interest based on the sample with the i-th observation removed, and is the average of these "leave-one-out" estimates. The jackknife estimate of the bias of is given by:
and the resulting bias-corrected jackknife estimate of is given by:
This removes the bias in the special case that the bias is and removes it to in other cases.[1]
See also
Notes
- Cameron & Trivedi 2005, p. 375.
- Efron 1982, p. 2.
- Efron 1982, p. 14.
- McIntosh, Avery I. "The Jackknife Estimation Method" (PDF). Boston University. Avery I. McIntosh. Retrieved 2016-04-30.: p. 3.
References
- Cameron, Adrian; Trivedi, Pravin K. (2005). Microeconometrics : methods and applications. Cambridge New York: Cambridge University Press. ISBN 9780521848053.
- Efron, Bradley; Stein, Charles (May 1981). "The Jackknife Estimate of Variance". The Annals of Statistics. 9 (3): 586–596. doi:10.1214/aos/1176345462. JSTOR 2240822.
- Efron, Bradley (1982). The jackknife, the bootstrap, and other resampling plans. Philadelphia, PA: Society for Industrial and Applied Mathematics. ISBN 9781611970319.
- Quenouille, Maurice H. (September 1949). "Problems in Plane Sampling". The Annals of Mathematical Statistics. 20 (3): 355–375. doi:10.1214/aoms/1177729989. JSTOR 2236533.
- Quenouille, Maurice H. (1956). "Notes on Bias in Estimation". Biometrika. 43 (3–4): 353–360. doi:10.1093/biomet/43.3-4.353. JSTOR 2332914.
- Tukey, John W. (1958). "Bias and confidence in not quite large samples (abstract)". The Annals of Mathematical Statistics. 29 (2): 614. doi:10.1214/aoms/1177706647.