Sheppard's correction

In statistics, Sheppard's corrections are approximate corrections to estimates of moments computed from binned data. The concept is named after William Fleetwood Sheppard.

Let be the measured kth moment, the corresponding corrected moment, and the class interval (bin width). No correction is necessary for the mean (first moment about zero). The first few measured and corrected moments about the mean are then related as follows:

When the data come from a normally distributed population, then binning and using the midpoint of the bin as the observed value results in an overestimate of the variance. That is why the correction to the variance is negative. The reason why the uncorrected estimate of the variance is an overestimate is that the error is negatively correlated with the observation. For the uniform distribution, the error is uncorrelated with the observation, so a correction should be +c2/12, which is the variance of the error itself rather than c2/12. Thus Sheppard's correction is biased in favor of population distributions in which the error is negatively correlated with the observation.

The cumulants of the sum of the grouped variable and the uniform variable are the sums of the cumulants. As odd cumulants of a uniform distribution are zero; only even moments are affected.

The second and fourth cumulants of the uniform distribution on (−0.5c, 0.5c) are respectively, c2/12 and c4/120.

The correction to moments can be derived from the relation between cumulants and moments.

References

  • Weisstein, Eric W. "Sheppard's Correction". MathWorld—A Wolfram Web Resource. Retrieved March 2, 2014.
  • Weatherburn, C.E. (1949), A first course in mathematical statistics, Cambridge University Press
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.