Ancillary statistic



         


In statistics, an ancillary statistic is a statistic whose probability distribution does not depend on which of the probability distributions among those being considered is the distribution of the statistical population from which the data were taken. This concept was introduced by the great statistical geneticist Sir Ronald Fisher.

[Top]

Examples

<math>\overline{X}_n=(X_1+\,\cdots\,+X_n)/n<math>
be the sample mean. The random variable
<math>\overline{X}_n-\mu<math>
is not an ancillary statistic, even though its probability distribution does not depend on μ That is because it is not a statistic, since its value depends on the unobservable population mean μ
The random variable
<math>\max\{\,X_1,\dots,X_n\,\}-\min\{\,X_1,\dots,X_n\,\}<math>
is an ancillary statistic, because
  • Its probability distribution does not change as μ changes, and
  • it depends only on the data X1, ..., Xn and not on the unobservable parameter μ, i.e., it is a statistic.
  • It is a part of the observable data (it is a statistic), and
  • Its probability distribution does not depend on the batter's ability, since it was chosen by a random process independent of the batter's ability.
This ancillary statistic is an ancillary complement to the observed batting average X/N, i.e., the batting average X/N is not a sufficient statistic, in that it conveys less than all of the relevant information in the data, but conjoined with N, it becomes sufficient.






  View Live Article   This article is from Wikipedia. All text is available under the terms of the GNU Free Documentation License