In probability theory, the central limit theorem states that, under certain circumstances, the probability distribution of the scaled mean of a random sample converges to a normal distribution as the sample size increases to infinity. Under stronger assumptions, the Berry–Esseen theorem, or Berry–Esseen inequality, gives a more quantitative result, because it also specifies the rate at which this convergence takes place by giving a bound on the maximal error of approximation between the normal distribution and the true distribution of the scaled sample mean. The approximation is measured by the Kolmogorov–Smirnov distance. In the case of independent samples, the convergence rate is n −1/2, where n is the sample size, and the constant is estimated in terms of the third absolute normalized moment. It is also possible to give non-uniform bounds which become more strict for more extreme events.

Statements of the theorem vary, as it was independently discovered by two mathematicians, Andrew C. Berry (in 1941) and Carl-Gustav Esseen (1942), who then, along with other authors, refined it repeatedly over subsequent decades.

One version, sacrificing generality somewhat for the sake of clarity, is the following:

There exists a positive constant C such that if X 1, X 2,…, are i.i.d. random variables with E (X 1) = 0, E(X 1 2) = σ 2 > 0, and E(| X 1 | 3) = ρ < ∞,1 and if we define

the sample mean, with F n the cumulative distribution function of

and Φ the cumulative distribution function of the standard normal distribution, then for all x and n,

Illustration of the difference in cumulative distribution functions alluded to in the theorem.

That is: given a sequence of independent and identically distributed random variables, each having mean zero and positive variance, if additionally the third absolute moment is finite, then the cumulative distribution functions of the standardized sample mean and the standard normal distribution differ (vertically, on a graph) by no more than the specified amount. Note that the approximation error for all n (and hence the limiting rate of convergence for indefinite n sufficiently large) is bounded by the order of n −1/2.

Calculated upper bounds on the constant C have decreased markedly over the years, from the original value of 7.59 by Esseen in 1942.2 The estimate C < 0.4748 follows from the inequality

since σ 3ρ and 0.33554 · 1.415 < 0.4748. However, if ρ ≥ 1.286 σ 3, then the estimate

is even tighter.3

Esseen (1956) proved that the constant also satisfies the lower bound

Let X 1, X 2,…, be independent random variables with E (X i) = 0, E(X i 2) = σ i 2 > 0, and E(| X i | 3) = ρ i < ∞. Also, let

be the normalized n -th partial sum. Denote F n the cdf of S n, and Φ the cdf of the standard normal distribution. For the sake of convenience denote

In 1941, Andrew C. Berry proved that for all n there exists an absolute constant C 1 such that

where

Independently, in 1942, Carl-Gustav Esseen proved that for all n there exists an absolute constant C 0 such that

where

It is easy to make sure that ψ 0 ≤ψ 1. Due to this circumstance inequality (3) is conventionally called the Berry–Esseen inequality, and the quantity ψ 0 is called the Lyapunov fraction of the third order. Moreover, in the case where the summands X 1,…, X n have identical distributions

and thus the bounds stated by inequalities (1), (2) and (3) coincide apart from the constant.

Regarding C 0, obviously, the lower bound established by Esseen (1956) remains valid:

The lower bound is exactly reached only for certain Bernoulli distributions (see Esseen (1956) for their explicit expressions).

The upper bounds for C 0 were subsequently lowered from Esseen’s original estimate 7.59 to 0.5600.4

Berry–Esseen theorems exist for the sum of a random number of random variables. The following is Theorem 1 from Korolev (1989), substituting in the constants from Remark 3.5 It is only a portion of the results that they established:

Let be independent, identically distributed random variables with , , . Let be a non-negative integer-valued random variable, independent from . Let , and define

Then

Multidimensional version

As with the multidimensional central limit theorem, there is a multidimensional version of the Berry–Esseen theorem.6 7

Let be independent -valued random vectors each having mean zero. Write and assume is invertible. Let be a -dimensional Gaussian with the same mean and covariance matrix as . Then for all convex sets ,

,

where is a universal constant and (the third power of the L 2 norm).

The dependency on is conjectured to be optimal, but might not be.7

Non-uniform bounds

The bounds given above consider the maximal difference between the cdf’s. They are ‘uniform’ in that they do not depend on and quantify the uniform convergence . However, because goes to zero for large by general properties of cdf’s, these uniform bounds will be overestimating the difference for such arguments. This is despite the uniform bounds being sharp in general. It is therefore desirable to obtain upper bounds which depend on and in this way become smaller for large .

One such result going back to (Esseen 1945) that was since improved multiple times is the following.

As above, let X 1, X 2,…, be independent random variables with E (X i) = 0, E(X i 2) = σ i 2 > 0, and E(| X i | 3) = ρ i < ∞. Also, let and

be the normalized n -th partial sum. Denote F n the cdf of S n, and Φ the cdf of the standard normal distribution. Then

,

where is a universal constant.

The constant may be taken as 114.667.8 Moreover, if the are identically distributed, it can be taken as , where is the constant from the first theorem above, and hence 30.2211 works.9

See also

Notes

References

Bibliography

Footnotes

  1. Since the random variables are identically distributed, X 2, X 3,… all have the same moments as X 1.

  2. Esseen (1942). For improvements see van Beek (1972), Shiganov (1986), Shevtsova (2007), Shevtsova (2008), Tyurin (2009), Korolev & Shevtsova (2010a), Tyurin (2010). The detailed review can be found in the papers Korolev & Shevtsova (2010a) and Korolev & Shevtsova (2010b).

  3. Shevtsova (2011).

  4. Esseen (1942); Zolotarev (1967); van Beek (1972); Shiganov (1986); Tyurin (2009); Tyurin (2010); Shevtsova (2010).

  5. Korolev, V. Yu (1989). “On the Accuracy of Normal Approximation for the Distributions of Sums of a Random Number of Independent Random Variables”. Theory of Probability & Its Applications. 33 (3): 540– 544. doi:10.1137/1133079.

  6. Bentkus, Vidmantas. “A Lyapunov-type bound in R d.” Theory of Probability & Its Applications 49.2 (2005): 311–323.

  7. Raič, Martin (2019). “A multivariate Berry—Esseen theorem with explicit constants”. Bernoulli. 25 (4A): 2824– 2853. arXiv:1802.06475. doi:10.3150/18-BEJ1072. ISSN 1350-7265. S2CID 119607520. 2

  8. Paditz, Ludwig (1997). Über die Annäherung der Verteilungsfunktionen von Summen unabhängiger Zufallsgrößen gegen unbegrenzt teilbare Verteilungsfunktionen unter besonderer Beachtung der Verteilungsfunktion der standardisierten Normalverteilung [On the approximation of cumulative distribution functions of sums of independent random variables by infinitely divisible cumulative distribution functions with special attention to the cumulative distribution function of the standard normal distribution] (in German). Dresden. p. 6.{{[cite book](https://en.wikipedia.org/wiki/Template:Cite_book "Template:Cite book")}}: CS1 maint: location missing publisher (link)

  9. Michel, R. (1981). “On the constant in the nonuniform version of the Berry-Esséen theorem”. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete. 55: 109– 117.