The Chernoff Bound

The Chernoff bound is like a genericized trademark: it refers not to a particular inequality, but rather to a technique for obtaining exponentially decreasing bounds on tail probabilities.

Much of this material comes from my CS 365 textbook, Randomized Algorithms by Motwani and Raghavan.

Let $X_1, \ldots, X_n$ be independent random variables, where $X_i$ takes the value $1$ with probability $p_i$ and $0$ otherwise (a Bernoulli distribution). Suppose at least one of the $p_i$ is nonzero. Let $X = \sum_{i=1}^n X_i$, and let $\mu = \mathrm{E}[X] = \sum_{i=1}^n p_i$.
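As a quick illustration of this setup (the particular $p_i$ values and helper names below are hypothetical, not from the text), we can simulate independent Bernoulli trials and confirm that the sample mean of $X$ hovers near $\mu = \sum_i p_i$:

```python
import random

random.seed(1)  # fixed seed for reproducibility

# Hypothetical example: four independent Bernoulli variables X_i,
# where X_i is 1 with probability p[i] and 0 otherwise.
p = [0.1, 0.5, 0.9, 0.3]
mu = sum(p)  # E[X] = sum of the p_i, by linearity of expectation

def sample_X():
    """Draw one value of X = X_1 + ... + X_n."""
    return sum(1 for pi in p if random.random() < pi)

# A Monte Carlo average of X should land close to mu = 1.8.
trials = 100_000
avg = sum(sample_X() for _ in range(trials)) / trials
```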

Multiplicative Chernoff Bound

We bound $\Pr[X > (1+\delta)\mu]$ for $\delta > 0$ via Markov's inequality and a Taylor series approximation of the exponential function.

We have $\Pr[X > (1+\delta)\mu] = \Pr[e^{tX} > e^{t(1+\delta)\mu}]$ for all $t > 0$. We'll later select an optimal value for $t$. By Markov's inequality, we have:

$$\Pr[e^{tX} > e^{t(1+\delta)\mu}] < \frac{\mathrm{E}[e^{tX}]}{e^{t(1+\delta)\mu}}.$$

My textbook stated this inequality is in fact strict if we assume none of the $p_i$ are $0$ or $1$, but I'm not sure this assumption is required, since a strict inequality appears later on regardless.

Then, since the $X_i$ are independent:

$$\mathrm{E}[e^{tX}] = \mathrm{E}\left[\prod_{i=1}^n e^{tX_i}\right] = \prod_{i=1}^n \mathrm{E}[e^{tX_i}].$$

We can compute $\mathrm{E}[e^{tX_i}]$ explicitly: this random variable is $e^t$ with probability $p_i$, and $1$ otherwise, that is, with probability $1 - p_i$, thus this is equal to:

$$p_i e^t + 1 - p_i = 1 + p_i(e^t - 1).$$

We have $1 + x \le e^x$ for all $x$, with equality only at $x = 0$. As long as at least one $p_i$ is nonzero:

$$\prod_{i=1}^n \left(1 + p_i(e^t - 1)\right) < \prod_{i=1}^n e^{p_i(e^t - 1)} = e^{\mu(e^t - 1)}.$$

Whence:

$$\Pr[X > (1+\delta)\mu] < \frac{e^{\mu(e^t - 1)}}{e^{t(1+\delta)\mu}}.$$

It is time to choose $t$. Differentiating the right-hand side shows we attain the minimum at $t = \ln(1+\delta)$, which is positive when $\delta$ is. This value of $t$ yields the Chernoff bound:

$$\Pr[X > (1+\delta)\mu] < \left(\frac{e^\delta}{(1+\delta)^{1+\delta}}\right)^\mu.$$
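For completeness, the differentiation step can be spelled out. The logarithm of the right-hand side is $\mu(e^t - 1) - t(1+\delta)\mu$, and setting its derivative in $t$ to zero gives:

```latex
\frac{d}{dt}\left[\mu(e^t - 1) - t(1+\delta)\mu\right]
  = \mu e^t - (1+\delta)\mu = 0
  \quad\Longrightarrow\quad
  e^t = 1 + \delta, \qquad t = \ln(1+\delta).
```

The second derivative is $\mu e^t > 0$, so this is indeed a minimum. Substituting back, $e^{\mu(e^t - 1)} = e^{\mu\delta}$ and $e^{t(1+\delta)\mu} = (1+\delta)^{(1+\delta)\mu}$, which yields the stated bound.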

Below Expectations

We use the same technique to bound $\Pr[X < (1-\delta)\mu]$ for $0 < \delta < 1$. We have:

$$\Pr[X < (1-\delta)\mu] = \Pr[-X > -(1-\delta)\mu] = \Pr[e^{-tX} > e^{-t(1-\delta)\mu}]$$

for any $t > 0$. If we proceed as before, that is, apply Markov's inequality, use the approximation $1 + x \le e^x$, then pick $t = \ln\frac{1}{1-\delta}$ to minimize the bound, we have:

$$\Pr[X < (1-\delta)\mu] < \left(\frac{e^{-\delta}}{(1-\delta)^{1-\delta}}\right)^\mu.$$
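Both tail bounds can be sanity-checked numerically in the special case where all the $p_i$ equal a common $p$, so that $X$ is binomial and its tails can be computed exactly. This is only a sketch with arbitrarily chosen parameters; the helper names are ours, not the text's.

```python
from math import comb, exp

def binom_upper_tail(n, p, k):
    """Exact Pr[X > k] for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k + 1, n + 1))

def binom_lower_tail(n, p, k):
    """Exact Pr[X < k] for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(0, k))

n, p, delta = 100, 0.5, 0.2
mu = n * p  # 50

# Chernoff bounds for the upper and lower tails.
upper_bound = (exp(delta) / (1 + delta)**(1 + delta)) ** mu
lower_bound = (exp(-delta) / (1 - delta)**(1 - delta)) ** mu

# (1+delta)*mu = 60 and (1-delta)*mu = 40, hardcoded to dodge float rounding.
assert binom_upper_tail(n, p, 60) < upper_bound  # Pr[X > 60]
assert binom_lower_tail(n, p, 40) < lower_bound  # Pr[X < 40]
```

The exact tails here are around $0.02$, while the Chernoff bounds evaluate to roughly $0.39$ and $0.34$: valid, but loose, which motivates the cruder approximations below being good enough in practice.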

Bounding the bounds

Unfortunately, the above bounds are difficult to use, so in practice we use cruder but friendlier approximations.

Recall $\ln(1-\delta) = -\delta - \frac{\delta^2}{2} - \frac{\delta^3}{3} - \cdots$. Thus if $0 < \delta < 1$, we have:

$$(1-\delta)\ln(1-\delta) = -\delta + \frac{\delta^2}{2} + \frac{\delta^3}{6} + \cdots > -\delta + \frac{\delta^2}{2},$$

where we dropped only positive higher-order terms. Exponentiating both sides and raising to the power of $\mu$ yields:

$$\left(\frac{e^{-\delta}}{(1-\delta)^{1-\delta}}\right)^\mu < e^{-\mu\delta^2/2}.$$

Thus for $0 < \delta < 1$:

$$\Pr[X < (1-\delta)\mu] < e^{-\mu\delta^2/2}.$$
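The crude lower-tail bound can be checked against the exact Chernoff expression over a grid of $\delta$ values (a quick numeric verification of our own, with an arbitrary choice of $\mu$):

```python
from math import exp

mu = 50.0  # arbitrary mu for the check
for k in range(1, 100):
    d = k / 100.0  # delta ranges over (0, 1)
    exact = (exp(-d) / (1 - d)**(1 - d)) ** mu
    crude = exp(-mu * d * d / 2)
    # The friendlier bound must dominate the exact Chernoff expression.
    assert exact <= crude
```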

As for the other Chernoff bound, Wikipedia states:

$$\Pr[X > (1+\delta)\mu] < e^{-\mu\delta^2/4} \qquad \text{for } \delta < 2e - 1,$$

but my textbook quotes a better bound:

$$\Pr[X > (1+\delta)\mu] < e^{-\mu\delta^2/3} \qquad \text{for } 0 < \delta \le 1.$$
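A similar grid check (again ours, with an arbitrary $\mu$) confirms that for $0 < \delta \le 1$ the exact Chernoff expression sits below the $\delta^2/3$ bound, which in turn sits below the $\delta^2/4$ bound:

```python
from math import exp

mu = 50.0  # arbitrary mu for the check
for k in range(1, 101):
    d = k / 100.0  # delta ranges over (0, 1]
    exact = (exp(d) / (1 + d)**(1 + d)) ** mu
    # exact expression <= e^{-mu d^2/3} <= e^{-mu d^2/4}
    assert exact <= exp(-mu * d * d / 3) <= exp(-mu * d * d / 4)
```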

An Additive Chernoff Bound

This Chernoff bound, due to Hoeffding, appears as Problem 4.6 in Motwani and Raghavan. It was also mentioned in a cryptography class I took long ago.

For $1 \le i \le n$, let $X_i$ be a random variable that takes the value $1$ with probability $p$ and $0$ otherwise, and suppose the $X_i$ are independent. Let $X = \sum_{i=1}^n X_i$. Then for $\epsilon > 0$:

$$\Pr[X > (p + \epsilon)n] \le e^{-2n\epsilon^2}.$$
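The additive form can also be checked exactly in the binomial case. The parameter values and the `binom_tail` helper below are our own illustration, not from the text:

```python
from math import comb, exp

def binom_tail(n, p, k):
    """Exact Pr[X > k] for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k + 1, n + 1))

n, p, eps = 100, 0.3, 0.1
hoeffding = exp(-2 * n * eps**2)  # e^{-2n eps^2} = e^{-2}

# (p + eps) * n = 40, hardcoded to avoid float rounding inside int().
assert binom_tail(n, p, 40) < hoeffding
```

Note that the bound depends on $\epsilon$ and $n$ but not on $p$, which is what makes it "additive": it controls deviations of $X/n$ from $p$ by a fixed amount $\epsilon$.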


Ben Lynn blynn@cs.stanford.edu 💡