Inequalities

Convexity and Jensen’s Inequality

Definition (Convex and Concave Functions)

Let $ I \subset \mathbb{R} $ be an interval.

A function $ \varphi : I \to \mathbb{R} $ is convex if for all $ x,y \in I $ and $ \lambda \in [0,1] $, $\varphi(\lambda x + (1-\lambda)y) \le \lambda \varphi(x) + (1-\lambda)\varphi(y).$

It is concave if the inequality is reversed.

Equivalently, if $ \varphi $ is twice differentiable on $ I $, then

$ \varphi $ is convex if $ \varphi’’ \ge 0 $,
$ \varphi $ is concave if $ \varphi’’ \le 0 $.

Jensen’s Inequality

Let $ X $ be an integrable random variable with $ \mathbb{E}|X| < \infty $, and let
$ \varphi $ be a convex function such that $ \mathbb{E}|\varphi(X)| < \infty $. Then $\varphi(\mathbb{E}[X]) \le \mathbb{E}[\varphi(X)].$

Concave Jensen (Reverse Inequality)

If $ \varphi $ is concave and the expectations exist, then $\varphi(\mathbb{E}[X]) \ge \mathbb{E}[\varphi(X)].$

Key Uses

Moment bounds, e.g. $ (\mathbb{E} X )^p \le \mathbb{E} X ^p $ for $ p \ge 1 $
Log and exponential inequalities, e.g. $ \log \mathbb{E}[X] \ge \mathbb{E}[\log X] $
Foundational tool for Markov, Chebyshev, and entropy arguments

Markov and Chebyshev Inequalities

Markov’s Inequality

Let $ X \ge 0 $ be a random variable with $ \mathbb{E}[X] < \infty $. Then for any $ a > 0 $, $\mathbb{P}(X \ge a) \le \frac{\mathbb{E}[X]}{a}.$

More generally, if $ \varphi $ is nonnegative and increasing, $\mathbb{P}(X \ge a) \le \frac{\mathbb{E}[\varphi(X)]}{\varphi(a)}.$

Chebyshev’s Inequality

Let $ X $ be a random variable with mean $ \mu = \mathbb{E}[X] $ and variance $ \sigma^2 = \operatorname{Var}(X) < \infty $. Then for any $ \varepsilon > 0 $, $\mathbb{P}(|X - \mu| \ge \varepsilon) \le \frac{\sigma^2}{\varepsilon^2}.$

Mean-Zero ( $ \mu = 0 $ ) Version

If $ \mathbb{E}[X] = 0 $ and $ \mathbb{E}[X^2] < \infty $, then $\mathbb{P}(|X| \ge \varepsilon) \le \frac{\mathbb{E}[X^2]}{\varepsilon^2}.$

Relationship

Chebyshev’s inequality is Markov’s inequality applied to the nonnegative random variable
$ (X - \mu)^2 $.

Hölder and Cauchy–Schwarz Inequalities

Hölder’s Inequality

Let $ X $ and $ Y $ be random variables.
Let $ p,q > 1 $ satisfy $ \frac{1}{p} + \frac{1}{q} = 1 $.
If $ \mathbb{E}|X|^p < \infty $ and $ \mathbb{E}|Y|^q < \infty $, then $\mathbb{E}[|XY|] \le (\mathbb{E}|X|^p)^{1/p} (\mathbb{E}|Y|^q)^{1/q}.$

Cauchy–Schwarz Inequality

Let $ X $ and $ Y $ be random variables with
$ \mathbb{E}[X^2] < \infty $ and $ \mathbb{E}[Y^2] < \infty $. Then $|\mathbb{E}[XY]| \le (\mathbb{E}[X^2])^{1/2} (\mathbb{E}[Y^2])^{1/2}.$

Special Cases

Cauchy–Schwarz is Hölder with $ p = q = 2 $
Setting $ Y = 1 $ in Cauchy–Schwarz gives $\mathbb{E}|X| \le \sqrt{\mathbb{E}X^2}$

Key Uses

Bounding moments and cross terms
Establishing $ L^p \subset L^1 $ for $ p > 1 $
Proving convergence in probability and $ L^p $
Controlling martingale increments and covariance terms

Minkowski Inequality

Minkowski’s Inequality ( $ L^p $ Triangle Inequality )

Let $ X $ and $ Y $ be random variables and let $ p \ge 1 $.
If $ \mathbb{E}|X|^p < \infty $ and $ \mathbb{E}|Y|^p < \infty $, then $(\mathbb{E}|X+Y|^p)^{1/p} \le (\mathbb{E}|X|^p)^{1/p} + (\mathbb{E}|Y|^p)^{1/p}.$

Interpretation

The mapping $\|X\|_p := (\mathbb{E}|X|^p)^{1/p}$ defines a norm on $ L^p $, and Minkowski is the triangle inequality for this norm.

Key Uses

Establishes $ L^p $ as a normed vector space for $ p \ge 1 $
Controls sums of random variables in $ L^p $
Used in proofs of convergence in $ L^p $
Combined with Hölder to manipulate moments and expectations

Fatou’s Lemma

Let $ {X_n}_{n\ge1} $ be a sequence of nonnegative random variables. Then $\mathbb{E}\!\left[ \liminf_{n\to\infty} X_n \right] \le \liminf_{n\to\infty} \mathbb{E}[X_n].$

Remarks

No integrability or domination assumptions are required beyond $ X_n \ge 0 $
Inequality may be strict
Applies pointwise almost surely

Common Uses

Lower bounds on limits of expectations
Establishing integrability of limit random variables
First step toward Dominated Convergence and Monotone Convergence
Controlling limits of truncated variables

Monotone Convergence Theorem gives equality when $ X_n \uparrow X $
Dominated Convergence Theorem allows limit and expectation to be interchanged under domination

Edit this page on GitHub

Convexity and Jensen’s Inequality

Definition (Convex and Concave Functions)

Jensen’s Inequality

Concave Jensen (Reverse Inequality)

Key Uses

Markov and Chebyshev Inequalities

Markov’s Inequality

Chebyshev’s Inequality

Mean-Zero ( $ \mu = 0 $ ) Version

Relationship

Hölder and Cauchy–Schwarz Inequalities

Hölder’s Inequality

Cauchy–Schwarz Inequality

Special Cases

Key Uses

Minkowski Inequality

Minkowski’s Inequality ( $ L^p $ Triangle Inequality )

Interpretation

Key Uses

Fatou’s Lemma

Fatou’s Lemma

Remarks

Common Uses

Related Results

Comments