STT 996 – High Dimensional Probability
Lecture 03 – Sub-Gaussian and Sub-Exponential Variables
Wednesday, January 21, 2026
1. Review: Equivalent Definitions of Sub-Gaussian RVs
Let $ X $ be a real-valued random variable.
There exist absolute constants $ c, k_1, k_2, k_3, k_4 > 0 $ such that \(\frac{1}{c} \le \frac{k_i}{k_j} \le c \quad \text{for all } i,j,\) and the following are equivalent:
TFAE (The Following Are Equivalent)
-
Tail bound \(\mathbb{P}(|X| > t) \le 2 e^{-t^2/k_1^2}, \quad t > 0\)
-
Moment growth \(\|X\|_{L^p} := (\mathbb{E}|X|^p)^{1/p} \le k_2 \sqrt{p}, \quad p \ge 1\)
-
Exponential square integrability \(\mathbb{E}\left[e^{X^2/k_3^2}\right] \le 2\)
If additionally $ \mathbb{E}[X] = 0 $, then these are also equivalent to:
- MGF bound \(\mathbb{E}[e^{\lambda X}] \le e^{k_4^2 \lambda^2}, \quad \lambda \in \mathbb{R}\)
2. The Sub-Gaussian (ψ₂) Norm
Define the ψ₂-norm by \(\|X\|_{\psi_2} := \inf\left\{ t > 0 : \mathbb{E}\left[e^{X^2/t^2}\right] \le 2 \right\}.\)
Key properties
- The function $ t \mapsto \mathbb{E}[e^{X^2/t^2}] $ is decreasing in $ t $.
- As $ t \to \infty $, the expectation tends to 1.
- By monotone and dominated convergence: \(\mathbb{E}\left[e^{X^2/\|X\|_{\psi_2}^2}\right] = 2.\)
3. Example: Standard Normal
Let $ Z \sim \mathcal{N}(0,1) $.
-
Moment generating function: \(\mathbb{E}[e^{tZ}] = e^{t^2/2}\)
-
ψ₂-norm: \(\|Z\|_{\psi_2} = \sqrt{8}\)
This follows from solving \(\mathbb{E}\left[e^{Z^2/t^2}\right] = 2.\)
4. Median-of-Means Estimator (Jeremy’s Presentation)
Let $ X_1, \dots, X_N $ be i.i.d. with \(\mathbb{E}[X^2] < \infty, \quad \sigma^2 = \mathrm{Var}(X).\)
Claim
There exists an estimator $ \widehat{M} $ such that \(\mathbb{P}\left(|\widehat{M} - \mu| > t \frac{\sigma}{\sqrt{N}}\right) \le 2 e^{-c t^2}, \quad 0 \le t \le 2.\)
Construction
-
Divide data into $ B $ blocks of equal length $ L $: \(L = \left\lceil \frac{4N}{t^2} \right\rceil, \quad B = \left\lfloor \frac{N}{L} \right\rfloor.\)
- Ignore leftover samples so that $ BL \le N $.
- Define block means: \(\widehat{M}_b = \frac{1}{L} \sum_{i \in \text{block } b} X_i.\)
- Final estimator: \(\widehat{M} = \mathrm{median}\{\widehat{M}_1, \dots, \widehat{M}_B\}.\)
Step 1: Control a Single Block
By Chebyshev, \(\mathbb{P}\left(|\widehat{M}_b - \mu| \ge t \frac{\sigma}{\sqrt{N}}\right) \le \frac{\sigma^2/L}{t^2\sigma^2/N} = \frac{N}{Lt^2} \le \frac{1}{4}.\)
Define event $ A_b $ as this deviation event.
Step 2: Median Argument
If at least half the block means are “bad,” the median is bad: \(\mathbb{P}(\widehat{M} \ge \mu + t\sigma/\sqrt{N}) \le \mathbb{P}\left(\sum_{b=1}^B \mathbf{1}_{A_b} \ge \frac{B}{2}\right).\)
Step 3: Generalized Hoeffding
For independent $ Y_i \in [a_i,b_i] $, \(\mathbb{P}\left(\sum (Y_i - \mathbb{E}Y_i) \ge s\right) \le \exp\left(-\frac{2s^2}{\sum (b_i - a_i)^2}\right).\)
Apply to Bernoulli indicators $ \mathbf{1}_{A_b} $: \(\mathbb{P}\left(\sum \mathbf{1}_{A_b} \ge \frac{B}{2}\right) \le e^{-B/8}.\)
Step 4: Lower Bound on Number of Blocks
Using algebra and $ t \ge 2 $, \(B \ge c t^2,\) which yields \(\mathbb{P}(|\widehat{M} - \mu| > t\sigma/\sqrt{N}) \le 2 e^{-c t^2}.\)
Instructor remark:
Despite the nice tail behavior, constants are poor and the estimator is not uniformly better than the sample mean.
5. Sum of Independent Sub-Gaussians
Theorem
Let $ X_1, \dots, X_N $ be independent, mean-zero, sub-Gaussian. Then \(\sum_{k=1}^N X_k \text{ is sub-Gaussian}, \quad \left\|\sum X_k\right\|_{\psi_2}^2 \le C \sum \|X_k\|_{\psi_2}^2.\)
Proof Sketch
Using independence: \(\mathbb{E}\left[e^{\lambda \sum X_k}\right] = \prod \mathbb{E}[e^{\lambda X_k}] \le \prod e^{C\lambda^2\|X_k\|_{\psi_2}^2} = e^{C\lambda^2\sum \|X_k\|_{\psi_2}^2}.\)
6. $ L_{\psi_2} $ Is a Normed Space
Define \(L_{\psi_2} := \{Y : \|Y\|_{\psi_2} < \infty\}.\)
Properties:
- Closed under finite sums
- Closed under scalar multiplication
Hence $ L_{\psi_2} $ is a vector space.
7. Generalized Hoeffding (Sub-Gaussian Version)
If $ X_k $ are independent, mean-zero, sub-Gaussian: $$ \mathbb{P}\left(\left|\sum X_k\right| > t\right) \le 2 \exp\left(
- c \frac{t^2}{\sum |X_k|_{\psi_2}^2} \right). $$
Weighted version: $$ \mathbb{P}\left(\left|\sum a_k X_k\right| > t\right) \le 2 \exp\left(
- c \frac{t^2}{\max_k |X_k|_{\psi_2}^2 |a|_2^2} \right). $$
8. Centering Lemma
\[\|X - \mathbb{E}X\|_{\psi_2} \le C \|X\|_{\psi_2}.\]Proof uses triangle inequality and \(|\mathbb{E}X| \le C\|X\|_{\psi_2}.\)
9. Sub-Exponential Random Variables
A RV is sub-exponential if \(\mathbb{P}(|X| > t) \le 2e^{-ct}.\)
Examples:
- $ Z^2 $ where $ Z \sim \mathcal{N}(0,1) $ (χ²)
- Exponential, Gamma, Poisson
10. ψ₁ Norm (Sub-Exponential Norm)
\[\|X\|_{\psi_1} := \inf\left\{ t > 0 : \mathbb{E}[e^{|X|/t}] \le 2 \right\}.\]Example: Exponential($\lambda$)
\(\mathbb{E}[e^{tX}] = \frac{\lambda}{\lambda - t}, \quad t < \lambda\)
Solving gives: \(\|X\|_{\psi_1} = \frac{2}{\lambda}, \quad \mathbb{E}X = \frac{1}{\lambda}.\)
11. Key Moral
- Sub-Gaussian → Gaussian-type tails
- Squaring a sub-Gaussian yields sub-exponential behavior
- ψ-norms encode tail decay cleanly
- One inequality unlocks many results
Next lecture: deeper study of sub-exponential variables and concentration inequalities.
Comments