Edit this page on GitHub

13 — Kolmogorov Extension Theorem

Lecture 13 develops the Kolmogorov Extension Theorem, a fundamental result ensuring the existence of probability measures on infinite-dimensional product spaces (e.g., laws of stochastic processes). The lecture carefully builds the σ-algebras, the finite-dimensional distributions, the consistency property, and Carathéodory’s extension argument with compactness/diagonal subsequence technique.

The Kolmogorov Extension Theorem (1930–1940) constructs probability measures on infinite-dimensional product spaces from consistent finite-dimensional distributions.

1. Infinite Product Space and Coordinate Maps

Let $\Omega = \{\omega = (x_1, x_2, x_3, \ldots) : x_k \in \mathbb{R}\}.$

For each $n$, define the coordinate projection $\pi_n(\omega) = x_n.$

We want a probability measure $P$ on $(\Omega, \mathcal{F})$ such that the random vector $(\pi_1, \ldots, \pi_n)$ has a prescribed probability law on $\mathbb{R}^n$, for every $n$.

2. The Filtration of σ-Algebras $F_n$

For each $n$, define: $F_n = \{\omega : (x_1,\ldots,x_n) \in A,\; A \in \mathcal{B}(\mathbb{R}^n)\}.$

Properties (see page 1 diagram with the nested strips):

$F_n$ is a σ-algebra.
$F_n \subseteq F_{n+1}$ because specifying the first $n$ coordinates gives less information than specifying $n+1$.
Let: $\mathcal{A} = \bigcup_{n=1}^\infty F_n.$ Then $\mathcal{A}$ is an algebra (closed under finite unions & complements).

Important:
$\mathcal{A}$ is not a σ-algebra.
For example (page 1), $A = \left\{\omega : \sum_{k=1}^\infty x_k^2 \le 1\right\}$ is not in any $F_n$, so not in $\mathcal{A}$.

3. Finite-Dimensional Distributions and Consistency

For each $n$, suppose we are given a probability measure $\mu_n \quad \text{on } (\mathbb{R}^n, \mathcal{B}(\mathbb{R}^n)).$

Interpretation: $\mu_n(A) = \mathbb{P}\big( (X_1,\ldots,X_n)\in A\big).$

Consistency condition:
For all Borel sets $A \subseteq \mathbb{R}^n$, $\mu_{n+1}(A \times \mathbb{R}) = \mu_n(A).$

This expresses correct marginalization.

4. Define the Pre-Measure $M$ on $\mathcal{A}$

For $A \in F_n$ with $A = \{\omega : (x_1,\ldots,x_n) \in B\}, \quad B \in \mathcal{B}(\mathbb{R}^n),$ define: $M(A) = \mu_n(B).$

The consistency condition ensures this is well-defined, i.e. if the same set $A$ can be written using $F_{n+k}$, it gives the same number (page 1, “$μ$ is consistent—so $M$ agrees on overlaps” note).

5. Goal

Extend $M$ from the algebra $\mathcal{A}$ to a probability measure on: $\mathcal{F} = \sigma(\mathcal{A}).$

This is analogous to constructing Lebesgue measure from intervals (page 1 note: “we want to establish Lebesgue measure on $\mathbb{R}$”).

6. Carathéodory Condition Needed for Extension

To extend $M$ to a measure on the σ-algebra, we must show:

If $B_n \in \mathcal{A}$, $B_n \downarrow \varnothing$, then $M(B_n) \downarrow 0$.

If this holds, Carathéodory’s theorem yields the complete measure on $\mathcal{F}$.

7. Proof by Contradiction (pages 1–2)

Assume the negation:

There exists $\delta > 0$ such that: $M(B_n) \ge \delta \quad \forall n, \qquad B_n \in \mathcal{A},\; B_n\downarrow\varnothing.$

Since each $B_n\in F_n$, we can write: $B_n = \{\omega : (x_1,\ldots,x_n)\in D_n\}, \quad D_n \subseteq \mathbb{R}^n.$

Because $M(B_n)=\mu_n(D_n)\ge\delta$, the sets $D_n$ must have substantial measure.

The notes draw (page 2) rectangular regions showing that from each $D_n$ we can extract a compact subset $C_n\subseteq D_n$ such that: $\mu_n(C_n) \ge \frac{\delta}{2^n}.$

Thus: $M(B_n\setminus C_n)\le \frac{\delta}{2^n}.$

8. Diagonal Argument (page 2 diagram)

The sets $C_n$ correspond to subsets in $\mathbb{R}^n$: compact, nested in the sense $C_{n+1} \subseteq C_n \times \mathbb{R},$ when interpreted in $\Omega$.

Using the compactness of each $C_n$ and a classical diagonal subsequence argument illustrated on page 2 (the table with $a_{k,n} = a_{k 2^n}$ and diagonal extraction):

From the sequence in $\prod C_n$, one extracts a subsequence that converges to an element in
$\bigcap_{n=1}^\infty C_n.$

But since: $\bigcap B_n = \varnothing,$ we must have $\bigcap C_n = \varnothing.$

Contradiction.

Therefore the assumption is false.

Thus:

\[B_n\downarrow\varnothing \;\Longrightarrow\; M(B_n)\downarrow 0.\]

9. Conclusion: Existence of the Probability Measure

By Carathéodory:

There exists a unique probability measure $P$ on $\mathcal{F}=\sigma(\mathcal{A})$ such that:

\[P(A) = M(A), \quad A\in\mathcal{A}.\]

This measure has the prescribed finite-dimensional marginals.

This is the Kolmogorov Extension Theorem.

10. Application: Constructing a Random Variable with Lebesgue Distribution

As shown on page 3, consider i.i.d. Bernoulli($1/2$) random variables $X_1,X_2,\ldots$, taking values in ${0,1}$.

Define: $Y = \sum_{k=1}^\infty \frac{X_k}{2^k}.$

This is the binary expansion of a uniform random variable on $[0,1]$.
Thus the distribution of $Y$ is Lebesgue measure on $[0,1]$.

Using Kolmogorov Extension:

The joint law of $(X_1,\ldots,X_n)$ is product measure $(1/2)^n$.
The consistency condition holds automatically (independence).
Therefore a probability measure exists on ${0,1}^{\mathbb{N}}$ (page 3), exactly representing the infinite Bernoulli sequence.

Then $Y(\omega)=\sum X_k(\omega)/2^k$ is measurable, and: $\mathbb{P}(0\le Y<1) = 1.$

Thus Kolmogorov builds a probability space supporting the entire infinite process.

11. Final Note in the Lecture

The last line of page 3 writes:

“Dominated Convergence Theorem.”

This signals the next topic: Using DCT within infinite product constructions and for expectations of functionals of processes.