STT 996 — High Dimensional Probability
Lecture 08 — Grothendieck Identity and Max-Cut
(Source: handwritten notes :contentReference[oaicite:0]{index=0})
1. Grothendieck Identity
Let
\(g \sim \mathcal{N}(0, I_n) \in \mathbb{R}^n\)
So: \(g = (g_1, \dots, g_n), \quad g_i \overset{iid}{\sim} \mathcal{N}(0,1)\)
Let: \(u, v \in S^{n-1}, \quad \|u\| = \|v\| = 1\)
Statement (Grothendieck Identity)
\[\mathbb{E}\left[\operatorname{sign}(\langle g, u \rangle)\operatorname{sign}(\langle g, v \rangle)\right] = \frac{2}{\pi} \arcsin(\langle u, v \rangle)\]Key Idea (Missing from lecture)
This is a purely geometric statement:
- $g$ defines a random hyperplane
- $\operatorname{sign}(\langle g, x \rangle)$ tells which side of the hyperplane $x$ lies on
- So we are asking:
What is the probability that $u$ and $v$ lie on the same side of a random hyperplane?
2. Proof Strategy
Step 1: Rotational Invariance
Let $O$ be orthogonal: \(O^T O = I\)
Gaussian is invariant: \(Og \sim \mathcal{N}(0, I)\)
So we can rotate coordinates such that:
\(Ou = (1,0,\dots,0)\) \(Ov = (v_1, v_2, 0,\dots,0)\)
with: $$ v_1^2 + v_2^2 = 1
Step 2: Reduce to 2D
Then: \(\langle g, u \rangle = g_1\)
\[\langle g, v \rangle = v_1 g_1 + v_2 g_2\]So the problem reduces to: \((g_1, g_2) \sim \mathcal{N}(0, I_2)\)
Step 3: Define Random Variable
Let: \(X = \operatorname{sign}(\langle g, u \rangle)\operatorname{sign}(\langle g, v \rangle)\)
So:
- $X = +1$ if same sign
- $X = -1$ if opposite sign
Let: \(p = \mathbb{P}(X = +1), \quad q = \mathbb{P}(X = -1)\)
Then: \(\mathbb{E}[X] = p - q = 1 - 2q\)
Step 4: Geometry (Missing intuition)
Let $\alpha$ = angle between $u$ and $v$: \(\langle u, v \rangle = \cos \alpha\)
Since: \((g_1, g_2) \sim \text{rotationally symmetric}\)
We can write: \(\frac{g}{\|g\|} \sim \text{Uniform}(S^1)\)
So the direction is uniform on the circle.
Step 5: Compute Probability
Event:
- signs differ ⇔ hyperplane separates $u$ and $v$
This happens when angle falls in region of size $\alpha$.
Thus: \(q = \frac{\alpha}{\pi}\)
Step 6: Final Result
\[\mathbb{E}[X] = 1 - 2q = 1 - \frac{2\alpha}{\pi}\]Using: \(\alpha = \arccos(\langle u,v \rangle)\)
We get: \(\mathbb{E}[X] = \frac{2}{\pi}\arcsin(\langle u,v \rangle)\)
(using identity $\arcsin(x) = \frac{\pi}{2} - \arccos(x)$)
3. Connection to Optimization
We consider:
Quadratic Integer Program
\[\max \sum_{i,j} A_{ij} x_i x_j\]subject to: \(x_i \in \{-1,1\}\)
SDP Relaxation
Replace scalars with vectors: \(x_i \rightarrow u_i \in S^{n-1}\)
Problem becomes: \(\max \sum_{i,j} A_{ij} \langle u_i, u_j \rangle\)
Grothendieck Inequality
If: \(\left|\sum_{i,j} A_{ij} x_i x_j\right| \le 1 \quad \forall x_i \in \{-1,1\}\)
Then: \(\left|\sum_{i,j} A_{ij} \langle u_i, v_j \rangle\right| \le K\)
for some universal constant: \(K \le 1.783\)
4. Max-Cut Problem
Graph: \(G = (V,E)\)
Adjacency matrix: \(A_{ij} = \begin{cases} 1 & \text{if edge exists} \\ 0 & \text{otherwise} \end{cases}\)
Cut Representation
Let: \(x_i \in \{-1,1\}\)
Then: \(\text{cut}(G) = \frac{1}{2}\sum_{i<j} A_{ij}(1 - x_i x_j)\)
SDP Relaxation
Replace: \(x_i x_j \rightarrow \langle u_i, u_j \rangle\)
5. Random Hyperplane Rounding
Sample: \(g \sim \mathcal{N}(0,I)\)
Define: \(x_i = \operatorname{sign}(\langle u_i, g \rangle)\)
Key Theorem (Goemans–Williamson)
\[\mathbb{E}[\text{cut}] \ge 0.878 \cdot \text{OPT}\]Proof Sketch
Using Grothendieck identity: \(\mathbb{E}[x_i x_j] = \frac{2}{\pi}\arcsin(\langle u_i, u_j \rangle)\)
Thus: \(\mathbb{E}[\text{cut}] = \frac{1}{2}\sum A_{ij}\left(1 - \frac{2}{\pi}\arcsin(\langle u_i,u_j\rangle)\right)\)
Compare with SDP objective: \(\frac{1}{2}\sum A_{ij}(1 - \langle u_i,u_j\rangle)\)
→ gives approximation ratio $0.878$
6. Functional / Tensor Notes
If: \(X \perp Y\)
then: \(f_{X,Y}(x,y) = f_X(x) f_Y(y)\)
Tensor notation: \(f_X \otimes f_Y = f_{X,Y}\)
Tensor Example
Vectors: \(a \in \mathbb{R}^m, \quad b \in \mathbb{R}^n\)
Outer product: \(a \otimes b = \begin{bmatrix} a_i b_j \end{bmatrix}\)
Inner product: \(\langle A, B \rangle = \sum_{i,j} A_{ij} B_{ij}\)
Higher Order Tensor
\[A \in \mathbb{R}^{n \times n \times k}, \quad B \in \mathbb{R}^{n \times n \times k}\] \[\langle A, B \rangle = \sum_{i,j,\ell} A_{ij\ell} B_{ij\ell}\]Key Takeaways (What professor skipped)
- Rotation trick = Gaussian invariance
- Everything reduces to 2D geometry
- Random hyperplane = sign(g·x)
- Angle controls probability
- This identity powers SDP rounding
- Grothendieck bridges discrete ↔ continuous optimization
Mental Model (Very Important)
Think:
- vectors $u_i$ live on a sphere
- we slice space with random hyperplane
- signs = partition of vertices
- expected agreement = arcsin(inner product)
This is the entire engine behind modern approximation algorithms.
Comments