Misplaced Pages

Min-max theorem

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Variational characterization of eigenvalues of compact Hermitian operators on Hilbert spaces Not to be confused with Minimax theorem. "Variational theorem" redirects here. Not to be confused with variational principle.
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Min-max theorem" – news · newspapers · books · scholar · JSTOR (November 2011) (Learn how and when to remove this message)

In linear algebra and functional analysis, the min-max theorem, or variational theorem, or Courant–Fischer–Weyl min-max principle, is a result that gives a variational characterization of eigenvalues of compact Hermitian operators on Hilbert spaces. It can be viewed as the starting point of many results of similar nature.

This article first discusses the finite-dimensional case and its applications before considering compact operators on infinite-dimensional Hilbert spaces. We will see that for compact operators, the proof of the main theorem uses essentially the same idea from the finite-dimensional argument.

In the case that the operator is non-Hermitian, the theorem provides an equivalent characterization of the associated singular values. The min-max theorem can be extended to self-adjoint operators that are bounded below.

Matrices

Let A be a n × n Hermitian matrix. As with many other variational results on eigenvalues, one considers the Rayleigh–Ritz quotient RA : C \ {0} → R defined by

R A ( x ) = ( A x , x ) ( x , x ) {\displaystyle R_{A}(x)={\frac {(Ax,x)}{(x,x)}}}

where (⋅, ⋅) denotes the Euclidean inner product on C. Clearly, the Rayleigh quotient of an eigenvector is its associated eigenvalue. Equivalently, the Rayleigh–Ritz quotient can be replaced by

f ( x ) = ( A x , x ) , x = 1. {\displaystyle f(x)=(Ax,x),\;\|x\|=1.}

For Hermitian matrices A, the range of the continuous function RA(x), or f(x), is a compact interval of the real line. The maximum b and the minimum a are the largest and smallest eigenvalue of A, respectively. The min-max theorem is a refinement of this fact.

Min-max theorem

Let A {\textstyle A} be Hermitian on an inner product space V {\textstyle V} with dimension n {\textstyle n} , with spectrum ordered in descending order λ 1 . . . λ n {\textstyle \lambda _{1}\geq ...\geq \lambda _{n}} .

Let v 1 , . . . , v n {\textstyle v_{1},...,v_{n}} be the corresponding unit-length orthogonal eigenvectors.

Reverse the spectrum ordering, so that ξ 1 = λ n , . . . , ξ n = λ 1 {\textstyle \xi _{1}=\lambda _{n},...,\xi _{n}=\lambda _{1}} .

(Poincaré’s inequality) — Let M {\textstyle M} be a subspace of V {\textstyle V} with dimension k {\textstyle k} , then there exists unit vectors x , y M {\textstyle x,y\in M} , such that

x , A x λ k {\textstyle \langle x,Ax\rangle \leq \lambda _{k}} , and y , A y ξ k {\textstyle \langle y,Ay\rangle \geq \xi _{k}} .

Proof

Part 2 is a corollary, using A {\textstyle -A} .

M {\textstyle M} is a k {\textstyle k} dimensional subspace, so if we pick any list of n k + 1 {\textstyle n-k+1} vectors, their span N := s p a n ( v k , . . . v n ) {\textstyle N:=span(v_{k},...v_{n})} must intersect M {\textstyle M} on at least a single line.

Take unit x M N {\textstyle x\in M\cap N} . That’s what we need.

x = i = k n a i v i {\textstyle x=\sum _{i=k}^{n}a_{i}v_{i}} , since x N {\textstyle x\in N} .
Since i = k n | a i | 2 = 1 {\textstyle \sum _{i=k}^{n}|a_{i}|^{2}=1} , we find x , A x = i = k n | a i | 2 λ i λ k {\textstyle \langle x,Ax\rangle =\sum _{i=k}^{n}|a_{i}|^{2}\lambda _{i}\leq \lambda _{k}} .

min-max theorem —  λ k = max M V dim ( M ) = k min x M x = 1 x , A x = min M V dim ( M ) = n k + 1 max x M x = 1 x , A x {\displaystyle {\begin{aligned}\lambda _{k}&=\max _{\begin{array}{c}{\mathcal {M}}\subset V\\\operatorname {dim} ({\mathcal {M}})=k\end{array}}\min _{\begin{array}{c}x\in {\mathcal {M}}\\\|x\|=1\end{array}}\langle x,Ax\rangle \\&=\min _{\begin{array}{c}{\mathcal {M}}\subset V\\\operatorname {dim} ({\mathcal {M}})=n-k+1\end{array}}\max _{\begin{array}{c}x\in {\mathcal {M}}\\\|x\|=1\end{array}}\langle x,Ax\rangle {\text{. }}\end{aligned}}}

Proof

Part 2 is a corollary of part 1, by using A {\textstyle -A} .

By Poincare’s inequality, λ k {\textstyle \lambda _{k}} is an upper bound to the right side.

By setting M = s p a n ( v 1 , . . . v k ) {\textstyle {\mathcal {M}}=span(v_{1},...v_{k})} , the upper bound is achieved.

Counterexample in the non-Hermitian case

Let N be the nilpotent matrix

[ 0 1 0 0 ] . {\displaystyle {\begin{bmatrix}0&1\\0&0\end{bmatrix}}.}

Define the Rayleigh quotient R N ( x ) {\displaystyle R_{N}(x)} exactly as above in the Hermitian case. Then it is easy to see that the only eigenvalue of N is zero, while the maximum value of the Rayleigh quotient is ⁠1/2⁠. That is, the maximum value of the Rayleigh quotient is larger than the maximum eigenvalue.

Applications

Min-max principle for singular values

The singular values {σk} of a square matrix M are the square roots of the eigenvalues of M*M (equivalently MM*). An immediate consequence of the first equality in the min-max theorem is:

σ k = max S : dim ( S ) = k min x S , x = 1 ( M M x , x ) 1 2 = max S : dim ( S ) = k min x S , x = 1 M x . {\displaystyle \sigma _{k}^{\downarrow }=\max _{S:\dim(S)=k}\min _{x\in S,\|x\|=1}(M^{*}Mx,x)^{\frac {1}{2}}=\max _{S:\dim(S)=k}\min _{x\in S,\|x\|=1}\|Mx\|.}

Similarly,

σ k = min S : dim ( S ) = n k + 1 max x S , x = 1 M x . {\displaystyle \sigma _{k}^{\downarrow }=\min _{S:\dim(S)=n-k+1}\max _{x\in S,\|x\|=1}\|Mx\|.}

Here σ k {\displaystyle \sigma _{k}^{\downarrow }} denotes the k entry in the decreasing sequence of the singular values, so that σ 1 σ 2 {\displaystyle \sigma _{1}^{\downarrow }\geq \sigma _{2}^{\downarrow }\geq \cdots } .

Cauchy interlacing theorem

Main article: Poincaré separation theorem

Let A be a symmetric n × n matrix. The m × m matrix B, where mn, is called a compression of A if there exists an orthogonal projection P onto a subspace of dimension m such that PAP* = B. The Cauchy interlacing theorem states:

Theorem. If the eigenvalues of A are α1 ≤ ... ≤ αn, and those of B are β1 ≤ ... ≤ βj ≤ ... ≤ βm, then for all jm,
α j β j α n m + j . {\displaystyle \alpha _{j}\leq \beta _{j}\leq \alpha _{n-m+j}.}

This can be proven using the min-max principle. Let βi have corresponding eigenvector bi and Sj be the j dimensional subspace Sj = span{b1, ..., bj}, then

β j = max x S j , x = 1 ( B x , x ) = max x S j , x = 1 ( P A P x , x ) min S j max x S j , x = 1 ( A ( P x ) , P x ) = α j . {\displaystyle \beta _{j}=\max _{x\in S_{j},\|x\|=1}(Bx,x)=\max _{x\in S_{j},\|x\|=1}(PAP^{*}x,x)\geq \min _{S_{j}}\max _{x\in S_{j},\|x\|=1}(A(P^{*}x),P^{*}x)=\alpha _{j}.}

According to first part of min-max, αjβj. On the other hand, if we define Smj+1 = span{bj, ..., bm}, then

β j = min x S m j + 1 , x = 1 ( B x , x ) = min x S m j + 1 , x = 1 ( P A P x , x ) = min x S m j + 1 , x = 1 ( A ( P x ) , P x ) α n m + j , {\displaystyle \beta _{j}=\min _{x\in S_{m-j+1},\|x\|=1}(Bx,x)=\min _{x\in S_{m-j+1},\|x\|=1}(PAP^{*}x,x)=\min _{x\in S_{m-j+1},\|x\|=1}(A(P^{*}x),P^{*}x)\leq \alpha _{n-m+j},}

where the last inequality is given by the second part of min-max.

When nm = 1, we have αjβjαj+1, hence the name interlacing theorem.

Compact operators

Let A be a compact, Hermitian operator on a Hilbert space H. Recall that the spectrum of such an operator (the set of eigenvalues) is a set of real numbers whose only possible cluster point is zero. It is thus convenient to list the positive eigenvalues of A as

λ k λ 1 , {\displaystyle \cdots \leq \lambda _{k}\leq \cdots \leq \lambda _{1},}

where entries are repeated with multiplicity, as in the matrix case. (To emphasize that the sequence is decreasing, we may write λ k = λ k {\displaystyle \lambda _{k}=\lambda _{k}^{\downarrow }} .) When H is infinite-dimensional, the above sequence of eigenvalues is necessarily infinite. We now apply the same reasoning as in the matrix case. Letting SkH be a k dimensional subspace, we can obtain the following theorem.

Theorem (Min-Max). Let A be a compact, self-adjoint operator on a Hilbert space H, whose positive eigenvalues are listed in decreasing order ... ≤ λk ≤ ... ≤ λ1. Then:
max S k min x S k , x = 1 ( A x , x ) = λ k , min S k 1 max x S k 1 , x = 1 ( A x , x ) = λ k . {\displaystyle {\begin{aligned}\max _{S_{k}}\min _{x\in S_{k},\|x\|=1}(Ax,x)&=\lambda _{k}^{\downarrow },\\\min _{S_{k-1}}\max _{x\in S_{k-1}^{\perp },\|x\|=1}(Ax,x)&=\lambda _{k}^{\downarrow }.\end{aligned}}}

A similar pair of equalities hold for negative eigenvalues.

Proof

Let S' be the closure of the linear span S = span { u k , u k + 1 , } {\displaystyle S'=\operatorname {span} \{u_{k},u_{k+1},\ldots \}} . The subspace S' has codimension k − 1. By the same dimension count argument as in the matrix case, S' Sk has positive dimension. So there exists xS' Sk with x = 1 {\displaystyle \|x\|=1} . Since it is an element of S' , such an x necessarily satisfy

( A x , x ) λ k . {\displaystyle (Ax,x)\leq \lambda _{k}.}

Therefore, for all Sk

inf x S k , x = 1 ( A x , x ) λ k {\displaystyle \inf _{x\in S_{k},\|x\|=1}(Ax,x)\leq \lambda _{k}}

But A is compact, therefore the function f(x) = (Ax, x) is weakly continuous. Furthermore, any bounded set in H is weakly compact. This lets us replace the infimum by minimum:

min x S k , x = 1 ( A x , x ) λ k . {\displaystyle \min _{x\in S_{k},\|x\|=1}(Ax,x)\leq \lambda _{k}.}

So

sup S k min x S k , x = 1 ( A x , x ) λ k . {\displaystyle \sup _{S_{k}}\min _{x\in S_{k},\|x\|=1}(Ax,x)\leq \lambda _{k}.}

Because equality is achieved when S k = span { u 1 , , u k } {\displaystyle S_{k}=\operatorname {span} \{u_{1},\ldots ,u_{k}\}} ,

max S k min x S k , x = 1 ( A x , x ) = λ k . {\displaystyle \max _{S_{k}}\min _{x\in S_{k},\|x\|=1}(Ax,x)=\lambda _{k}.}

This is the first part of min-max theorem for compact self-adjoint operators.

Analogously, consider now a (k − 1)-dimensional subspace Sk−1, whose the orthogonal complement is denoted by Sk−1. If S' = span{u1...uk},

S S k 1 0 . {\displaystyle S'\cap S_{k-1}^{\perp }\neq {0}.}

So

x S k 1 x = 1 , ( A x , x ) λ k . {\displaystyle \exists x\in S_{k-1}^{\perp }\,\|x\|=1,(Ax,x)\geq \lambda _{k}.}

This implies

max x S k 1 , x = 1 ( A x , x ) λ k {\displaystyle \max _{x\in S_{k-1}^{\perp },\|x\|=1}(Ax,x)\geq \lambda _{k}}

where the compactness of A was applied. Index the above by the collection of k-1-dimensional subspaces gives

inf S k 1 max x S k 1 , x = 1 ( A x , x ) λ k . {\displaystyle \inf _{S_{k-1}}\max _{x\in S_{k-1}^{\perp },\|x\|=1}(Ax,x)\geq \lambda _{k}.}

Pick Sk−1 = span{u1, ..., uk−1} and we deduce

min S k 1 max x S k 1 , x = 1 ( A x , x ) = λ k . {\displaystyle \min _{S_{k-1}}\max _{x\in S_{k-1}^{\perp },\|x\|=1}(Ax,x)=\lambda _{k}.}

Self-adjoint operators

The min-max theorem also applies to (possibly unbounded) self-adjoint operators. Recall the essential spectrum is the spectrum without isolated eigenvalues of finite multiplicity. Sometimes we have some eigenvalues below the essential spectrum, and we would like to approximate the eigenvalues and eigenfunctions.

Theorem (Min-Max). Let A be self-adjoint, and let E 1 E 2 E 3 {\displaystyle E_{1}\leq E_{2}\leq E_{3}\leq \cdots } be the eigenvalues of A below the essential spectrum. Then

E n = min ψ 1 , , ψ n max { ψ , A ψ : ψ span ( ψ 1 , , ψ n ) , ψ = 1 } {\displaystyle E_{n}=\min _{\psi _{1},\ldots ,\psi _{n}}\max\{\langle \psi ,A\psi \rangle :\psi \in \operatorname {span} (\psi _{1},\ldots ,\psi _{n}),\,\|\psi \|=1\}} .

If we only have N eigenvalues and hence run out of eigenvalues, then we let E n := inf σ e s s ( A ) {\displaystyle E_{n}:=\inf \sigma _{ess}(A)} (the bottom of the essential spectrum) for n>N, and the above statement holds after replacing min-max with inf-sup.

Theorem (Max-Min). Let A be self-adjoint, and let E 1 E 2 E 3 {\displaystyle E_{1}\leq E_{2}\leq E_{3}\leq \cdots } be the eigenvalues of A below the essential spectrum. Then

E n = max ψ 1 , , ψ n 1 min { ψ , A ψ : ψ ψ 1 , , ψ n 1 , ψ = 1 } {\displaystyle E_{n}=\max _{\psi _{1},\ldots ,\psi _{n-1}}\min\{\langle \psi ,A\psi \rangle :\psi \perp \psi _{1},\ldots ,\psi _{n-1},\,\|\psi \|=1\}} .

If we only have N eigenvalues and hence run out of eigenvalues, then we let E n := inf σ e s s ( A ) {\displaystyle E_{n}:=\inf \sigma _{ess}(A)} (the bottom of the essential spectrum) for n > N, and the above statement holds after replacing max-min with sup-inf.

The proofs use the following results about self-adjoint operators:

Theorem. Let A be self-adjoint. Then ( A E ) 0 {\displaystyle (A-E)\geq 0} for E R {\displaystyle E\in \mathbb {R} } if and only if σ ( A ) [ E , ) {\displaystyle \sigma (A)\subseteq [E,\infty )} .
Theorem. If A is self-adjoint, then

inf σ ( A ) = inf ψ D ( A ) , ψ = 1 ψ , A ψ {\displaystyle \inf \sigma (A)=\inf _{\psi \in {\mathfrak {D}}(A),\|\psi \|=1}\langle \psi ,A\psi \rangle }

and

sup σ ( A ) = sup ψ D ( A ) , ψ = 1 ψ , A ψ {\displaystyle \sup \sigma (A)=\sup _{\psi \in {\mathfrak {D}}(A),\|\psi \|=1}\langle \psi ,A\psi \rangle } .

See also

References

  1. ^ G. Teschl, Mathematical Methods in Quantum Mechanics (GSM 99) https://www.mat.univie.ac.at/~gerald/ftp/book-schroe/schroe.pdf
  2. ^ Lieb; Loss (2001). Analysis. GSM. Vol. 14 (2nd ed.). Providence: American Mathematical Society. ISBN 0-8218-2783-9.

External links and citations to related work

Functional analysis (topicsglossary)
Spaces
Properties
Theorems
Operators
Algebras
Open problems
Applications
Advanced topics
Analysis in topological vector spaces
Basic concepts
Derivatives
Measurability
Integrals
Results
Related
Functional calculus
Applications
Spectral theory and -algebras
Basic concepts
Main results
Special Elements/Operators
Spectrum
Decomposition
Spectral Theorem
Special algebras
Finite-Dimensional
Generalizations
Miscellaneous
Examples
Applications
Categories: