Misplaced Pages

Banach–Alaoglu theorem

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Theorem in functional analysis

In functional analysis and related branches of mathematics, the Banach–Alaoglu theorem (also known as Alaoglu's theorem) states that the closed unit ball of the dual space of a normed vector space is compact in the weak* topology. A common proof identifies the unit ball with the weak-* topology as a closed subset of a product of compact sets with the product topology. As a consequence of Tychonoff's theorem, this product, and hence the unit ball within, is compact.

This theorem has applications in physics when one describes the set of states of an algebra of observables, namely that any state can be written as a convex linear combination of so-called pure states.

History

According to Lawrence Narici and Edward Beckenstein, the Alaoglu theorem is a “very important result—maybe the most important fact about the weak-* topology— echos throughout functional analysis.” In 1912, Helly proved that the unit ball of the continuous dual space of C ( [ a , b ] ) {\displaystyle C()} is countably weak-* compact. In 1932, Stefan Banach proved that the closed unit ball in the continuous dual space of any separable normed space is sequentially weak-* compact (Banach only considered sequential compactness). The proof for the general case was published in 1940 by the mathematician Leonidas Alaoglu. According to Pietsch , there are at least twelve mathematicians who can lay claim to this theorem or an important predecessor to it.

The Bourbaki–Alaoglu theorem is a generalization of the original theorem by Bourbaki to dual topologies on locally convex spaces. This theorem is also called the Banach–Alaoglu theorem or the weak-* compactness theorem and it is commonly called simply the Alaoglu theorem.

Statement

See also: Topological vector space § Dual space, Dual system, and Polar set

If X {\displaystyle X} is a vector space over the field K {\displaystyle \mathbb {K} } then X # {\displaystyle X^{\#}} will denote the algebraic dual space of X {\displaystyle X} and these two spaces are henceforth associated with the bilinear evaluation map , : X × X # K {\displaystyle \left\langle \cdot ,\cdot \right\rangle :X\times X^{\#}\to \mathbb {K} } defined by x , f   = def   f ( x ) {\displaystyle \left\langle x,f\right\rangle ~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~f(x)} where the triple X , X # , , {\displaystyle \left\langle X,X^{\#},\left\langle \cdot ,\cdot \right\rangle \right\rangle } forms a dual system called the canonical dual system.

If X {\displaystyle X} is a topological vector space (TVS) then its continuous dual space will be denoted by X , {\displaystyle X^{\prime },} where X X # {\displaystyle X^{\prime }\subseteq X^{\#}} always holds. Denote the weak-* topology on X # {\displaystyle X^{\#}} by σ ( X # , X ) {\displaystyle \sigma \left(X^{\#},X\right)} and denote the weak-* topology on X {\displaystyle X^{\prime }} by σ ( X , X ) . {\displaystyle \sigma \left(X^{\prime },X\right).} The weak-* topology is also called the topology of pointwise convergence because given a map f {\displaystyle f} and a net of maps f = ( f i ) i I , {\displaystyle f_{\bullet }=\left(f_{i}\right)_{i\in I},} the net f {\displaystyle f_{\bullet }} converges to f {\displaystyle f} in this topology if and only if for every point x {\displaystyle x} in the domain, the net of values ( f i ( x ) ) i I {\displaystyle \left(f_{i}(x)\right)_{i\in I}} converges to the value f ( x ) . {\displaystyle f(x).}

Alaoglu theorem — For any topological vector space (TVS) X {\displaystyle X} (not necessarily Hausdorff or locally convex) with continuous dual space X , {\displaystyle X^{\prime },} the polar U = { f X   :   sup u U | f ( u ) | 1 } {\displaystyle U^{\circ }=\left\{f\in X^{\prime }~:~\sup _{u\in U}|f(u)|\leq 1\right\}} of any neighborhood U {\displaystyle U} of origin in X {\displaystyle X} is compact in the weak-* topology σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} on X . {\displaystyle X^{\prime }.} Moreover, U {\displaystyle U^{\circ }} is equal to the polar of U {\displaystyle U} with respect to the canonical system X , X # {\displaystyle \left\langle X,X^{\#}\right\rangle } and it is also a compact subset of ( X # , σ ( X # , X ) ) . {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right).}

Proof involving duality theory

Proof

Denote by the underlying field of X {\displaystyle X} by K , {\displaystyle \mathbb {K} ,} which is either the real numbers R {\displaystyle \mathbb {R} } or complex numbers C . {\displaystyle \mathbb {C} .} This proof will use some of the basic properties that are listed in the articles: polar set, dual system, and continuous linear operator.

To start the proof, some definitions and readily verified results are recalled. When X # {\displaystyle X^{\#}} is endowed with the weak-* topology σ ( X # , X ) , {\displaystyle \sigma \left(X^{\#},X\right),} then this Hausdorff locally convex topological vector space is denoted by ( X # , σ ( X # , X ) ) . {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right).} The space ( X # , σ ( X # , X ) ) {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right)} is always a complete TVS; however, ( X , σ ( X , X ) ) {\displaystyle \left(X^{\prime },\sigma \left(X^{\prime },X\right)\right)} may fail to be a complete space, which is the reason why this proof involves the space ( X # , σ ( X # , X ) ) . {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right).} Specifically, this proof will use the fact that a subset of a complete Hausdorff space is compact if (and only if) it is closed and totally bounded. Importantly, the subspace topology that X {\displaystyle X^{\prime }} inherits from ( X # , σ ( X # , X ) ) {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right)} is equal to σ ( X , X ) . {\displaystyle \sigma \left(X^{\prime },X\right).} This can be readily verified by showing that given any f X , {\displaystyle f\in X^{\prime },} a net in X {\displaystyle X^{\prime }} converges to f {\displaystyle f} in one of these topologies if and only if it also converges to f {\displaystyle f} in the other topology (the conclusion follows because two topologies are equal if and only if they have the exact same convergent nets).

The triple X , X {\displaystyle \left\langle X,X^{\prime }\right\rangle } is a dual pairing although unlike X , X # , {\displaystyle \left\langle X,X^{\#}\right\rangle ,} it is in general not guaranteed to be a dual system. Throughout, unless stated otherwise, all polar sets will be taken with respect to the canonical pairing X , X . {\displaystyle \left\langle X,X^{\prime }\right\rangle .}

Let U {\displaystyle U} be a neighborhood of the origin in X {\displaystyle X} and let:

  • U = { f X   :   sup u U | f ( u ) | 1 } {\displaystyle U^{\circ }=\left\{f\in X^{\prime }~:~\sup _{u\in U}|f(u)|\leq 1\right\}} be the polar of U {\displaystyle U} with respect to the canonical pairing X , X {\displaystyle \left\langle X,X^{\prime }\right\rangle } ;
  • U = { x X   :   sup f U | f ( x ) | 1 } {\displaystyle U^{\circ \circ }=\left\{x\in X~:~\sup _{f\in U^{\circ }}|f(x)|\leq 1\right\}} be the bipolar of U {\displaystyle U} with respect to X , X {\displaystyle \left\langle X,X^{\prime }\right\rangle } ;
  • U # = { f X #   :   sup u U | f ( u ) | 1 } {\displaystyle U^{\#}=\left\{f\in X^{\#}~:~\sup _{u\in U}|f(u)|\leq 1\right\}} be the polar of U {\displaystyle U} with respect to the canonical dual system X , X # . {\displaystyle \left\langle X,X^{\#}\right\rangle .} Note that U = U # X . {\displaystyle U^{\circ }=U^{\#}\cap X^{\prime }.}

A well known fact about polar sets is that U U . {\displaystyle U^{\circ \circ \circ }\subseteq U^{\circ }.}

  1. Show that U # {\displaystyle U^{\#}} is a σ ( X # , X ) {\displaystyle \sigma \left(X^{\#},X\right)} -closed subset of X # : {\displaystyle X^{\#}:} Let f X # {\displaystyle f\in X^{\#}} and suppose that f = ( f i ) i I {\displaystyle f_{\bullet }=\left(f_{i}\right)_{i\in I}} is a net in U # {\displaystyle U^{\#}} that converges to f {\displaystyle f} in ( X # , σ ( X # , X ) ) . {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right).} To conclude that f U # , {\displaystyle f\in U^{\#},} it is sufficient (and necessary) to show that | f ( u ) | 1 {\displaystyle |f(u)|\leq 1} for every u U . {\displaystyle u\in U.} Because f i ( u ) f ( u ) {\displaystyle f_{i}(u)\to f(u)} in the scalar field K {\displaystyle \mathbb {K} } and every value f i ( u ) {\displaystyle f_{i}(u)} belongs to the closed (in K {\displaystyle \mathbb {K} } ) subset { s K : | s | 1 } , {\displaystyle \left\{s\in \mathbb {K} :|s|\leq 1\right\},} so too must this net's limit f ( u ) {\displaystyle f(u)} belong to this set. Thus | f ( u ) | 1. {\displaystyle |f(u)|\leq 1.}
  2. Show that U # = U {\displaystyle U^{\#}=U^{\circ }} and then conclude that U {\displaystyle U^{\circ }} is a closed subset of both ( X # , σ ( X # , X ) ) {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right)} and ( X , σ ( X , X ) ) : {\displaystyle \left(X^{\prime },\sigma \left(X^{\prime },X\right)\right):} The inclusion U U # {\displaystyle U^{\circ }\subseteq U^{\#}} holds because every continuous linear functional is (in particular) a linear functional. For the reverse inclusion U # U , {\displaystyle \,U^{\#}\subseteq U^{\circ },\,} let f U # {\displaystyle f\in U^{\#}} so that sup u U | f ( u ) | 1 , {\displaystyle \;\sup _{u\in U}|f(u)|\leq 1,\,} which states exactly that the linear functional f {\displaystyle f} is bounded on the neighborhood U {\displaystyle U} ; thus f {\displaystyle f} is a continuous linear functional (that is, f X {\displaystyle f\in X^{\prime }} ) and so f U , {\displaystyle f\in U^{\circ },} as desired. Using (1) and the fact that the intersection U # X = U X = U {\displaystyle U^{\#}\cap X^{\prime }=U^{\circ }\cap X^{\prime }=U^{\circ }} is closed in the subspace topology on X , {\displaystyle X^{\prime },} the claim about U {\displaystyle U^{\circ }} being closed follows.
  3. Show that U {\displaystyle U^{\circ }} is a σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} -totally bounded subset of X : {\displaystyle X^{\prime }:} By the bipolar theorem, U U {\displaystyle U\subseteq U^{\circ \circ }} where because the neighborhood U {\displaystyle U} is an absorbing subset of X , {\displaystyle X,} the same must be true of the set U ; {\displaystyle U^{\circ \circ };} it is possible to prove that this implies that U {\displaystyle U^{\circ }} is a σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} -bounded subset of X . {\displaystyle X^{\prime }.} Because X {\displaystyle X} distinguishes points of X , {\displaystyle X^{\prime },} a subset of X {\displaystyle X^{\prime }} is σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} -bounded if and only if it is σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} -totally bounded. So in particular, U {\displaystyle U^{\circ }} is also σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} -totally bounded.
  4. Conclude that U {\displaystyle U^{\circ }} is also a σ ( X # , X ) {\displaystyle \sigma \left(X^{\#},X\right)} -totally bounded subset of X # : {\displaystyle X^{\#}:} Recall that the σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} topology on X {\displaystyle X^{\prime }} is identical to the subspace topology that X {\displaystyle X^{\prime }} inherits from ( X # , σ ( X # , X ) ) . {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right).} This fact, together with (3) and the definition of "totally bounded", implies that U {\displaystyle U^{\circ }} is a σ ( X # , X ) {\displaystyle \sigma \left(X^{\#},X\right)} -totally bounded subset of X # . {\displaystyle X^{\#}.}
  5. Finally, deduce that U {\displaystyle U^{\circ }} is a σ ( X , X ) {\displaystyle \sigma \left(X^{\prime },X\right)} -compact subset of X : {\displaystyle X^{\prime }:} Because ( X # , σ ( X # , X ) ) {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right)} is a complete TVS and U {\displaystyle U^{\circ }} is a closed (by (2)) and totally bounded (by (4)) subset of ( X # , σ ( X # , X ) ) , {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right),} it follows that U {\displaystyle U^{\circ }} is compact. {\displaystyle \blacksquare }

If X {\displaystyle X} is a normed vector space, then the polar of a neighborhood is closed and norm-bounded in the dual space. In particular, if U {\displaystyle U} is the open (or closed) unit ball in X {\displaystyle X} then the polar of U {\displaystyle U} is the closed unit ball in the continuous dual space X {\displaystyle X^{\prime }} of X {\displaystyle X} (with the usual dual norm). Consequently, this theorem can be specialized to:

Banach–Alaoglu theorem — If X {\displaystyle X} is a normed space then the closed unit ball in the continuous dual space X {\displaystyle X^{\prime }} (endowed with its usual operator norm) is compact with respect to the weak-* topology.

When the continuous dual space X {\displaystyle X^{\prime }} of X {\displaystyle X} is an infinite dimensional normed space then it is impossible for the closed unit ball in X {\displaystyle X^{\prime }} to be a compact subset when X {\displaystyle X^{\prime }} has its usual norm topology. This is because the unit ball in the norm topology is compact if and only if the space is finite-dimensional (cf. F. Riesz theorem). This theorem is one example of the utility of having different topologies on the same vector space.

It should be cautioned that despite appearances, the Banach–Alaoglu theorem does not imply that the weak-* topology is locally compact. This is because the closed unit ball is only a neighborhood of the origin in the strong topology, but is usually not a neighborhood of the origin in the weak-* topology, as it has empty interior in the weak* topology, unless the space is finite-dimensional. In fact, it is a result of Weil that all locally compact Hausdorff topological vector spaces must be finite-dimensional.

Elementary proof

The following elementary proof does not utilize duality theory and requires only basic concepts from set theory, topology, and functional analysis. What is needed from topology is a working knowledge of net convergence in topological spaces and familiarity with the fact that a linear functional is continuous if and only if it is bounded on a neighborhood of the origin (see the articles on continuous linear functionals and sublinear functionals for details). Also required is a proper understanding of the technical details of how the space K X {\displaystyle \mathbb {K} ^{X}} of all functions of the form X K {\displaystyle X\to \mathbb {K} } is identified as the Cartesian product x X K , {\textstyle \prod _{x\in X}\mathbb {K} ,} and the relationship between pointwise convergence, the product topology, and subspace topologies they induce on subsets such as the algebraic dual space X # {\displaystyle X^{\#}} and products of subspaces such as x X B r x . {\textstyle \prod _{x\in X}B_{r_{x}}.} An explanation of these details is now given for readers who are interested.

Primer on product/function spaces, nets, and pointwise convergence

For every real r , {\displaystyle r,} B r   = def   { c K : | c | r } {\displaystyle B_{r}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\{c\in \mathbb {K} :|c|\leq r\}} will denote the closed ball of radius r {\displaystyle r} centered at 0 {\displaystyle 0} and r U   = def   { r u : u U } {\displaystyle rU~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\{ru:u\in U\}} for any U X , {\displaystyle U\subseteq X,}

Identification of functions with tuples

The Cartesian product x X K {\textstyle \prod _{x\in X}\mathbb {K} } is usually thought of as the set of all X {\displaystyle X} -indexed tuples s = ( s x ) x X {\displaystyle s_{\bullet }=\left(s_{x}\right)_{x\in X}} but, since tuples are technically just functions from an indexing set, it can also be identified with the space K X {\displaystyle \mathbb {K} ^{X}} of all functions having prototype X K , {\displaystyle X\to \mathbb {K} ,} as is now described:

  • Function {\displaystyle \to } Tuple: A function s : X K {\displaystyle s:X\to \mathbb {K} } belonging to K X {\displaystyle \mathbb {K} ^{X}} is identified with its ( X {\displaystyle X} -indexed) "tuple of values" s   = def   ( s ( x ) ) x X . {\displaystyle s_{\bullet }~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~(s(x))_{x\in X}.}
  • Tuple {\displaystyle \to } Function: A tuple s = ( s x ) x X {\displaystyle s_{\bullet }=\left(s_{x}\right)_{x\in X}} in x X K {\textstyle \prod _{x\in X}\mathbb {K} } is identified with the function s : X K {\displaystyle s:X\to \mathbb {K} } defined by s ( x )   = def   s x {\displaystyle s(x)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~s_{x}} ; this function's "tuple of values" is the original tuple ( s x ) x X . {\displaystyle \left(s_{x}\right)_{x\in X}.}

This is the reason why many authors write, often without comment, the equality K X = x X K {\displaystyle \mathbb {K} ^{X}=\prod _{x\in X}\mathbb {K} } and why the Cartesian product x X K {\textstyle \prod _{x\in X}\mathbb {K} } is sometimes taken as the definition of the set of maps K X {\displaystyle \mathbb {K} ^{X}} (or conversely). However, the Cartesian product, being the (categorical) product in the category of sets (which is a type of inverse limit), also comes equipped with associated maps that are known as its (coordinate) projections.

The canonical projection of the Cartesian product at a given point z X {\displaystyle z\in X} is the function Pr z : x X K K  defined by  s = ( s x ) x X s z {\displaystyle \Pr {}_{z}:\prod _{x\in X}\mathbb {K} \to \mathbb {K} \quad {\text{ defined by }}\quad s_{\bullet }=\left(s_{x}\right)_{x\in X}\mapsto s_{z}} where under the above identification, Pr z {\displaystyle \Pr {}_{z}} sends a function s : X K {\displaystyle s:X\to \mathbb {K} } to Pr z ( s )   = def   s ( z ) . {\displaystyle \Pr {}_{z}(s)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~s(z).} Stated in words, for a point z {\displaystyle z} and function s , {\displaystyle s,} "plugging z {\displaystyle z} into s {\displaystyle s} " is the same as "plugging s {\displaystyle s} into Pr z {\displaystyle \Pr {}_{z}} ".

In particular, suppose that ( r x ) x X {\displaystyle \left(r_{x}\right)_{x\in X}} are non-negative real numbers. Then x X B r x x X K = K X , {\displaystyle \prod _{x\in X}B_{r_{x}}\subseteq \prod _{x\in X}\mathbb {K} =\mathbb {K} ^{X},} where under the above identification of tuples with functions, x X B r x {\displaystyle \prod _{x\in X}B_{r_{x}}} is the set of all functions s K X {\displaystyle s\in \mathbb {K} ^{X}} such that s ( x ) B r x {\displaystyle s(x)\in B_{r_{x}}} for every x X . {\displaystyle x\in X.}

If a subset U X {\displaystyle U\subseteq X} partitions X {\displaystyle X} into X = U ( X U ) {\displaystyle X=U\,\cup \,(X\setminus U)} then the linear bijection H : x X K ( u U K ) × x X U K ( f x ) x X ( ( f u ) u U , ( f x ) x X U ) {\displaystyle {\begin{alignedat}{4}H:\;&&\prod _{x\in X}\mathbb {K} &&\;\to \;&\left(\prod _{u\in U}\mathbb {K} \right)\times \prod _{x\in X\setminus U}\mathbb {K} \\&&\left(f_{x}\right)_{x\in X}&&\;\mapsto \;&\left(\left(f_{u}\right)_{u\in U},\;\left(f_{x}\right)_{x\in X\setminus U}\right)\\\end{alignedat}}} canonically identifies these two Cartesian products; moreover, this map is a homeomorphism when these products are endowed with their product topologies. In terms of function spaces, this bijection could be expressed as H : K X K U × K X U f ( f | U , f | X U ) . {\displaystyle {\begin{alignedat}{4}H:\;&&\mathbb {K} ^{X}&&\;\to \;&\mathbb {K} ^{U}\times \mathbb {K} ^{X\setminus U}\\&&f&&\;\mapsto \;&\left(f{\big \vert }_{U},\;f{\big \vert }_{X\setminus U}\right)\\\end{alignedat}}.}

Notation for nets and function composition with nets

A net x = ( x i ) i I {\displaystyle x_{\bullet }=\left(x_{i}\right)_{i\in I}} in X {\displaystyle X} is by definition a function x : I X {\displaystyle x_{\bullet }:I\to X} from a non-empty directed set ( I , ) . {\displaystyle (I,\leq ).} Every sequence in X , {\displaystyle X,} which by definition is just a function of the form N X , {\displaystyle \mathbb {N} \to X,} is also a net. As with sequences, the value of a net x {\displaystyle x_{\bullet }} at an index i I {\displaystyle i\in I} is denoted by x i {\displaystyle x_{i}} ; however, for this proof, this value x i {\displaystyle x_{i}} may also be denoted by the usual function parentheses notation x ( i ) . {\displaystyle x_{\bullet }(i).} Similarly for function composition, if F : X Y {\displaystyle F:X\to Y} is any function then the net (or sequence) that results from "plugging x {\displaystyle x_{\bullet }} into F {\displaystyle F} " is just the function F x : I Y , {\displaystyle F\circ x_{\bullet }:I\to Y,} although this is typically denoted by ( F ( x i ) ) i I {\displaystyle \left(F\left(x_{i}\right)\right)_{i\in I}} (or by ( F ( x i ) ) i = 1 {\displaystyle \left(F\left(x_{i}\right)\right)_{i=1}^{\infty }} if x {\displaystyle x_{\bullet }} is a sequence). In the proofs below, this resulting net may be denoted by any of the following notations F ( x ) = ( F ( x i ) ) i I   = def   F x , {\displaystyle F\left(x_{\bullet }\right)=\left(F\left(x_{i}\right)\right)_{i\in I}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~F\circ x_{\bullet },} depending on whichever notation is cleanest or most clearly communicates the intended information. In particular, if F : X Y {\displaystyle F:X\to Y} is continuous and x x {\displaystyle x_{\bullet }\to x} in X , {\displaystyle X,} then the conclusion commonly written as ( F ( x i ) ) i I F ( x ) {\displaystyle \left(F\left(x_{i}\right)\right)_{i\in I}\to F(x)} may instead be written as F ( x ) F ( x ) {\displaystyle F\left(x_{\bullet }\right)\to F(x)} or F x F ( x ) . {\displaystyle F\circ x_{\bullet }\to F(x).}

Topology

The set K X = x X K {\textstyle \mathbb {K} ^{X}=\prod _{x\in X}\mathbb {K} } is assumed to be endowed with the product topology. It is well known that the product topology is identical to the topology of pointwise convergence. This is because given f {\displaystyle f} and a net ( f i ) i I , {\displaystyle \left(f_{i}\right)_{i\in I},} where f {\displaystyle f} and every f i {\displaystyle f_{i}} is an element of K X = x X K , {\textstyle \mathbb {K} ^{X}=\prod _{x\in X}\mathbb {K} ,} then the net ( f i ) i I f {\displaystyle \left(f_{i}\right)_{i\in I}\to f} converges in the product topology if and only if

for every z X , {\displaystyle z\in X,} the net Pr z ( ( f i ) i I ) Pr z ( f ) {\displaystyle \Pr {}_{z}\left(\left(f_{i}\right)_{i\in I}\right)\to \Pr {}_{z}(f)} converges in K , {\displaystyle \mathbb {K} ,}

where because Pr z ( f ) = f ( z ) {\displaystyle \;\Pr {}_{z}(f)=f(z)\;} and Pr z ( ( f i ) i I )   = def   ( Pr z ( f i ) ) i I = ( f i ( z ) ) i I , {\textstyle \Pr {}_{z}\left(\left(f_{i}\right)_{i\in I}\right)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left(\Pr {}_{z}\left(f_{i}\right)\right)_{i\in I}=\left(f_{i}(z)\right)_{i\in I},} this happens if and only if

for every z X , {\displaystyle z\in X,} the net ( f i ( z ) ) i I f ( z ) {\displaystyle \left(f_{i}(z)\right)_{i\in I}\to f(z)} converges in K , {\displaystyle \mathbb {K} ,}

Thus ( f i ) i I {\displaystyle \left(f_{i}\right)_{i\in I}} converges to f {\displaystyle f} in the product topology if and only if it converges to f {\displaystyle f} pointwise on X . {\displaystyle X.}

This proof will also use the fact that the topology of pointwise convergence is preserved when passing to topological subspaces. This means, for example, that if for every x X , {\displaystyle x\in X,} S x K {\displaystyle S_{x}\subseteq \mathbb {K} } is some (topological) subspace of K {\displaystyle \mathbb {K} } then the topology of pointwise convergence (or equivalently, the product topology) on x X S x {\textstyle \prod _{x\in X}S_{x}} is equal to the subspace topology that the set x X S x {\textstyle \prod _{x\in X}S_{x}} inherits from x X K . {\textstyle \prod _{x\in X}\mathbb {K} .} And if S x {\displaystyle S_{x}} is closed in K {\displaystyle \mathbb {K} } for every x X , {\displaystyle x\in X,} then x X S x {\textstyle \prod _{x\in X}S_{x}} is a closed subset of x X K . {\textstyle \prod _{x\in X}\mathbb {K} .}

Characterization of sup u U | f ( u ) | r {\displaystyle \sup _{u\in U}|f(u)|\leq r}

An important fact used by the proof is that for any real r , {\displaystyle r,} sup u U | f ( u ) | r  if and only if  f ( U ) B r {\displaystyle \sup _{u\in U}|f(u)|\leq r\qquad {\text{ if and only if }}\qquad f(U)\subseteq B_{r}} where sup {\displaystyle \,\sup \,} denotes the supremum and f ( U )   = def   { f ( u ) : u U } . {\displaystyle f(U)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\{f(u):u\in U\}.} As a side note, this characterization does not hold if the closed ball B r {\displaystyle B_{r}} is replaced with the open ball { c K : | c | < r } {\displaystyle \{c\in \mathbb {K} :|c|<r\}} (and replacing sup u U | f ( u ) | r {\displaystyle \;\sup _{u\in U}|f(u)|\leq r\;} with the strict inequality sup u U | f ( u ) | < r {\displaystyle \;\sup _{u\in U}|f(u)|<r\;} will not change this; for counter-examples, consider X   = def   K {\displaystyle X~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\mathbb {K} } and the identity map f   = def   Id {\displaystyle f~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\operatorname {Id} } on X {\displaystyle X} ).

The essence of the Banach–Alaoglu theorem can be found in the next proposition, from which the Banach–Alaoglu theorem follows. Unlike the Banach–Alaoglu theorem, this proposition does not require the vector space X {\displaystyle X} to endowed with any topology.

Proposition — Let U {\displaystyle U} be a subset of a vector space X {\displaystyle X} over the field K {\displaystyle \mathbb {K} } (where K = R  or  K = C {\displaystyle \mathbb {K} =\mathbb {R} {\text{ or }}\mathbb {K} =\mathbb {C} } ) and for every real number r , {\displaystyle r,} endow the closed ball B r   = def   { s K : | s | r } {\textstyle B_{r}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\{s\in \mathbb {K} :|s|\leq r\}} with its usual topology ( X {\displaystyle X} need not be endowed with any topology, but K {\displaystyle \mathbb {K} } has its usual Euclidean topology). Define U #   = def   { f X #   :   sup u U | f ( u ) | 1 } . {\displaystyle U^{\#}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~{\Big \{}f\in X^{\#}~:~\sup _{u\in U}|f(u)|\leq 1{\Big \}}.}

If for every x X , {\displaystyle x\in X,} r x > 0 {\displaystyle r_{x}>0} is a real number such that x r x U , {\displaystyle x\in r_{x}U,} then U # {\displaystyle U^{\#}} is a closed and compact subspace of the product space x X B r x {\displaystyle \prod _{x\in X}B_{r_{x}}} (where because this product topology is identical to the topology of pointwise convergence, which is also called the weak-* topology in functional analysis, this means that U # {\displaystyle U^{\#}} is compact in the weak-* topology or "weak-* compact" for short).

Before proving the proposition above, it is first shown how the Banach–Alaoglu theorem follows from it (unlike the proposition, Banach–Alaoglu assumes that X {\displaystyle X} is a topological vector space (TVS) and that U {\displaystyle U} is a neighborhood of the origin).

Proof that Banach–Alaoglu follows from the proposition above

Assume that X {\displaystyle X} is a topological vector space with continuous dual space X {\displaystyle X^{\prime }} and that U {\displaystyle U} is a neighborhood of the origin. Because U {\displaystyle U} is a neighborhood of the origin in X , {\displaystyle X,} it is also an absorbing subset of X , {\displaystyle X,} so for every x X , {\displaystyle x\in X,} there exists a real number r x > 0 {\displaystyle r_{x}>0} such that x r x U . {\displaystyle x\in r_{x}U.} Thus the hypotheses of the above proposition are satisfied, and so the set U # {\displaystyle U^{\#}} is therefore compact in the weak-* topology. The proof of the Banach–Alaoglu theorem will be complete once it is shown that U # = U , {\displaystyle U^{\#}=U^{\circ },} where recall that U {\displaystyle U^{\circ }} was defined as U   = def   { f X   :   sup u U | f ( u ) | 1 }   =   U # X . {\displaystyle U^{\circ }~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~{\Big \{}f\in X^{\prime }~:~\sup _{u\in U}|f(u)|\leq 1{\Big \}}~=~U^{\#}\cap X^{\prime }.}

Proof that U = U # : {\displaystyle U^{\circ }=U^{\#}:} Because U = U # X , {\displaystyle U^{\circ }=U^{\#}\cap X^{\prime },} the conclusion is equivalent to U # X . {\displaystyle U^{\#}\subseteq X^{\prime }.} If f U # {\displaystyle f\in U^{\#}} then sup u U | f ( u ) | 1 , {\displaystyle \;\sup _{u\in U}|f(u)|\leq 1,\,} which states exactly that the linear functional f {\displaystyle f} is bounded on the neighborhood U ; {\displaystyle U;} thus f {\displaystyle f} is a continuous linear functional (that is, f X {\displaystyle f\in X^{\prime }} ), as desired. {\displaystyle \blacksquare }

Proof of Proposition

The product space x X B r x {\textstyle \prod _{x\in X}B_{r_{x}}} is compact by Tychonoff's theorem (since each closed ball B r x   = def   { s K : | s | r x } {\displaystyle B_{r_{x}}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\{s\in \mathbb {K} :|s|\leq r_{x}\}} is a Hausdorff compact space). Because a closed subset of a compact space is compact, the proof of the proposition will be complete once it is shown that U #   = def   { f X #   :   sup u U | f ( u ) | 1 }   =   { f X #   :   f ( U ) B 1 } {\displaystyle U^{\#}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~{\Big \{}f\in X^{\#}~:~\sup _{u\in U}|f(u)|\leq 1{\Big \}}~=~\left\{f\in X^{\#}~:~f(U)\subseteq B_{1}\right\}} is a closed subset of x X B r x . {\textstyle \prod _{x\in X}B_{r_{x}}.} The following statements guarantee this conclusion:

  1. U # x X B r x . {\displaystyle U^{\#}\subseteq \prod _{x\in X}B_{r_{x}}.}
  2. U # {\displaystyle U^{\#}} is a closed subset of the product space x X K = K X . {\displaystyle \prod _{x\in X}\mathbb {K} =\mathbb {K} ^{X}.}

Proof of (1):

For any z X , {\displaystyle z\in X,} let Pr z : x X K K {\textstyle \Pr {}_{z}:\prod _{x\in X}\mathbb {K} \to \mathbb {K} } denote the projection to the z {\displaystyle z} coordinate (as defined above). To prove that U # x X B r x , {\textstyle U^{\#}\subseteq \prod _{x\in X}B_{r_{x}},} it is sufficient (and necessary) to show that Pr x ( U # ) B r x {\displaystyle \Pr {}_{x}\left(U^{\#}\right)\subseteq B_{r_{x}}} for every x X . {\displaystyle x\in X.} So fix x X {\displaystyle x\in X} and let f U # . {\displaystyle f\in U^{\#}.} Because Pr x ( f ) = f ( x ) , {\displaystyle \Pr {}_{x}(f)\,=\,f(x),} it remains to show that f ( x ) B r x . {\displaystyle f(x)\in B_{r_{x}}.} Recall that r x > 0 {\displaystyle r_{x}>0} was defined in the proposition's statement as being any positive real number that satisfies x r x U {\displaystyle x\in r_{x}U} (so for example, r u := 1 {\displaystyle r_{u}:=1} would be a valid choice for each u U {\displaystyle u\in U} ), which implies u x   = def   1 r x x U . {\displaystyle \,u_{x}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~{\frac {1}{r_{x}}}\,x\in U.\,} Because f {\displaystyle f} is a positive homogeneous function that satisfies sup u U | f ( u ) | 1 , {\displaystyle \;\sup _{u\in U}|f(u)|\leq 1,\,} 1 r x | f ( x ) | = | 1 r x f ( x ) | = | f ( 1 r x x ) | = | f ( u x ) | sup u U | f ( u ) | 1. {\displaystyle {\frac {1}{r_{x}}}|f(x)|=\left|{\frac {1}{r_{x}}}f(x)\right|=\left|f\left({\frac {1}{r_{x}}}x\right)\right|=\left|f\left(u_{x}\right)\right|\leq \sup _{u\in U}|f(u)|\leq 1.}

Thus | f ( x ) | r x , {\displaystyle |f(x)|\leq r_{x},} which shows that f ( x ) B r x , {\displaystyle f(x)\in B_{r_{x}},} as desired.

Proof of (2):

The algebraic dual space X # {\displaystyle X^{\#}} is always a closed subset of K X = x X K {\textstyle \mathbb {K} ^{X}=\prod _{x\in X}\mathbb {K} } (this is proved in the lemma below for readers who are not familiar with this result). The set U B 1 = def {                 f   K X         : sup u U | f ( u ) | 1 } = {                 f K X         : f ( u ) B 1  for all  u U } = { ( f x ) x X x X K   :     f u   B 1  for all  u U } = x X C x  where  C x   = def   { B 1  if  x U K  if  x U {\displaystyle {\begin{alignedat}{9}U_{B_{1}}&\,{\stackrel {\scriptscriptstyle {\text{def}}}{=}}\,{\Big \{}~~\;~~\;~~\;~~f\ \in \mathbb {K} ^{X}~~\;~~:\sup _{u\in U}|f(u)|\leq 1{\Big \}}\\&={\big \{}~~\;~~\;~~\;~~f\,\in \mathbb {K} ^{X}~~\;~~:f(u)\in B_{1}{\text{ for all }}u\in U{\big \}}\\&={\Big \{}\left(f_{x}\right)_{x\in X}\in \prod _{x\in X}\mathbb {K} \,~:~\;~f_{u}~\in B_{1}{\text{ for all }}u\in U{\Big \}}\\&=\prod _{x\in X}C_{x}\quad {\text{ where }}\quad C_{x}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~{\begin{cases}B_{1}&{\text{ if }}x\in U\\\mathbb {K} &{\text{ if }}x\not \in U\\\end{cases}}\\\end{alignedat}}} is closed in the product topology on x X K = K X {\displaystyle \prod _{x\in X}\mathbb {K} =\mathbb {K} ^{X}} since it is a product of closed subsets of K . {\displaystyle \mathbb {K} .} Thus U B 1 X # = U # {\displaystyle U_{B_{1}}\cap X^{\#}=U^{\#}} is an intersection of two closed subsets of K X , {\displaystyle \mathbb {K} ^{X},} which proves (2). {\displaystyle \blacksquare }

The conclusion that the set U B 1 = { f K X : f ( U ) B 1 } {\displaystyle U_{B_{1}}=\left\{f\in \mathbb {K} ^{X}:f(U)\subseteq B_{1}\right\}} is closed can also be reached by applying the following more general result, this time proved using nets, to the special case Y := K {\displaystyle Y:=\mathbb {K} } and B := B 1 . {\displaystyle B:=B_{1}.}

Observation: If U X {\displaystyle U\subseteq X} is any set and if B Y {\displaystyle B\subseteq Y} is a closed subset of a topological space Y , {\displaystyle Y,} then U B   = def   { f Y X : f ( U ) B } {\displaystyle U_{B}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left\{f\in Y^{X}:f(U)\subseteq B\right\}} is a closed subset of Y X {\displaystyle Y^{X}} in the topology of pointwise convergence.
Proof of observation: Let f Y X {\displaystyle f\in Y^{X}} and suppose that ( f i ) i I {\displaystyle \left(f_{i}\right)_{i\in I}} is a net in U B {\displaystyle U_{B}} that converges pointwise to f . {\displaystyle f.} It remains to show that f U B , {\displaystyle f\in U_{B},} which by definition means f ( U ) B . {\displaystyle f(U)\subseteq B.} For any u U , {\displaystyle u\in U,} because ( f i ( u ) ) i I f ( u ) {\displaystyle \left(f_{i}(u)\right)_{i\in I}\to f(u)} in Y {\displaystyle Y} and every value f i ( u ) f i ( U ) B {\displaystyle f_{i}(u)\in f_{i}(U)\subseteq B} belongs to the closed (in Y {\displaystyle Y} ) subset B , {\displaystyle B,} so too must this net's limit belong to this closed set; thus f ( u ) B , {\displaystyle f(u)\in B,} which completes the proof. {\displaystyle \blacksquare }

Lemma ( X # {\displaystyle X^{\#}} is closed in K X {\displaystyle \mathbb {K} ^{X}} ) — The algebraic dual space X # {\displaystyle X^{\#}} of any vector space X {\displaystyle X} over a field K {\displaystyle \mathbb {K} } (where K {\displaystyle \mathbb {K} } is R {\displaystyle \mathbb {R} } or C {\displaystyle \mathbb {C} } ) is a closed subset of K X = x X K {\textstyle \mathbb {K} ^{X}=\prod _{x\in X}\mathbb {K} } in the topology of pointwise convergence. (The vector space X {\displaystyle X} need not be endowed with any topology).

Proof of lemma

Let f K X {\displaystyle f\in \mathbb {K} ^{X}} and suppose that f = ( f i ) i I {\displaystyle f_{\bullet }=\left(f_{i}\right)_{i\in I}} is a net in X # {\displaystyle X^{\#}} the converges to f {\displaystyle f} in K X . {\displaystyle \mathbb {K} ^{X}.} To conclude that f X # , {\displaystyle f\in X^{\#},} it must be shown that f {\displaystyle f} is a linear functional. So let s {\displaystyle s} be a scalar and let x , y X . {\displaystyle x,y\in X.}

For any z X , {\displaystyle z\in X,} let f ( z ) : I K {\displaystyle f_{\bullet }(z):I\to \mathbb {K} } denote f {\displaystyle f_{\bullet }} 's net of values at z {\displaystyle z} f ( z )   = def   ( f i ( z ) ) i I . {\displaystyle f_{\bullet }(z)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left(f_{i}(z)\right)_{i\in I}.} Because f f {\displaystyle f_{\bullet }\to f} in K X , {\displaystyle \mathbb {K} ^{X},} which has the topology of pointwise convergence, f ( z ) f ( z ) {\displaystyle f_{\bullet }(z)\to f(z)} in K {\displaystyle \mathbb {K} } for every z X . {\displaystyle z\in X.} By using x , y , s x ,  and  x + y , {\displaystyle x,y,sx,{\text{ and }}x+y,} in place of z , {\displaystyle z,} it follows that each of the following nets of scalars converges in K : {\displaystyle \mathbb {K} :} f ( x ) f ( x ) , f ( y ) f ( y ) , f ( x + y ) f ( x + y ) ,  and  f ( s x ) f ( s x ) . {\displaystyle f_{\bullet }(x)\to f(x),\quad f_{\bullet }(y)\to f(y),\quad f_{\bullet }(x+y)\to f(x+y),\quad {\text{ and }}\quad f_{\bullet }(sx)\to f(sx).}


Proof that f ( s x ) = s f ( x ) : {\displaystyle f(sx)=sf(x):} Let M : K K {\displaystyle M:\mathbb {K} \to \mathbb {K} } be the "multiplication by s {\displaystyle s} " map defined by M ( c )   = def   s c . {\displaystyle M(c)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~sc.} Because M {\displaystyle M} is continuous and f ( x ) f ( x ) {\displaystyle f_{\bullet }(x)\to f(x)} in K , {\displaystyle \mathbb {K} ,} it follows that M ( f ( x ) ) M ( f ( x ) ) {\displaystyle M\left(f_{\bullet }(x)\right)\to M(f(x))} where the right hand side is M ( f ( x ) ) = s f ( x ) {\displaystyle M(f(x))=sf(x)} and the left hand side is M ( f ( x ) ) = def   M f ( x )  by definition of notation  =   ( M ( f i ( x ) ) ) i I        because  f ( x ) = ( f i ( x ) ) i I : I K =   ( s f i ( x ) ) i I M ( f i ( x ) )   = def   s f i ( x ) =   ( f i ( s x ) ) i I  by linearity of  f i =   f ( s x )  notation  {\displaystyle {\begin{alignedat}{4}M\left(f_{\bullet }(x)\right){\stackrel {\scriptscriptstyle {\text{def}}}{=}}&~M\circ f_{\bullet }(x)&&{\text{ by definition of notation }}\\=&~\left(M\left(f_{i}(x)\right)\right)_{i\in I}~~~&&{\text{ because }}f_{\bullet }(x)=\left(f_{i}(x)\right)_{i\in I}:I\to \mathbb {K} \\=&~\left(sf_{i}(x)\right)_{i\in I}&&M\left(f_{i}(x)\right)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~sf_{i}(x)\\=&~\left(f_{i}(sx)\right)_{i\in I}&&{\text{ by linearity of }}f_{i}\\=&~f_{\bullet }(sx)&&{\text{ notation }}\end{alignedat}}} which proves that f ( s x ) s f ( x ) . {\displaystyle f_{\bullet }(sx)\to sf(x).} Because also f ( s x ) f ( s x ) {\displaystyle f_{\bullet }(sx)\to f(sx)} and limits in K {\displaystyle \mathbb {K} } are unique, it follows that s f ( x ) = f ( s x ) , {\displaystyle sf(x)=f(sx),} as desired.


Proof that f ( x + y ) = f ( x ) + f ( y ) : {\displaystyle f(x+y)=f(x)+f(y):} Define a net z = ( z i ) i I : I K × K {\displaystyle z_{\bullet }=\left(z_{i}\right)_{i\in I}:I\to \mathbb {K} \times \mathbb {K} } by letting z i   = def   ( f i ( x ) , f i ( y ) ) {\displaystyle z_{i}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left(f_{i}(x),f_{i}(y)\right)} for every i I . {\displaystyle i\in I.} Because f ( x ) = ( f i ( x ) ) i I f ( x ) {\displaystyle f_{\bullet }(x)=\left(f_{i}(x)\right)_{i\in I}\to f(x)} and f ( y ) = ( f i ( y ) ) i I f ( y ) , {\displaystyle f_{\bullet }(y)=\left(f_{i}(y)\right)_{i\in I}\to f(y),} it follows that z ( f ( x ) , f ( y ) ) {\displaystyle z_{\bullet }\to (f(x),f(y))} in K × K . {\displaystyle \mathbb {K} \times \mathbb {K} .} Let A : K × K K {\displaystyle A:\mathbb {K} \times \mathbb {K} \to \mathbb {K} } be the addition map defined by A ( x , y )   = def   x + y . {\displaystyle A(x,y)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~x+y.} The continuity of A {\displaystyle A} implies that A ( z ) A ( f ( x ) , f ( y ) ) {\displaystyle A\left(z_{\bullet }\right)\to A(f(x),f(y))} in K {\displaystyle \mathbb {K} } where the right hand side is A ( f ( x ) , f ( y ) ) = f ( x ) + f ( y ) {\displaystyle A(f(x),f(y))=f(x)+f(y)} and the left hand side is A ( z )   = def   A z = ( A ( z i ) ) i I = ( A ( f i ( x ) , f i ( y ) ) ) i I = ( f i ( x ) + f i ( y ) ) i I = ( f i ( x + y ) ) i I = f ( x + y ) {\displaystyle A\left(z_{\bullet }\right)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~A\circ z_{\bullet }=\left(A\left(z_{i}\right)\right)_{i\in I}=\left(A\left(f_{i}(x),f_{i}(y)\right)\right)_{i\in I}=\left(f_{i}(x)+f_{i}(y)\right)_{i\in I}=\left(f_{i}(x+y)\right)_{i\in I}=f_{\bullet }(x+y)} which proves that f ( x + y ) f ( x ) + f ( y ) . {\displaystyle f_{\bullet }(x+y)\to f(x)+f(y).} Because also f ( x + y ) f ( x + y ) , {\displaystyle f_{\bullet }(x+y)\to f(x+y),} it follows that f ( x + y ) = f ( x ) + f ( y ) , {\displaystyle f(x+y)=f(x)+f(y),} as desired. {\displaystyle \blacksquare }

The lemma above actually also follows from its corollary below since x X K {\displaystyle \prod _{x\in X}\mathbb {K} } is a Hausdorff complete uniform space and any subset of such a space (in particular X # {\displaystyle X^{\#}} ) is closed if and only if it is complete.

Corollary to lemma ( X # {\displaystyle X^{\#}} is weak-* complete) — When the algebraic dual space X # {\displaystyle X^{\#}} of a vector space X {\displaystyle X} is equipped with the topology σ ( X # , X ) {\displaystyle \sigma \left(X^{\#},X\right)} of pointwise convergence (also known as the weak-* topology) then the resulting topological space ( X # , σ ( X # , X ) ) {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right)} is a complete Hausdorff locally convex topological vector space.

Proof of corollary to lemma

Because the underlying field K {\displaystyle \mathbb {K} } is a complete Hausdorff locally convex topological vector space, the same is true of the product space K X = x X K . {\textstyle \mathbb {K} ^{X}=\prod _{x\in X}\mathbb {K} .} A closed subset of a complete space is complete, so by the lemma, the space ( X # , σ ( X # , X ) ) {\displaystyle \left(X^{\#},\sigma \left(X^{\#},X\right)\right)} is complete. {\displaystyle \blacksquare }


The above elementary proof of the Banach–Alaoglu theorem actually shows that if U X {\displaystyle U\subseteq X} is any subset that satisfies X = ( 0 , ) U   = def   { r u : r > 0 , u U } {\displaystyle X=(0,\infty )U~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\{ru:r>0,u\in U\}} (such as any absorbing subset of X {\displaystyle X} ), then U #   = def   { f X # : f ( U ) B 1 } {\displaystyle U^{\#}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left\{f\in X^{\#}:f(U)\subseteq B_{1}\right\}} is a weak-* compact subset of X # . {\displaystyle X^{\#}.}

As a side note, with the help of the above elementary proof, it may be shown (see this footnote) that there exist X {\displaystyle X} -indexed non-negative real numbers m = ( m x ) x X {\displaystyle m_{\bullet }=\left(m_{x}\right)_{x\in X}} such that U = U # = X # x X B m x = X x X B m x {\displaystyle {\begin{alignedat}{4}U^{\circ }&=U^{\#}&&\\&=X^{\#}&&\cap \prod _{x\in X}B_{m_{x}}\\&=X^{\prime }&&\cap \prod _{x\in X}B_{m_{x}}\\\end{alignedat}}} where these real numbers m {\displaystyle m_{\bullet }} can also be chosen to be "minimal" in the following sense: using P   = def   U {\displaystyle P~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~U^{\circ }} (so P = U # {\displaystyle P=U^{\#}} as in the proof) and defining the notation B R   = def   x X B R x {\displaystyle \prod B_{R_{\bullet }}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\prod _{x\in X}B_{R_{x}}} for any R = ( R x ) x X R X , {\displaystyle R_{\bullet }=\left(R_{x}\right)_{x\in X}\in \mathbb {R} ^{X},} if T P   = def   { R R X   :   P B R } {\displaystyle T_{P}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left\{R_{\bullet }\in \mathbb {R} ^{X}~:~P\subseteq \prod B_{R_{\bullet }}\right\}} then m T P {\displaystyle m_{\bullet }\in T_{P}} and for every x X , {\displaystyle x\in X,} m x = inf { R x : R T P } , {\displaystyle m_{x}=\inf \left\{R_{x}:R_{\bullet }\in T_{P}\right\},} which shows that these numbers m {\displaystyle m_{\bullet }} are unique; indeed, this infimum formula can be used to define them.

In fact, if Box P {\displaystyle \operatorname {Box} _{P}} denotes the set of all such products of closed balls containing the polar set P , {\displaystyle P,} Box P   = def   { B R   :   R T P }   =   { B R   :   P B R } , {\displaystyle \operatorname {Box} _{P}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left\{\prod B_{R_{\bullet }}~:~R_{\bullet }\in T_{P}\right\}~=~\left\{\prod B_{R_{\bullet }}~:~P\subseteq \prod B_{R_{\bullet }}\right\},} then B m = Box P Box P {\textstyle \prod B_{m_{\bullet }}=\cap \operatorname {Box} _{P}\in \operatorname {Box} _{P}} where Box P {\textstyle \bigcap \operatorname {Box} _{P}} denotes the intersection of all sets belonging to Box P . {\displaystyle \operatorname {Box} _{P}.}

This implies (among other things) that B m = x X B m x {\textstyle \prod B_{m_{\bullet }}=\prod _{x\in X}B_{m_{x}}} the unique least element of Box P {\displaystyle \operatorname {Box} _{P}} with respect to ; {\displaystyle \,\subseteq ;} this may be used as an alternative definition of this (necessarily convex and balanced) set. The function m   = def   ( m x ) x X : X [ 0 , ) {\displaystyle m_{\bullet }~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left(m_{x}\right)_{x\in X}:X\to [0,\infty )} is a seminorm and it is unchanged if U {\displaystyle U} is replaced by the convex balanced hull of U {\displaystyle U} (because U # = [ cobal U ] # {\displaystyle U^{\#}=^{\#}} ). Similarly, because U = [ cl X U ] , {\displaystyle U^{\circ }=\left^{\circ },} m {\displaystyle m_{\bullet }} is also unchanged if U {\displaystyle U} is replaced by its closure in X . {\displaystyle X.}

Sequential Banach–Alaoglu theorem

A special case of the Banach–Alaoglu theorem is the sequential version of the theorem, which asserts that the closed unit ball of the dual space of a separable normed vector space is sequentially compact in the weak-* topology. In fact, the weak* topology on the closed unit ball of the dual of a separable space is metrizable, and thus compactness and sequential compactness are equivalent.

Specifically, let X {\displaystyle X} be a separable normed space and B {\displaystyle B} the closed unit ball in X . {\displaystyle X^{\prime }.} Since X {\displaystyle X} is separable, let x = ( x n ) n = 1 {\displaystyle x_{\bullet }=\left(x_{n}\right)_{n=1}^{\infty }} be a countable dense subset. Then the following defines a metric, where for any x , y B {\displaystyle x,y\in B} ρ ( x , y ) = n = 1 2 n | x y , x n | 1 + | x y , x n | {\displaystyle \rho (x,y)=\sum _{n=1}^{\infty }\,2^{-n}\,{\frac {\left|\langle x-y,x_{n}\rangle \right|}{1+\left|\langle x-y,x_{n}\rangle \right|}}} in which , {\displaystyle \langle \cdot ,\cdot \rangle } denotes the duality pairing of X {\displaystyle X^{\prime }} with X . {\displaystyle X.} Sequential compactness of B {\displaystyle B} in this metric can be shown by a diagonalization argument similar to the one employed in the proof of the Arzelà–Ascoli theorem.

Due to the constructive nature of its proof (as opposed to the general case, which is based on the axiom of choice), the sequential Banach–Alaoglu theorem is often used in the field of partial differential equations to construct solutions to PDE or variational problems. For instance, if one wants to minimize a functional F : X R {\displaystyle F:X^{\prime }\to \mathbb {R} } on the dual of a separable normed vector space X , {\displaystyle X,} one common strategy is to first construct a minimizing sequence x 1 , x 2 , X {\displaystyle x_{1},x_{2},\ldots \in X^{\prime }} which approaches the infimum of F , {\displaystyle F,} use the sequential Banach–Alaoglu theorem to extract a subsequence that converges in the weak* topology to a limit x , {\displaystyle x,} and then establish that x {\displaystyle x} is a minimizer of F . {\displaystyle F.} The last step often requires F {\displaystyle F} to obey a (sequential) lower semi-continuity property in the weak* topology.

When X {\displaystyle X^{\prime }} is the space of finite Radon measures on the real line (so that X = C 0 ( R ) {\displaystyle X=C_{0}(\mathbb {R} )} is the space of continuous functions vanishing at infinity, by the Riesz representation theorem), the sequential Banach–Alaoglu theorem is equivalent to the Helly selection theorem.

Proof

For every x X , {\displaystyle x\in X,} let D x = { c C : | c | x } {\displaystyle D_{x}=\{c\in \mathbb {C} :|c|\leq \|x\|\}} and let D = x X D x {\displaystyle D=\prod _{x\in X}D_{x}} be endowed with the product topology. Because every D x {\displaystyle D_{x}} is a compact subset of the complex plane, Tychonoff's theorem guarantees that their product D {\displaystyle D} is compact.

The closed unit ball in X , {\displaystyle X^{\prime },} denoted by B 1 , {\displaystyle B_{1}^{\,\prime },} can be identified as a subset of D {\displaystyle D} in a natural way: F : B 1 D f ( f ( x ) ) x X . {\displaystyle {\begin{alignedat}{4}F:\;&&B_{1}^{\,\prime }&&\;\to \;&D\\&&f&&\;\mapsto \;&(f(x))_{x\in X}.\\\end{alignedat}}}

This map is injective and it is continuous when B 1 {\displaystyle B_{1}^{\,\prime }} has the weak-* topology. This map's inverse, defined on its image, is also continuous.

It will now be shown that the image of the above map is closed, which will complete the proof of the theorem. Given a point λ = ( λ x ) x X D {\displaystyle \lambda _{\bullet }=\left(\lambda _{x}\right)_{x\in X}\in D} and a net ( f i ( x ) ) x X {\displaystyle \left(f_{i}(x)\right)_{x\in X}} in the image of F {\displaystyle F} indexed by i I {\displaystyle i\in I} such that lim i ( f i ( x ) ) x X λ  in  D , {\displaystyle \lim _{i}\left(f_{i}(x)\right)_{x\in X}\to \lambda _{\bullet }\quad {\text{ in }}D,} the functional g : X C {\displaystyle g:X\to \mathbb {C} } defined by g ( x ) = λ x  for every  x X , {\displaystyle g(x)=\lambda _{x}\qquad {\text{ for every }}x\in X,} lies in B 1 {\displaystyle B_{1}^{\,\prime }} and F ( g ) = λ . {\displaystyle F(g)=\lambda _{\bullet }.} {\displaystyle \blacksquare }

Consequences

Consequences for normed spaces

Assume that X {\displaystyle X} is a normed space and endow its continuous dual space X {\displaystyle X^{\prime }} with the usual dual norm.

  • The closed unit ball in X {\displaystyle X^{\prime }} is weak-* compact. So if X {\displaystyle X^{\prime }} is infinite dimensional then its closed unit ball is necessarily not compact in the norm topology by F. Riesz's theorem (despite it being weak-* compact).
  • A Banach space is reflexive if and only if its closed unit ball is σ ( X , X ) {\displaystyle \sigma \left(X,X^{\prime }\right)} -compact; this is known as James' theorem.
  • If X {\displaystyle X} is a reflexive Banach space, then every bounded sequence in X {\displaystyle X} has a weakly convergent subsequence. (This follows by applying the Banach–Alaoglu theorem to a weakly metrizable subspace of X {\displaystyle X} ; or, more succinctly, by applying the Eberlein–Šmulian theorem.) For example, suppose that X {\displaystyle X} is the space Lp space L p ( μ ) {\displaystyle L^{p}(\mu )} where 1 < p < {\displaystyle 1<p<\infty } and let q {\displaystyle q} satisfy 1 p + 1 q = 1. {\displaystyle {\frac {1}{p}}+{\frac {1}{q}}=1.} Let f 1 , f 2 , {\displaystyle f_{1},f_{2},\ldots } be a bounded sequence of functions in X . {\displaystyle X.} Then there exists a subsequence ( f n k ) k = 1 {\displaystyle \left(f_{n_{k}}\right)_{k=1}^{\infty }} and an f X {\displaystyle f\in X} such that f n k g d μ f g d μ  for all  g L q ( μ ) = X . {\displaystyle \int f_{n_{k}}g\,d\mu \to \int fg\,d\mu \qquad {\text{ for all }}g\in L^{q}(\mu )=X^{\prime }.} The corresponding result for p = 1 {\displaystyle p=1} is not true, as L 1 ( μ ) {\displaystyle L^{1}(\mu )} is not reflexive.

Consequences for Hilbert spaces

  • In a Hilbert space, every bounded and closed set is weakly relatively compact, hence every bounded net has a weakly convergent subnet (Hilbert spaces are reflexive).
  • As norm-closed, convex sets are weakly closed (Hahn–Banach theorem), norm-closures of convex bounded sets in Hilbert spaces or reflexive Banach spaces are weakly compact.
  • Closed and bounded sets in B ( H ) {\displaystyle B(H)} are precompact with respect to the weak operator topology (the weak operator topology is weaker than the ultraweak topology which is in turn the weak-* topology with respect to the predual of B ( H ) , {\displaystyle B(H),} the trace class operators). Hence bounded sequences of operators have a weak accumulation point. As a consequence, B ( H ) {\displaystyle B(H)} has the Heine–Borel property, if equipped with either the weak operator or the ultraweak topology.

Relation to the axiom of choice and other statements

See also: Krein–Milman theorem § Relation to other statements

The Banach–Alaoglu may be proven by using Tychonoff's theorem, which under the Zermelo–Fraenkel set theory (ZF) axiomatic framework is equivalent to the axiom of choice. Most mainstream functional analysis relies on ZF + the axiom of choice, which is often denoted by ZFC. However, the theorem does not rely upon the axiom of choice in the separable case (see above): in this case there actually exists a constructive proof. In the general case of an arbitrary normed space, the ultrafilter Lemma, which is strictly weaker than the axiom of choice and equivalent to Tychonoff's theorem for compact Hausdorff spaces, suffices for the proof of the Banach–Alaoglu theorem, and is in fact equivalent to it.

The Banach–Alaoglu theorem is equivalent to the ultrafilter lemma, which implies the Hahn–Banach theorem for real vector spaces (HB) but is not equivalent to it (said differently, Banach–Alaoglu is also strictly stronger than HB). However, the Hahn–Banach theorem is equivalent to the following weak version of the Banach–Alaoglu theorem for normed space in which the conclusion of compactness (in the weak-* topology of the closed unit ball of the dual space) is replaced with the conclusion of quasicompactness (also sometimes called convex compactness);

Weak version of Alaoglu theorem — Let X {\displaystyle X} be a normed space and let B {\displaystyle B} denote the closed unit ball of its continuous dual space X . {\displaystyle X^{\prime }.} Then B {\displaystyle B} has the following property, which is called (weak-*) quasicompactness or convex compactness: whenever C {\displaystyle {\mathcal {C}}} is a cover of B {\displaystyle B} by convex weak-* closed subsets of X {\displaystyle X^{\prime }} such that { B C : C C } {\displaystyle \{B\cap C:C\in {\mathcal {C}}\}} has the finite intersection property, then B C C C {\displaystyle B\cap \bigcap _{C\in {\mathcal {C}}}C} is not empty.

Compactness implies convex compactness because a topological space is compact if and only if every family of closed subsets having the finite intersection property (FIP) has non-empty intersection. The definition of convex compactness is similar to this characterization of compact spaces in terms of the FIP, except that it only involves those closed subsets that are also convex (rather than all closed subsets).

See also

Notes

  1. Explicitly, a subset B X {\displaystyle B^{\prime }\subseteq X^{\prime }} is said to be "compact (resp. totally bounded, etc.) in the weak-* topology" if when X {\displaystyle X^{\prime }} is given the weak-* topology and the subset B {\displaystyle B^{\prime }} is given the subspace topology inherited from ( X , σ ( X , X ) ) , {\displaystyle \left(X^{\prime },\sigma \left(X^{\prime },X\right)\right),} then B {\displaystyle B^{\prime }} is a compact (resp. totally bounded, etc.) space.
  2. If τ {\displaystyle \tau } denotes the topology that X {\displaystyle X} is (originally) endowed with, then the equality U = U # {\displaystyle U^{\circ }=U^{\#}} shows that the polar U = { f X   :   sup u U | f ( u ) | 1 } {\displaystyle U^{\circ }={\Big \{}f\in X^{\prime }~:~\sup _{u\in U}|f(u)|\leq 1{\Big \}}} of U {\displaystyle U} is dependent only on U {\displaystyle U} (and X # {\displaystyle X^{\#}} ) and that the rest of the topology τ {\displaystyle \tau } can be ignored. To clarify what is meant, suppose σ {\displaystyle \sigma } is any TVS topology on X {\displaystyle X} such that the set U {\displaystyle U} is (also) a neighborhood of the origin in ( X , σ ) . {\displaystyle (X,\sigma ).} Denote the continuous dual space of ( X , σ ) {\displaystyle (X,\sigma )} by ( X , σ ) {\displaystyle (X,\sigma )^{\prime }} and denote the polar of U {\displaystyle U} with respect to ( X , σ ) {\displaystyle (X,\sigma )} by U , σ   = def   { f ( X , σ )   :   sup u U | f ( u ) | 1 } {\displaystyle U^{\circ ,\sigma }~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~{\Big \{}f\in (X,\sigma )^{\prime }~:~\sup _{u\in U}|f(u)|\leq 1{\Big \}}} so that U , τ {\displaystyle U^{\circ ,\tau }} is just the set U {\displaystyle U^{\circ }} from above. Then U , τ = U , σ {\displaystyle U^{\circ ,\tau }=U^{\circ ,\sigma }} because both of these sets are equal to U # . {\displaystyle U^{\#}.} Said differently, the polar set U , σ {\displaystyle U^{\circ ,\sigma }} 's defining "requirement" that U , σ {\displaystyle U^{\circ ,\sigma }} be a subset of the continuous dual space ( X , σ ) {\displaystyle (X,\sigma )^{\prime }} is inconsequential and can be ignored because it does not have any effect on the resulting set of linear functionals. However, if ν {\displaystyle \nu } is a TVS topology on X {\displaystyle X} such that U {\displaystyle U} is not a neighborhood of the origin in ( X , ν ) {\displaystyle (X,\nu )} then the polar U , ν {\displaystyle U^{\circ ,\nu }} of U {\displaystyle U} with respect to ( X , ν ) {\displaystyle (X,\nu )} is not guaranteed to equal U # {\displaystyle U^{\#}} and so the topology ν {\displaystyle \nu } can not be ignored.
  3. Because every B r x {\displaystyle B_{r_{x}}} is also a Hausdorff space, the conclusion that x X B r x {\displaystyle \prod _{x\in X}B_{r_{x}}} is compact only requires the so-called "Tychonoff's theorem for compact Hausdorff spaces," which is equivalent to the ultrafilter lemma and strictly weaker than the axiom of choice.
  4. The conclusion U B 1 = x X C x {\displaystyle U_{B_{1}}=\prod _{x\in X}C_{x}} can be written as U B 1   =   ( u U B 1 ) × x X U K . {\displaystyle U_{B_{1}}~=~{\Big (}\prod _{u\in U}B_{1}{\Big )}\times \prod _{x\in X\setminus U}\mathbb {K} .} The set U # {\displaystyle U^{\#}} may thus equivalently be defined by U #   = def   X # [ ( u U B 1 ) × x X U K ] . {\displaystyle U^{\#}~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~X^{\#}\cap \left.} Rewriting the definition in this way helps make it apparent that the set U # {\displaystyle U^{\#}} is closed in x X K {\displaystyle \prod _{x\in X}\mathbb {K} } because this is true of X # . {\displaystyle X^{\#}.}
  5. This tuple m   = def   ( m x ) x X {\displaystyle m_{\bullet }~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\left(m_{x}\right)_{x\in X}} is the least element of T P {\displaystyle T_{P}} with respect to natural induced pointwise partial order defined by R S {\displaystyle R_{\bullet }\leq S_{\bullet }} if and only if R x S x {\displaystyle R_{x}\leq S_{x}} for every x X . {\displaystyle x\in X.} Thus, every neighborhood U {\displaystyle U} of the origin in X {\displaystyle X} can be associated with this unique (minimum) function m : X [ 0 , ) . {\displaystyle m_{\bullet }:X\to [0,\infty ).} For any x X , {\displaystyle x\in X,} if r > 0 {\displaystyle r>0} is such that x r U {\displaystyle x\in rU} then m x r {\displaystyle m_{x}\leq r} so that in particular, m 0 = 0 {\displaystyle m_{0}=0} and m u 1 {\displaystyle m_{u}\leq 1} for every u U . {\displaystyle u\in U.}

Proofs

  1. For any non-empty subset A [ 0 , ) , {\displaystyle A\subseteq [0,\infty ),} the equality { B a : a A } = B inf A {\displaystyle \cap \left\{B_{a}:a\in A\right\}=B_{\inf _{}A}} holds (the intersection on the left is a closed, rather than open, disk − possibly of radius 0 {\displaystyle 0} − because it is an intersection of closed subsets of K {\displaystyle \mathbb {K} } and so must itself be closed). For every x X , {\displaystyle x\in X,} let m x = inf { R x : R T P } {\displaystyle m_{x}=\inf _{}\left\{R_{x}:R_{\bullet }\in T_{P}\right\}} so that the previous set equality implies Box P = R T P x X B R x = x X R T P B R x = x X B m x . {\displaystyle \cap \operatorname {Box} _{P}=\bigcap _{R_{\bullet }\in T_{P}}\prod _{x\in X}B_{R_{x}}=\prod _{x\in X}\bigcap _{R_{\bullet }\in T_{P}}B_{R_{x}}=\prod _{x\in X}B_{m_{x}}.} From P Box P {\displaystyle P\subseteq \cap \operatorname {Box} _{P}} it follows that m T P {\displaystyle m_{\bullet }\in T_{P}} and Box P Box P , {\displaystyle \cap \operatorname {Box} _{P}\in \operatorname {Box} _{P},} thereby making Box P {\displaystyle \cap \operatorname {Box} _{P}} the least element of Box P {\displaystyle \operatorname {Box} _{P}} with respect to . {\displaystyle \,\subseteq .\,} (In fact, the family Box P {\displaystyle \operatorname {Box} _{P}} is closed under (non-nullary) arbitrary intersections and also under finite unions of at least one set). The elementary proof showed that T P {\displaystyle T_{P}} and Box P {\displaystyle \operatorname {Box} _{P}} are not empty and moreover, it also even showed that T P {\displaystyle T_{P}} has an element ( r x ) x X {\displaystyle \left(r_{x}\right)_{x\in X}} that satisfies r u = 1 {\displaystyle r_{u}=1} for every u U , {\displaystyle u\in U,} which implies that m u 1 {\displaystyle m_{u}\leq 1} for every u U . {\displaystyle u\in U.} The inclusion P     ( Box P ) X     ( Box P ) X # {\displaystyle P~\subseteq ~\left(\cap \operatorname {Box} _{P}\right)\cap X^{\prime }~\subseteq ~\left(\cap \operatorname {Box} _{P}\right)\cap X^{\#}} is immediate; to prove the reverse inclusion, let f ( Box P ) X # . {\displaystyle f\in \left(\cap \operatorname {Box} _{P}\right)\cap X^{\#}.} By definition, f P   = def   U # {\displaystyle f\in P~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~U^{\#}} if and only if sup u U | f ( u ) | 1 , {\displaystyle \sup _{u\in U}|f(u)|\leq 1,} so let u U {\displaystyle u\in U} and it remains to show that | f ( u ) | 1. {\displaystyle |f(u)|\leq 1.} From f Box P = B m , {\displaystyle f\in \cap \operatorname {Box} _{P}=\prod B_{m_{\bullet }},} it follows that f ( u ) = Pr u ( f ) Pr u ( x X B m x ) = B m u , {\displaystyle f(u)=\Pr {}_{u}(f)\in \Pr {}_{u}\left(\prod _{x\in X}B_{m_{x}}\right)=B_{m_{u}},} which implies that | f ( u ) | m u 1 , {\displaystyle |f(u)|\leq m_{u}\leq 1,} as desired. {\displaystyle \blacksquare }

Citations

  1. Rudin 1991, Theorem 3.15.
  2. ^ Narici & Beckenstein 2011, pp. 235–240.
  3. ^ Narici & Beckenstein 2011, pp. 225–273.
  4. Köthe 1983, Theorem (4) in §20.9.
  5. Meise & Vogt 1997, Theorem 23.5.
  6. ^ Bell, J.; Fremlin, David (1972). "A Geometric Form of the Axiom of Choice" (PDF). Fundamenta Mathematicae. 77 (2): 167–170. doi:10.4064/fm-77-2-167-170. Retrieved 26 Dec 2021.

References

Further reading

Duality and spaces of linear maps
Basic concepts
Topologies
Main results
Maps
Subsets
Other concepts
Functional analysis (topicsglossary)
Spaces
Properties
Theorems
Operators
Algebras
Open problems
Applications
Advanced topics
Topological vector spaces (TVSs)
Basic concepts
Main results
Maps
Types of sets
Set operations
Types of TVSs
Categories: