Misplaced Pages

Convergence proof techniques

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

Convergence proof techniques are canonical patterns of mathematical proofs that sequences or functions converge to a finite limit when the argument tends to infinity.

There are many types of sequences and modes of convergence, and different proof techniques may be more appropriate than others for proving each type of convergence of each type of sequence. Below are some of the more common and typical examples. This article is intended as an introduction aimed to help practitioners explore appropriate techniques. The links below give details of necessary conditions and generalizations to more abstract settings. Proof techniques for the convergence of series, a particular type of sequences corresponding to sums of many terms, are covered in the article on convergence tests.

Convergence in R

It is common to want to prove convergence of a sequence f : N R n {\displaystyle f:\mathbb {N} \rightarrow \mathbb {R} ^{n}} or function f : R R n {\displaystyle f:\mathbb {R} \rightarrow \mathbb {R} ^{n}} , where N {\displaystyle \mathbb {N} } and R {\displaystyle \mathbb {R} } refer to the natural numbers and the real numbers, respectively, and convergence is with respect to the Euclidean norm, | | | | 2 {\displaystyle ||\cdot ||_{2}} .

Useful approaches for this are as follows.

First principles

The analytic definition of convergence of f {\displaystyle f} to a limit f {\displaystyle f_{\infty }} is that for all ϵ {\displaystyle \epsilon } there exists a k 0 N {\displaystyle k_{0}\in \mathbb {N} } such for all k > k 0 {\displaystyle k>k_{0}} , f ( k ) f < ϵ {\displaystyle \|f(k)-f_{\infty }\|<\epsilon } . The most direct proof technique from this definition is to find such a k 0 {\displaystyle k_{0}} and prove the required inequality. If the value of f {\displaystyle f_{\infty }} is not known in advance, the techniques below may be useful.

Contraction mappings

In many cases, the function whose convergence is of interest has the form f ( k + 1 ) = T ( f ( k ) ) {\displaystyle f(k+1)=T(f(k))} for some transformation T {\displaystyle T} . For example, T {\displaystyle T} could map f ( k ) {\displaystyle f(k)} to f ( k + 1 ) = A f ( k ) {\displaystyle f(k+1)=Af(k)} for some conformable matrix A {\displaystyle A} , so that f ( k ) = A k f ( 0 ) {\displaystyle f(k)=A^{k}f(0)} , a matrix generalization of the geometric progression. Alternatively, T {\displaystyle T} may be an elementwise operation, such as replacing each element of f ( k ) {\displaystyle f(k)} by the square root of its magnitude.

In such cases, if the problem satisfies the conditions of Banach fixed-point theorem (the domain is a non-empty complete metric space) then it is sufficient to prove convergence to prove that T {\displaystyle T} is a contraction mapping to prove that it has a fixed point. This requires that T ( x ) T ( y ) < λ ( x y ) {\displaystyle \|T(x)-T(y)\|<\|\lambda (x-y)\|} for some constant | λ | < 1 {\displaystyle |\lambda |<1} which is fixed for all x {\displaystyle x} and y {\displaystyle y} . The composition of two contraction mappings is a contraction mapping, so if T = T 1 T 2 {\displaystyle T=T_{1}\circ T_{2}} , then it is sufficient to show that T 1 {\displaystyle T_{1}} and T 2 {\displaystyle T_{2}} are both contraction mappings.

Example

Famous examples of applications of this approach include

  • If T {\displaystyle T} has the form T ( x ) = A x + B {\displaystyle T(x)=Ax+B} for some matrices A {\displaystyle A} and B {\displaystyle B} , then T k ( x ) {\displaystyle T^{k}(x)} converges to ( I A ) 1 B {\displaystyle (I-A)^{-1}B} if the magnitudes of all eigenvalues of A {\displaystyle A} are less than 1.

Non-expansion mappings

If both above inequalities in the definition of a contraction mapping are weakened from "strictly less than" to "less than or equal to", the mapping is a non-expansion mapping. It is not sufficient to prove convergence to prove that T {\displaystyle T} is a non-expansion mapping. For example, T ( x ) = x {\displaystyle T(x)=-x} is a non-expansion mapping, but the sequence T n ( x ) {\displaystyle T^{n}(x)} does not converge for any x 0 {\displaystyle x\neq 0} . However, the composition of a contraction mapping and a non-expansion mapping (or vice versa) is a contraction mapping.

Contraction mappings on limited domains

If T {\displaystyle T} is not a contraction mapping on its entire domain, but it is on its codomain (the image of the domain), that is also sufficient for convergence. This also applies for decompositions. For example, consider T ( x ) = cos ( sin ( x ) ) {\displaystyle T(x)=\cos(\sin(x))} . The function cos {\displaystyle \cos } is not a contraction mapping, but it is on the restricted domain [ 1 , 1 ] {\displaystyle } , which is the codomain of sin {\displaystyle \sin } for real arguments. Since sin {\displaystyle \sin } is a non-expansion mapping, this implies T {\displaystyle T} is a contraction mapping.

Convergent subsequences

Every bounded sequence in R n {\displaystyle \mathbb {R} ^{n}} has a convergent subsequence, by the Bolzano–Weierstrass theorem. If these subsequences all have the same limit, then the original sequence also converges to that limit. If it can be shown that all of the subsequences of f {\displaystyle f} must have the same limit, such as by showing that there is a unique fixed point of the transformation T {\displaystyle T} and that there are no invariant sets of T {\displaystyle T} that contain no fixed points of T {\displaystyle T} , then the initial sequence must also converge to that limit.

Monotonicity (Lyapunov functions)

Every bounded monotonic sequence in R n {\displaystyle \mathbb {R} ^{n}} converges to a limit.

This fact can be used directly and can also be used to prove the convergence of sequences that are not monotonic using techniques and theorems named for Aleksandr Lyapunov. In these cases, one defines a function V : R n R {\displaystyle V:\mathbb {R} ^{n}\rightarrow \mathbb {R} } such that V ( f ( k ) ) {\displaystyle V(f(k))} is monotonic in k {\displaystyle k} and thus V ( f ( k ) ) {\displaystyle V(f(k))} converges. If V {\displaystyle V} satisfies the conditions to be a Lyapunov function then Lyapunov's theorem implies that f {\displaystyle f} is also convergent. Lyapunov's theorem is normally stated for ordinary differential equations, but it can also be applied to sequences of iterates by replacing derivatives with discrete differences.

The basic requirements on V {\displaystyle V} to be a Lyapunov function are that

  1. V ( x ) > 0 {\displaystyle V(x)>0} for all x 0 {\displaystyle x\neq 0} and V ( 0 ) = 0 {\displaystyle V(0)=0}
  2. V ( f ( k + 1 ) ) V ( f ( k ) ) < 0 {\displaystyle V(f(k+1))-V(f(k))<0} for f ( k ) 0 {\displaystyle f(k)\neq 0} (discrete case) or V ˙ ( x ) < 0 {\displaystyle {\dot {V}}(x)<0} for x 0 {\displaystyle x\neq 0} (continuous case)
  3. V {\displaystyle V} is "radially unbounded", i.e., that lim k V ( f ( k ) ) = {\textstyle \lim _{k\rightarrow \infty }V(f(k))=\infty } for any sequence with lim k | | f ( k ) | | = {\textstyle \lim _{k\rightarrow \infty }||f(k)||=\infty } .

In many cases a quadratic Lyapunov function of the form V ( x ) = x T A x {\displaystyle V(x)=x^{T}Ax} can be found, although more complex forms are also common, for instance entropies in the study of convergence of probability distributions.

For delay differential equations, a similar approach applies with Lyapunov functions replaced by Lyapunov functionals also called Lyapunov-Krasovskii functionals.

If the inequality in the condition 2 is weak, LaSalle's invariance principle may be used.

Convergence of sequences of functions

To consider the convergence of sequences of functions, it is necessary to define a distance between functions to replace the Euclidean norm. These often include

  • Convergence in the norm (strong convergence) -- a function norm, such as g f = x A g ( x ) d x {\textstyle \|g\|_{f}=\int _{x\in A}\|g(x)\|dx} is defined, and convergence occurs if | | f ( n ) f | | f 0 {\displaystyle ||f(n)-f_{\infty }||_{f}\rightarrow 0} . For this case, all of the above techniques can be applied with this function norm.
  • Pointwise convergence -- convergence occurs if for each x {\displaystyle x} , f n ( x ) f ( x ) {\displaystyle f_{n}(x)\rightarrow f_{\infty }(x)} . For this case, the above techniques can be applied for each point x {\displaystyle x} with the norm appropriate for f ( x ) {\displaystyle f(x)} .
  • uniform convergence -- In pointwise convergence, some (open) regions can converge arbitrarily slowly. With uniform convergence, there is a fixed convergence rate such that all points converge at least that fast. Formally, lim n sup { | f n ( x ) f ( x ) | : x A } = 0 , {\displaystyle \lim _{n\to \infty }\,\sup\{\,\left|f_{n}(x)-f_{\infty }(x)\right|:x\in A\,\}=0,} where A {\displaystyle A} is the domain of each f n {\displaystyle f_{n}} .

See also

Convergence of random variables

Random variables are more complicated than simple elements of R n {\displaystyle \mathbb {R} ^{n}} . (Formally, a random variable is a mapping x : Ω V {\displaystyle x:\Omega \rightarrow V} from an event space Ω {\displaystyle \Omega } to a value space V {\displaystyle V} . The value space may be R n {\displaystyle \mathbb {R} ^{n}} , such as the roll of a dice, and such a random variable is often spoken of informally as being in R n {\displaystyle \mathbb {R} ^{n}} , but convergence of sequence of random variables corresponds to convergence of the sequence of functions, or the distributions, rather than the sequence of values.)

There are multiple types of convergence, depending on how the distance between functions is measured.

Each has its own proof techniques, which are beyond the current scope of this article.

See also

Topological convergence

For all of the above techniques, some form the basic analytic definition of convergence above applies. However, topology has its own definitions of convergence. For example, in a non-Hausdorff space, it is possible for a sequence to converge to multiple different limits.

References

  1. Ross, Kenneth. Elementary Analysis: The Theory of Calculus. Springer.
  2. Haase, Markus. Functional Analysis: An Elementary Introduction. American Mathematics Society.
  3. Billingsley, Patrick (1995). Probability and Measure. John Wesley.
Category: