Moment problem

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.

Trying to map moments to a measure that generates them
Example: Given the mean and variance $\sigma^{2}$ (with all further cumulants equal to 0), the normal distribution is the distribution solving the moment problem.

In mathematics, a moment problem arises as the result of trying to invert the mapping that takes a measure $\mu$ to the sequence of moments

$$m_{n}=\int_{-\infty}^{\infty}x^{n}\,d\mu(x)\,.$$

More generally, one may consider

$$m_{n}=\int_{-\infty}^{\infty}M_{n}(x)\,d\mu(x)$$

for an arbitrary sequence of functions $M_{n}$.
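As a small illustration of the forward map from a measure to its moments, the sketch below assumes a discrete measure (a finite sum of weighted point masses), for which the integral reduces to a finite sum. The function name `moments` and its `basis` parameter are hypothetical names introduced here; `basis` plays the role of the sequence $M_{n}$ and defaults to the classical choice $x^{n}$.

```python
import numpy as np

def moments(points, weights, count, basis=None):
    """Return m_0 .. m_{count-1} for the discrete measure sum_i weights[i] * delta(points[i]).

    For a discrete measure the integral of M_n against mu is a finite weighted sum.
    """
    points = np.asarray(points, dtype=float)
    weights = np.asarray(weights, dtype=float)
    if basis is None:
        # Classical case: M_n(x) = x^n.
        basis = lambda n, x: x ** n
    return np.array([np.sum(weights * basis(n, points)) for n in range(count)])

# Fair coin on {-1, +1}: odd moments vanish and even moments equal 1.
coin = moments([-1.0, 1.0], [0.5, 0.5], count=5)

# Generalized moments with M_n(x) = cos(n x) for point masses at 0 and pi.
cosine = moments([0.0, np.pi], [0.5, 0.5], count=3, basis=lambda n, x: np.cos(n * x))
```

The moment problem asks for the reverse direction: given only an array such as `coin`, decide whether some measure produces it and whether that measure is unique.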

Introduction

In the classical setting, $\mu$ is a measure on the real line, and $M$ is the sequence $\{x^{n}:n=1,2,\dotsc\}$. In this form the question appears in probability theory, asking whether there is a probability measure having specified mean, variance and so on, and whether it is unique.

There are three named classical moment problems: the Hamburger moment problem, in which the support of $\mu$ is allowed to be the whole real line; the Stieltjes moment problem, for $[0,\infty)$; and the Hausdorff moment problem, for a bounded interval, which without loss of generality may be taken as $[0,1]$.

The moment problem also extends to complex analysis as the trigonometric moment problem, in which the Hankel matrices are replaced by Toeplitz matrices and the support of $\mu$ is the complex unit circle instead of the real line.

Existence

A sequence of numbers $m_{n}$ is the sequence of moments of a measure $\mu$ if and only if a certain positivity condition is fulfilled; namely, the Hankel matrices $H_{n}$,

$$(H_{n})_{ij}=m_{i+j}\,,$$

should be positive semi-definite. This is because a positive-semidefinite Hankel matrix corresponds to a linear functional $\Lambda$ such that $\Lambda(x^{n})=m_{n}$ and $\Lambda(f^{2})\geq 0$ (that is, $\Lambda$ is non-negative on sums of squares of polynomials). Assume $\Lambda$ can be extended to $\mathbb{R}[x]^{*}$. In the univariate case, a non-negative polynomial can always be written as a sum of squares, so $\Lambda$ is non-negative on all non-negative polynomials. By Haviland's theorem, the linear functional then has a measure form, that is, $\Lambda(x^{n})=\int_{-\infty}^{\infty}x^{n}\,d\mu$. A condition of similar form is necessary and sufficient for the existence of a measure $\mu$ supported on a given interval $[a,b]$.
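The positivity condition can be checked numerically on a truncated sequence. The sketch below assumes zero-based indexing, so that $H_{n}$ is the $(n+1)\times(n+1)$ matrix built from $m_{0},\dots,m_{2n}$; the helper names `hankel_matrix` and `is_psd` and the tolerance `tol` are hypothetical choices made here. Passing the test for finitely many $n$ only confirms necessary conditions on the first few moments; it does not establish that the full sequence is a moment sequence.

```python
import numpy as np

def hankel_matrix(m, n):
    """Build H_n with entries (H_n)_{ij} = m_{i+j} for 0 <= i, j <= n."""
    m = np.asarray(m, dtype=float)
    if len(m) < 2 * n + 1:
        raise ValueError("need the moments m_0 .. m_{2n}")
    idx = np.arange(n + 1)
    return m[idx[:, None] + idx[None, :]]

def is_psd(matrix, tol=1e-10):
    """Check positive semi-definiteness via the smallest eigenvalue of a symmetric matrix."""
    return bool(np.linalg.eigvalsh(matrix).min() >= -tol)

# m_0 .. m_6 of the standard normal distribution.
normal_moments = [1, 0, 1, 0, 3, 0, 15]
normal_ok = all(is_psd(hankel_matrix(normal_moments, n)) for n in range(4))

# A sequence with a negative second moment cannot come from a (positive) measure.
bad_moments = [1, 0, -1]
bad_ok = is_psd(hankel_matrix(bad_moments, 1))
```

Here `normal_ok` is true and `bad_ok` is false, since the second Hankel matrix has the eigenvalue $-1$.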

One way to prove these results is to consider the linear functional $\varphi$ that sends a polynomial

$$P(x)=\sum_{k}a_{k}x^{k}$$

to

$$\sum_{k}a_{k}m_{k}.$$

If $m_{k}$ are the moments of some measure $\mu$ supported on $[a,b]$, then evidently

$\varphi(P)\geq 0$ for any polynomial $P$ that is non-negative on $[a,b]$.   (1)

Conversely, if (1) holds, one can apply the M. Riesz extension theorem and extend $\varphi$ to a functional on the space of continuous functions with compact support $C_{c}([a,b])$, so that

$\varphi(f)\geq 0$ for any $f\in C_{c}([a,b])$ with $f\geq 0$.   (2)

By the Riesz representation theorem, (2) holds if and only if there exists a measure $\mu$ supported on $[a,b]$ such that

$$\varphi(f)=\int f\,d\mu$$

for every $f\in C_{c}([a,b])$.

Thus the existence of the measure $\mu$ is equivalent to (1). Using a representation theorem for positive polynomials on $[a,b]$, one can reformulate (1) as a condition on Hankel matrices.
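Condition (1) can be made concrete with a short sketch. It assumes the interval $[0,1]$ and uses exact rational arithmetic; the function name `phi` and the two example sequences are illustrative choices made here. The polynomial $(x-\tfrac{1}{2})^{2}$ is non-negative everywhere, so a candidate sequence on which the functional is negative cannot be a moment sequence.

```python
from fractions import Fraction

def phi(coeffs, moments):
    """Evaluate phi(P) = sum_k a_k m_k for P(x) = sum_k a_k x^k, with coeffs = [a_0, a_1, ...]."""
    if len(coeffs) > len(moments):
        raise ValueError("not enough moments for the degree of P")
    return sum(Fraction(a) * Fraction(m) for a, m in zip(coeffs, moments))

# P(x) = (x - 1/2)^2 = 1/4 - x + x^2, non-negative on [0, 1].
square = [Fraction(1, 4), -1, 1]

# Moments of the uniform measure on [0, 1]: m_k = 1 / (k + 1).
uniform_moments = [Fraction(1, k + 1) for k in range(3)]
value_uniform = phi(square, uniform_moments)

# Candidate sequence with m_2 < m_1^2, which would force a negative variance.
candidate = [1, Fraction(1, 2), Fraction(1, 5)]
value_candidate = phi(square, candidate)
```

For the uniform measure the functional returns $1/12$ (its variance), while the candidate sequence gives $-1/20$, violating (1).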

Uniqueness (or determinacy)

See also: Carleman's condition and Krein's condition

The uniqueness of $\mu$ in the Hausdorff moment problem follows from the Weierstrass approximation theorem, which states that polynomials are dense under the uniform norm in the space of continuous functions on $[0,1]$. For the problem on an infinite interval, uniqueness is a more delicate question. There are distributions, such as the log-normal distribution, that have finite moments of every positive integer order and yet share all of those moments with other distributions.
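A standard illustration of this non-uniqueness, usually attributed to Heyde and not spelled out in the text above, perturbs the standard log-normal density $f$ to $f(x)\,(1+\varepsilon\sin(2\pi\ln x))$ with $|\varepsilon|\leq 1$. The sketch below checks numerically that the integer moments do not depend on $\varepsilon$. It assumes the substitution $y=\ln x$ and a plain Riemann sum on a wide uniform grid; the function name and the grid parameters are arbitrary choices made here.

```python
import numpy as np

def perturbed_lognormal_moment(n, eps, half_width=12.0, samples=200_001):
    """n-th moment of the density f(x) * (1 + eps * sin(2 pi ln x)), f = standard log-normal.

    After y = ln x the moment becomes the integral over y of
    exp(n y) * normal_pdf(y) * (1 + eps * sin(2 pi y)).
    """
    # The integrand is concentrated around y = n, so centre the grid there.
    y = np.linspace(n - half_width, n + half_width, samples)
    integrand = np.exp(n * y - 0.5 * y ** 2) / np.sqrt(2 * np.pi)
    integrand = integrand * (1 + eps * np.sin(2 * np.pi * y))
    dy = y[1] - y[0]
    # Riemann sum; the integrand is negligible at both ends of the grid.
    return float(np.sum(integrand) * dy)

plain = [perturbed_lognormal_moment(n, eps=0.0) for n in range(5)]
perturbed = [perturbed_lognormal_moment(n, eps=0.5) for n in range(5)]
```

Both lists agree with the log-normal moments $e^{n^{2}/2}$ up to numerical error, even though the two densities differ.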

Formal solution

When the solution exists, it can be formally written using derivatives of the Dirac delta function as

$$d\mu(x)=\rho(x)\,dx,\quad \rho(x)=\sum_{n=0}^{\infty}\frac{(-1)^{n}}{n!}\,\delta^{(n)}(x)\,m_{n}.$$

The expression can be derived by taking the inverse Fourier transform of its characteristic function.
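A sketch of that derivation follows. It is purely formal: it assumes the characteristic function can be expanded in its moment series and that the series can be integrated term by term, which in general only makes sense in the sense of distributions. The symbols $\hat{\mu}$ and $t$ are notation introduced here.

```latex
\begin{aligned}
\hat{\mu}(t) &= \int_{-\infty}^{\infty} e^{itx}\,d\mu(x)
              = \sum_{n=0}^{\infty} \frac{(it)^{n}}{n!}\,m_{n}, \\
\rho(x) &= \frac{1}{2\pi}\int_{-\infty}^{\infty} e^{-itx}\,\hat{\mu}(t)\,dt
         = \sum_{n=0}^{\infty} \frac{m_{n}}{n!}\,
           \frac{1}{2\pi}\int_{-\infty}^{\infty} (it)^{n} e^{-itx}\,dt, \\
\delta^{(n)}(x) &= \frac{d^{n}}{dx^{n}}\,\frac{1}{2\pi}\int_{-\infty}^{\infty} e^{-itx}\,dt
                 = \frac{(-1)^{n}}{2\pi}\int_{-\infty}^{\infty} (it)^{n} e^{-itx}\,dt .
\end{aligned}
```

Substituting the last identity into the second line replaces each inner integral by $(-1)^{n}\delta^{(n)}(x)$, which gives the series for $\rho$.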

Variations

See also: Chebyshev–Markov–Stieltjes inequalities

An important variation is the truncated moment problem, which studies the properties of measures with fixed first k moments (for a finite k). Results on the truncated moment problem have numerous applications to extremal problems, optimisation and limit theorems in probability theory.

Probability

The moment problem has applications to probability theory. The following is commonly used:

Theorem (Fréchet-Shohat): If $\mu$ is a determinate measure (i.e. its moments determine it uniquely), and the measures $\mu_{n}$ are such that $$\forall k\geq 0\quad \lim_{n\to\infty}m_{k}[\mu_{n}]=m_{k}[\mu],$$ then $\mu_{n}\to\mu$ in distribution.

By checking Carleman's condition, we know that the standard normal distribution is a determinate measure, thus we have the following form of the central limit theorem:

Corollary: If a sequence of probability distributions $\nu_{n}$ satisfies $$m_{2k}[\nu_{n}]\to\frac{(2k)!}{2^{k}k!};\quad m_{2k+1}[\nu_{n}]\to 0$$ then $\nu_{n}$ converges to $N(0,1)$ in distribution.
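The corollary can be illustrated on a simple example that is an assumption of this sketch rather than part of the text: $\nu_{n}$ is taken to be the law of $S_{n}/\sqrt{n}$, where $S_{n}$ is a sum of $n$ independent fair $\pm 1$ coin flips. The moments are computed exactly from the binomial distribution and compared with the Gaussian values $(2k)!/(2^{k}k!)$; the function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def standardized_walk_moment(k, n):
    """k-th moment of S_n / sqrt(n), where S_n is a sum of n independent fair +-1 flips."""
    # S_n takes the value 2j - n with probability comb(n, j) / 2**n.
    raw = Fraction(sum(comb(n, j) * (2 * j - n) ** k for j in range(n + 1)), 2 ** n)
    return float(raw) / n ** (k / 2)

def gaussian_moment(k):
    """k-th moment of N(0, 1): zero for odd k, (2h)! / (2^h h!) for k = 2h."""
    if k % 2:
        return 0.0
    h = k // 2
    return factorial(k) / (2 ** h * factorial(h))

# Compare the first few moments for a moderately large n.
n = 400
comparison = [(k, standardized_walk_moment(k, n), gaussian_moment(k)) for k in range(1, 7)]
```

The odd moments vanish by symmetry and the second moment equals 1 for every $n$, while the fourth moment equals $3-2/n$ and approaches the Gaussian value 3 as $n$ grows.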
