Misplaced Pages

Random matrix

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Random matrices) Matrix-valued random variable

In probability theory and mathematical physics, a random matrix is a matrix-valued random variable—that is, a matrix in which some or all of its entries are sampled randomly from a probability distribution. Random matrix theory (RMT) is the study of properties of random matrices, often as they become large. RMT provides techniques like mean-field theory, diagrammatic methods, the cavity method, or the replica method to compute quantities like traces, spectral densities, or scalar products between eigenvectors. Many physical phenomena, such as the spectrum of nuclei of heavy atoms, the thermal conductivity of a lattice, or the emergence of quantum chaos, can be modeled mathematically as problems concerning large, random matrices.

Applications

Physics

In nuclear physics, random matrices were introduced by Eugene Wigner to model the nuclei of heavy atoms. Wigner postulated that the spacings between the lines in the spectrum of a heavy atom nucleus should resemble the spacings between the eigenvalues of a random matrix, and should depend only on the symmetry class of the underlying evolution. In solid-state physics, random matrices model the behaviour of large disordered Hamiltonians in the mean-field approximation.

In quantum chaos, the Bohigas–Giannoni–Schmit (BGS) conjecture asserts that the spectral statistics of quantum systems whose classical counterparts exhibit chaotic behaviour are described by random matrix theory.

In quantum optics, transformations described by random unitary matrices are crucial for demonstrating the advantage of quantum over classical computation (see, e.g., the boson sampling model). Moreover, such random unitary transformations can be directly implemented in an optical circuit, by mapping their parameters to optical circuit components (that is beam splitters and phase shifters).

Random matrix theory has also found applications to the chiral Dirac operator in quantum chromodynamics, quantum gravity in two dimensions, mesoscopic physics, spin-transfer torque, the fractional quantum Hall effect, Anderson localization, quantum dots, and superconductors

Mathematical statistics and numerical analysis

In multivariate statistics, random matrices were introduced by John Wishart, who sought to estimate covariance matrices of large samples. Chernoff-, Bernstein-, and Hoeffding-type inequalities can typically be strengthened when applied to the maximal eigenvalue (i.e. the eigenvalue of largest magnitude) of a finite sum of random Hermitian matrices. Random matrix theory is used to study the spectral properties of random matrices—such as sample covariance matrices—which is of particular interest in high-dimensional statistics. Random matrix theory also saw applications in neuronal networks and deep learning, with recent work utilizing random matrices to show that hyper-parameter tunings can be cheaply transferred between large neural networks without the need for re-training.

In numerical analysis, random matrices have been used since the work of John von Neumann and Herman Goldstine to describe computation errors in operations such as matrix multiplication. Although random entries are traditional "generic" inputs to an algorithm, the concentration of measure associated with random matrix distributions implies that random matrices will not test large portions of an algorithm's input space.

Number theory

In number theory, the distribution of zeros of the Riemann zeta function (and other L-functions) is modeled by the distribution of eigenvalues of certain random matrices. The connection was first discovered by Hugh Montgomery and Freeman Dyson. It is connected to the Hilbert–Pólya conjecture.

Free probability

The relation of free probability with random matrices is a key reason for the wide use of free probability in other subjects. Voiculescu introduced the concept of freeness around 1983 in an operator algebraic context; at the beginning there was no relation at all with random matrices. This connection was only revealed later in 1991 by Voiculescu; he was motivated by the fact that the limit distribution which he found in his free central limit theorem had appeared before in Wigner's semi-circle law in the random matrix context.

Computational neuroscience

In the field of computational neuroscience, random matrices are increasingly used to model the network of synaptic connections between neurons in the brain. Dynamical models of neuronal networks with random connectivity matrix were shown to exhibit a phase transition to chaos when the variance of the synaptic weights crosses a critical value, at the limit of infinite system size. Results on random matrices have also shown that the dynamics of random-matrix models are insensitive to mean connection strength. Instead, the stability of fluctuations depends on connection strength variation and time to synchrony depends on network topology.

In the analysis of massive data such as fMRI, random matrix theory has been applied in order to perform dimension reduction. When applying an algorithm such as PCA, it is important to be able to select the number of significant components. The criteria for selecting components can be multiple (based on explained variance, Kaiser's method, eigenvalue, etc.). Random matrix theory in this content has its representative the Marchenko-Pastur distribution, which guarantees the theoretical high and low limits of the eigenvalues associated with a random variable covariance matrix. This matrix calculated in this way becomes the null hypothesis that allows one to find the eigenvalues (and their eigenvectors) that deviate from the theoretical random range. The components thus excluded become the reduced dimensional space (see examples in fMRI ).

Optimal control

In optimal control theory, the evolution of n state variables through time depends at any time on their own values and on the values of k control variables. With linear evolution, matrices of coefficients appear in the state equation (equation of evolution). In some problems the values of the parameters in these matrices are not known with certainty, in which case there are random matrices in the state equation and the problem is known as one of stochastic control. A key result in the case of linear-quadratic control with stochastic matrices is that the certainty equivalence principle does not apply: while in the absence of multiplier uncertainty (that is, with only additive uncertainty) the optimal policy with a quadratic loss function coincides with what would be decided if the uncertainty were ignored, the optimal policy may differ if the state equation contains random coefficients.

Computational mechanics

In computational mechanics, epistemic uncertainties underlying the lack of knowledge about the physics of the modeled system give rise to mathematical operators associated with the computational model, which are deficient in a certain sense. Such operators lack certain properties linked to unmodeled physics. When such operators are discretized to perform computational simulations, their accuracy is limited by the missing physics. To compensate for this deficiency of the mathematical operator, it is not enough to make the model parameters random, it is necessary to consider a mathematical operator that is random and can thus generate families of computational models in the hope that one of these captures the missing physics. Random matrices have been used in this sense, with applications in vibroacoustics, wave propagations, materials science, fluid mechanics, heat transfer, etc.

Engineering

Random matrix theory can be applied to the electrical and communications engineering research efforts to study, model and develop Massive Multiple-Input Multiple-Output (MIMO) radio systems.

History

This section needs expansion. You can help by adding to it. (April 2024)

Random matrix theory first gained attention beyond mathematics literature in the context of nuclear physics. Experiments by Enrico Fermi and others demonstrated evidence that individual nucleons cannot be approximated to move independently, leading Niels Bohr to formulate the idea of a compound nucleus. Because there was no knowledge of direct nucleon-nucleon interactions, Eugene Wigner and Leonard Eisenbud approximated that the nuclear Hamiltonian could be modeled as a random matrix. For larger atoms, the distribution of the energy eigenvalues of the Hamiltonian could be computed in order to approximate scattering cross sections by invoking the Wishart distribution.

Gaussian ensembles

The most-commonly studied random matrix distributions are the Gaussian ensembles: GOE, GUE and GSE. They are often denoted by their Dyson index, β = 1 for GOE, β = 2 for GUE, and β = 4 for GSE. This index counts the number of real components per matrix element.

Definitions

The Gaussian unitary ensemble GUE ( n ) {\displaystyle {\text{GUE}}(n)} is described by the Gaussian measure with density 1 Z GUE ( n ) e n 2 t r H 2 {\displaystyle {\frac {1}{Z_{{\text{GUE}}(n)}}}e^{-{\frac {n}{2}}\mathrm {tr} H^{2}}} on the space of n × n {\displaystyle n\times n} Hermitian matrices H = ( H i j ) i , j = 1 n {\displaystyle H=(H_{ij})_{i,j=1}^{n}} . Here Z GUE ( n ) = 2 n / 2 ( π n ) 1 2 n 2 {\displaystyle Z_{{\text{GUE}}(n)}=2^{n/2}\left({\frac {\pi }{n}}\right)^{{\frac {1}{2}}n^{2}}} is a normalization constant, chosen so that the integral of the density is equal to one. The term unitary refers to the fact that the distribution is invariant under unitary conjugation. The Gaussian unitary ensemble models Hamiltonians lacking time-reversal symmetry.

The Gaussian orthogonal ensemble GOE ( n ) {\displaystyle {\text{GOE}}(n)} is described by the Gaussian measure with density 1 Z GOE ( n ) e n 4 t r H 2 {\displaystyle {\frac {1}{Z_{{\text{GOE}}(n)}}}e^{-{\frac {n}{4}}\mathrm {tr} H^{2}}} on the space of n × n real symmetric matrices H = (Hij)
i,j=1. Its distribution is invariant under orthogonal conjugation, and it models Hamiltonians with time-reversal symmetry. Equivalently, it is generated by H = ( G + G T ) / 2 n {\displaystyle H=(G+G^{T})/{\sqrt {2n}}} , where G {\displaystyle G} is an n × n {\displaystyle n\times n} matrix with IID samples from the standard normal distribution.

The Gaussian symplectic ensemble GSE ( n ) {\displaystyle {\text{GSE}}(n)} is described by the Gaussian measure with density 1 Z GSE ( n ) e n t r H 2 {\displaystyle {\frac {1}{Z_{{\text{GSE}}(n)}}}e^{-n\mathrm {tr} H^{2}}} on the space of n × n Hermitian quaternionic matrices, e.g. symmetric square matrices composed of quaternions, H = (Hij)
i,j=1. Its distribution is invariant under conjugation by the symplectic group, and it models Hamiltonians with time-reversal symmetry but no rotational symmetry.

Point correlation functions

The ensembles as defined here have Gaussian distributed matrix elements with mean ⟨Hij⟩ = 0, and two-point correlations given by H i j H m n = H i j H n m = 1 n δ i m δ j n + 2 β n β δ i n δ j m , {\displaystyle \langle H_{ij}H_{mn}^{*}\rangle =\langle H_{ij}H_{nm}\rangle ={\frac {1}{n}}\delta _{im}\delta _{jn}+{\frac {2-\beta }{n\beta }}\delta _{in}\delta _{jm},} from which all higher correlations follow by Isserlis' theorem.

Moment generating functions

The moment generating function for the GOE is E [ e t r ( V H ) ] = e 1 4 N V + V T F 2 {\displaystyle E=e^{{\frac {1}{4N}}\|V+V^{T}\|_{F}^{2}}} where F {\displaystyle \|\cdot \|_{F}} is the Frobenius norm.

Spectral density

Spectral density of GOE/GUE/GSE, as N = 2 0 , 2 1 , . . . , 2 5 {\displaystyle N=2^{0},2^{1},...,2^{5}} . They are normalized so that the distributions converge to the semicircle distribution. The number of "humps" is equal to N.

The joint probability density for the eigenvalues λ1, λ2, ..., λn of GUE/GOE/GSE is given by

1 Z β , n k = 1 n e β 4 λ k 2 i < j | λ j λ i | β   , {\displaystyle {\frac {1}{Z_{\beta ,n}}}\prod _{k=1}^{n}e^{-{\frac {\beta }{4}}\lambda _{k}^{2}}\prod _{i<j}\left|\lambda _{j}-\lambda _{i}\right|^{\beta }~,} 1

where Zβ,n is a normalization constant which can be explicitly computed, see Selberg integral. In the case of GUE (β = 2), the formula (1) describes a determinantal point process. Eigenvalues repel as the joint probability density has a zero (of β {\displaystyle \beta } th order) for coinciding eigenvalues λ j = λ i {\displaystyle \lambda _{j}=\lambda _{i}} .

The distribution of the largest eigenvalue for GOE, and GUE, are explicitly solvable. They converge to the Tracy–Widom distribution after shifting and scaling appropriately.

Convergence to Wigner semicircular distribution

The spectrum, divided by N σ 2 {\displaystyle {\sqrt {N\sigma ^{2}}}} , converges in distribution to the semicircular distribution on the interval [ 2 , + 2 ] {\displaystyle } : ρ ( x ) = 1 2 π 4 x 2 {\displaystyle \rho (x)={\frac {1}{2\pi }}{\sqrt {4-x^{2}}}} . Here σ 2 {\displaystyle \sigma ^{2}} is the variance of off-diagonal entries. The variance of the on-diagonal entries do not matter.

Distribution of level spacings

From the ordered sequence of eigenvalues λ 1 < < λ n < λ n + 1 < {\displaystyle \lambda _{1}<\ldots <\lambda _{n}<\lambda _{n+1}<\ldots } , one defines the normalized spacings s = ( λ n + 1 λ n ) / s {\displaystyle s=(\lambda _{n+1}-\lambda _{n})/\langle s\rangle } , where s = λ n + 1 λ n {\displaystyle \langle s\rangle =\langle \lambda _{n+1}-\lambda _{n}\rangle } is the mean spacing. The probability distribution of spacings is approximately given by, p 1 ( s ) = π 2 s e π 4 s 2 {\displaystyle p_{1}(s)={\frac {\pi }{2}}s\,e^{-{\frac {\pi }{4}}s^{2}}} for the orthogonal ensemble GOE β = 1 {\displaystyle \beta =1} , p 2 ( s ) = 32 π 2 s 2 e 4 π s 2 {\displaystyle p_{2}(s)={\frac {32}{\pi ^{2}}}s^{2}\mathrm {e} ^{-{\frac {4}{\pi }}s^{2}}} for the unitary ensemble GUE β = 2 {\displaystyle \beta =2} , and p 4 ( s ) = 2 18 3 6 π 3 s 4 e 64 9 π s 2 {\displaystyle p_{4}(s)={\frac {2^{18}}{3^{6}\pi ^{3}}}s^{4}e^{-{\frac {64}{9\pi }}s^{2}}} for the symplectic ensemble GSE β = 4 {\displaystyle \beta =4} .

The numerical constants are such that p β ( s ) {\displaystyle p_{\beta }(s)} is normalized: 0 d s p β ( s ) = 1 {\displaystyle \int _{0}^{\infty }ds\,p_{\beta }(s)=1} and the mean spacing is, 0 d s s p β ( s ) = 1 , {\displaystyle \int _{0}^{\infty }ds\,s\,p_{\beta }(s)=1,} for β = 1 , 2 , 4 {\displaystyle \beta =1,2,4} .

Generalizations

Wigner matrices are random Hermitian matrices H n = ( H n ( i , j ) ) i , j = 1 n {\textstyle H_{n}=(H_{n}(i,j))_{i,j=1}^{n}} such that the entries { H n ( i , j )   , 1 i j n } {\displaystyle \left\{H_{n}(i,j)~,\,1\leq i\leq j\leq n\right\}} above the main diagonal are independent random variables with zero mean and have identical second moments.

Invariant matrix ensembles are random Hermitian matrices with density on the space of real symmetric/Hermitian/quaternionic Hermitian matrices, which is of the form 1 Z n e n V ( t r ( H ) )   , {\textstyle {\frac {1}{Z_{n}}}e^{-nV(\mathrm {tr} (H))}~,} where the function V is called the potential.

The Gaussian ensembles are the only common special cases of these two classes of random matrices. This is a consequence of a theorem by Porter and Rosenzweig.

Spectral theory of random matrices

The spectral theory of random matrices studies the distribution of the eigenvalues as the size of the matrix goes to infinity.

Empirical spectral measure

The empirical spectral measure μH of H is defined by μ H ( A ) = 1 n # { eigenvalues of  H  in  A } = N 1 A , H , A R . {\displaystyle \mu _{H}(A)={\frac {1}{n}}\,\#\left\{{\text{eigenvalues of }}H{\text{ in }}A\right\}=N_{1_{A},H},\quad A\subset \mathbb {R} .}

Usually, the limit of μ H {\displaystyle \mu _{H}} is a deterministic measure; this is a particular case of self-averaging. The cumulative distribution function of the limiting measure is called the integrated density of states and is denoted N(λ). If the integrated density of states is differentiable, its derivative is called the density of states and is denoted ρ(λ).

Alternative expressions

μ H ( A ) = 1 n i δ λ i {\displaystyle \mu _{H}(A)={\frac {1}{n}}\sum _{i}\delta _{\lambda _{i}}}

Types of convergence

Given a matrix ensemble, we say that its spectral measures converge weakly to ρ {\displaystyle \rho } iff for any measurable set A {\displaystyle A} , the ensemble-average converges: lim n E H [ μ H ( A ) ] = ρ ( A ) {\displaystyle \lim _{n\to \infty }\mathbb {E} _{H}=\rho (A)} Convergence weakly almost surely: If we sample H 1 , H 2 , H 3 , {\displaystyle H_{1},H_{2},H_{3},\dots } independently from the ensemble, then with probability 1, lim n μ H n ( A ) = ρ ( A ) {\displaystyle \lim _{n\to \infty }\mu _{H_{n}}(A)=\rho (A)} for any measurable set A {\displaystyle A} .

In another sense, weak almost sure convergence means that we sample H 1 , H 2 , H 3 , {\displaystyle H_{1},H_{2},H_{3},\dots } , not independently, but by "growing" (a stochastic process), then with probability 1, lim n μ H n ( A ) = ρ ( A ) {\displaystyle \lim _{n\to \infty }\mu _{H_{n}}(A)=\rho (A)} for any measurable set A {\displaystyle A} .

For example, we can "grow" a sequence of matrices from the Gaussian ensemble as follows:

  • Sample an infinite doubly infinite sequence of standard random variables { G i , j } i , j = 1 , 2 , 3 , {\displaystyle \{G_{i,j}\}_{i,j=1,2,3,\dots }} .
  • Define each H n = ( G n + G n T ) / 2 n {\displaystyle H_{n}=(G_{n}+G_{n}^{T})/{\sqrt {2n}}} where G n {\displaystyle G_{n}} is the matrix made of entries { G i , j } i , j = 1 , 2 , , n {\displaystyle \{G_{i,j}\}_{i,j=1,2,\dots ,n}} .

Note that generic matrix ensembles do not allow us to grow, but most of the common ones, such as the three Gaussian ensembles, do allow us to grow.

Global regime

In the global regime, one is interested in the distribution of linear statistics of the form N f , H = n 1 tr f ( H ) {\displaystyle N_{f,H}=n^{-1}{\text{tr}}f(H)} .

The limit of the empirical spectral measure for Wigner matrices was described by Eugene Wigner; see Wigner semicircle distribution and Wigner surmise. As far as sample covariance matrices are concerned, a theory was developed by Marčenko and Pastur.

The limit of the empirical spectral measure of invariant matrix ensembles is described by a certain integral equation which arises from potential theory.

Fluctuations

For the linear statistics Nf,H = n Σ f(λj), one is also interested in the fluctuations about ∫ f(λdN(λ). For many classes of random matrices, a central limit theorem of the form N f , H f ( λ ) d N ( λ ) σ f , n D N ( 0 , 1 ) {\displaystyle {\frac {N_{f,H}-\int f(\lambda )\,dN(\lambda )}{\sigma _{f,n}}}{\overset {D}{\longrightarrow }}N(0,1)} is known.

The variational problem for the unitary ensembles

Consider the measure

d μ N ( μ ) = 1 Z ~ N e H N ( λ ) d λ , H N ( λ ) = j k ln | λ j λ k | + N j = 1 N Q ( λ j ) , {\displaystyle \mathrm {d} \mu _{N}(\mu )={\frac {1}{{\widetilde {Z}}_{N}}}e^{-H_{N}(\lambda )}\mathrm {d} \lambda ,\qquad H_{N}(\lambda )=-\sum \limits _{j\neq k}\ln |\lambda _{j}-\lambda _{k}|+N\sum \limits _{j=1}^{N}Q(\lambda _{j}),}

where Q ( M ) {\displaystyle Q(M)} is the potential of the ensemble and let ν {\displaystyle \nu } be the empirical spectral measure.

We can rewrite H N ( λ ) {\displaystyle H_{N}(\lambda )} with ν {\displaystyle \nu } as

H N ( λ ) = N 2 [ x y ln | x y | d ν ( x ) d ν ( y ) + Q ( x ) d ν ( x ) ] , {\displaystyle H_{N}(\lambda )=N^{2}\left,}

the probability measure is now of the form

d μ N ( μ ) = 1 Z ~ N e N 2 I Q ( ν ) d λ , {\displaystyle \mathrm {d} \mu _{N}(\mu )={\frac {1}{{\widetilde {Z}}_{N}}}e^{-N^{2}I_{Q}(\nu )}\mathrm {d} \lambda ,}

where I Q ( ν ) {\displaystyle I_{Q}(\nu )} is the above functional inside the squared brackets.

Let now

M 1 ( R ) = { ν : ν 0 ,   R d ν = 1 } {\displaystyle M_{1}(\mathbb {R} )=\left\{\nu :\nu \geq 0,\ \int _{\mathbb {R} }\mathrm {d} \nu =1\right\}}

be the space of one-dimensional probability measures and consider the minimizer

E Q = inf ν M 1 ( R ) x y ln | x y | d ν ( x ) d ν ( y ) + Q ( x ) d ν ( x ) . {\displaystyle E_{Q}=\inf \limits _{\nu \in M_{1}(\mathbb {R} )}-\int \int _{x\neq y}\ln |x-y|\mathrm {d} \nu (x)\mathrm {d} \nu (y)+\int Q(x)\mathrm {d} \nu (x).}

For E Q {\displaystyle E_{Q}} there exists a unique equilibrium measure ν Q {\displaystyle \nu _{Q}} through the Euler-Lagrange variational conditions for some real constant l {\displaystyle l}

2 R log | x y | d ν ( y ) Q ( x ) = l , x J {\displaystyle 2\int _{\mathbb {R} }\log |x-y|\mathrm {d} \nu (y)-Q(x)=l,\quad x\in J}
2 R log | x y | d ν ( y ) Q ( x ) l , x R J {\displaystyle 2\int _{\mathbb {R} }\log |x-y|\mathrm {d} \nu (y)-Q(x)\leq l,\quad x\in \mathbb {R} \setminus J}

where J = j = 1 q [ a j , b j ] {\displaystyle J=\bigcup \limits _{j=1}^{q}} is the support of the measure and define

q ( x ) = ( Q ( x ) 2 ) 2 + Q ( x ) Q ( y ) x y d ν Q ( y ) {\displaystyle q(x)=-\left({\frac {Q'(x)}{2}}\right)^{2}+\int {\frac {Q'(x)-Q'(y)}{x-y}}\mathrm {d} \nu _{Q}(y)} .

The equilibrium measure ν Q {\displaystyle \nu _{Q}} has the following Radon–Nikodym density

d ν Q ( x ) d x = 1 π q ( x ) . {\displaystyle {\frac {\mathrm {d} \nu _{Q}(x)}{\mathrm {d} x}}={\frac {1}{\pi }}{\sqrt {q(x)}}.}

Mesoscopic regime

The typical statement of the Wigner semicircular law is equivalent to the following statement: For each fixed interval [ λ 0 Δ λ , λ 0 + Δ λ ] {\displaystyle } centered at a point λ 0 {\displaystyle \lambda _{0}} , as N {\displaystyle N} , the number of dimensions of the gaussian ensemble increases, the proportion of the eigenvalues falling within the interval converges to [ λ 0 Δ λ , λ 0 + Δ λ ] ρ ( t ) d t {\displaystyle \int _{}\rho (t)dt} , where ρ ( t ) {\displaystyle \rho (t)} is the density of the semicircular distribution.

If Δ λ {\displaystyle \Delta \lambda } can be allowed to decrease as N {\displaystyle N} increases, then we obtain strictly stronger theorems, named "local laws" or "mesoscopic regime".

The mesoscopic regime is intermediate between the local and the global. In the mesoscopic regime, one is interested in the limit distribution of eigenvalues in a set that shrinks to zero, but slow enough, such that the number of eigenvalues inside {\displaystyle \to \infty } .

For example, the Ginibre ensemble has a mesoscopic law: For any sequence of shrinking disks with areas u {\displaystyle u} inside the unite disk, if the disks have area A n = O ( n 1 + ϵ ) {\displaystyle A_{n}=O(n^{-1+\epsilon })} , the conditional distribution of the spectrum inside the disks also converges to a uniform distribution. That is, if we cut the shrinking disks along with the spectrum falling inside the disks, then scale the disks up to unit area, we would see the spectra converging to a flat distribution in the disks.

Local regime

In the local regime, one is interested in the limit distribution of eigenvalues in a set that shrinks so fast that the number of eigenvalues remains O ( 1 ) {\displaystyle O(1)} .

Typically this means the study of spacings between eigenvalues, and, more generally, in the joint distribution of eigenvalues in an interval of length of order 1/n. One distinguishes between bulk statistics, pertaining to intervals inside the support of the limiting spectral measure, and edge statistics, pertaining to intervals near the boundary of the support.

Bulk statistics

Formally, fix λ 0 {\displaystyle \lambda _{0}} in the interior of the support of N ( λ ) {\displaystyle N(\lambda )} . Then consider the point process Ξ ( λ 0 ) = j δ ( n ρ ( λ 0 ) ( λ j λ 0 ) )   , {\displaystyle \Xi (\lambda _{0})=\sum _{j}\delta {\Big (}{\cdot }-n\rho (\lambda _{0})(\lambda _{j}-\lambda _{0}){\Big )}~,} where λ j {\displaystyle \lambda _{j}} are the eigenvalues of the random matrix.

The point process Ξ ( λ 0 ) {\displaystyle \Xi (\lambda _{0})} captures the statistical properties of eigenvalues in the vicinity of λ 0 {\displaystyle \lambda _{0}} . For the Gaussian ensembles, the limit of Ξ ( λ 0 ) {\displaystyle \Xi (\lambda _{0})} is known; thus, for GUE it is a determinantal point process with the kernel K ( x , y ) = sin π ( x y ) π ( x y ) {\displaystyle K(x,y)={\frac {\sin \pi (x-y)}{\pi (x-y)}}} (the sine kernel).

The universality principle postulates that the limit of Ξ ( λ 0 ) {\displaystyle \Xi (\lambda _{0})} as n {\displaystyle n\to \infty } should depend only on the symmetry class of the random matrix (and neither on the specific model of random matrices nor on λ 0 {\displaystyle \lambda _{0}} ). Rigorous proofs of universality are known for invariant matrix ensembles and Wigner matrices.

Edge statistics

Main article: Tracy–Widom distribution

One example of edge statistics is the Tracy–Widom distribution.

As another example, consider the Ginibre ensemble. It can be real or complex. The real Ginibre ensemble has i.i.d. standard Gaussian entries N ( 0 , 1 ) {\displaystyle {\mathcal {N}}(0,1)} , and the complex Ginibre ensemble has i.i.d. standard complex Gaussian entries N ( 0 , 1 / 2 ) + i N ( 0 , 1 / 2 ) {\displaystyle {\mathcal {N}}(0,1/2)+i{\mathcal {N}}(0,1/2)} .

Now let G n {\displaystyle G_{n}} be sampled from the real or complex ensemble, and let ρ ( G n ) {\displaystyle \rho (G_{n})} be the absolute value of its maximal eigenvalue: ρ ( G n ) := max j | λ j | {\displaystyle \rho (G_{n}):=\max _{j}|\lambda _{j}|} We have the following theorem for the edge statistics:

Edge statistics of the Ginibre ensemble — For G n {\displaystyle G_{n}} and ρ ( G n ) {\displaystyle \rho \left(G_{n}\right)} as above, with probability one, lim n 1 n ρ ( G n ) = 1 {\displaystyle \lim _{n\rightarrow \infty }{\frac {1}{\sqrt {n}}}\rho \left(G_{n}\right)=1}

Moreover, if γ n = log ( n 2 π ) 2 log ( log ( n ) ) {\displaystyle \gamma _{n}=\log \left({\frac {n}{2\pi }}\right)-2\log(\log(n))} and Y n := 4 n γ n ( 1 n ρ ( G n ) 1 γ n 4 n ) , {\displaystyle Y_{n}:={\sqrt {4n\gamma _{n}}}\left({\frac {1}{\sqrt {n}}}\rho \left(G_{n}\right)-1-{\sqrt {\frac {\gamma _{n}}{4n}}}\right),} then Y n {\displaystyle Y_{n}} converges in distribution to the Gumbel law, i.e., the probability measure on R {\displaystyle \mathbb {R} } with cumulative distribution function F G u m ( x ) = e e x {\displaystyle F_{\mathrm {Gum} }(x)=e^{-e^{-x}}} .

This theorem refines the circular law of the Ginibre ensemble. In words, the circular law says that the spectrum of 1 n G n {\displaystyle {\frac {1}{\sqrt {n}}}G_{n}} almost surely falls uniformly on the unit disc. and the edge statistics theorem states that the radius of the almost-unit-disk is about 1 γ n 4 n {\displaystyle 1-{\sqrt {\frac {\gamma _{n}}{4n}}}} , and fluctuates on a scale of 1 4 n γ n {\displaystyle {\frac {1}{\sqrt {4n\gamma _{n}}}}} , according to the Gumbel law.

Correlation functions

The joint probability density of the eigenvalues of n × n {\displaystyle n\times n} random Hermitian matrices M H n × n {\displaystyle M\in \mathbf {H} ^{n\times n}} , with partition functions of the form Z n = M H n × n d μ 0 ( M ) e tr ( V ( M ) ) {\displaystyle Z_{n}=\int _{M\in \mathbf {H} ^{n\times n}}d\mu _{0}(M)e^{{\text{tr}}(V(M))}} where V ( x ) := j = 1 v j x j {\displaystyle V(x):=\sum _{j=1}^{\infty }v_{j}x^{j}} and d μ 0 ( M ) {\displaystyle d\mu _{0}(M)} is the standard Lebesgue measure on the space H n × n {\displaystyle \mathbf {H} ^{n\times n}} of Hermitian n × n {\displaystyle n\times n} matrices, is given by p n , V ( x 1 , , x n ) = 1 Z n , V i < j ( x i x j ) 2 e i V ( x i ) . {\displaystyle p_{n,V}(x_{1},\dots ,x_{n})={\frac {1}{Z_{n,V}}}\prod _{i<j}(x_{i}-x_{j})^{2}e^{-\sum _{i}V(x_{i})}.} The k {\displaystyle k} -point correlation functions (or marginal distributions) are defined as R n , V ( k ) ( x 1 , , x k ) = n ! ( n k ) ! R d x k + 1 R d x n p n , V ( x 1 , x 2 , , x n ) , {\displaystyle R_{n,V}^{(k)}(x_{1},\dots ,x_{k})={\frac {n!}{(n-k)!}}\int _{\mathbf {R} }dx_{k+1}\cdots \int _{\mathbb {R} }dx_{n}\,p_{n,V}(x_{1},x_{2},\dots ,x_{n}),} which are skew symmetric functions of their variables. In particular, the one-point correlation function, or density of states, is R n , V ( 1 ) ( x 1 ) = n R d x 2 R d x n p n , V ( x 1 , x 2 , , x n ) . {\displaystyle R_{n,V}^{(1)}(x_{1})=n\int _{\mathbb {R} }dx_{2}\cdots \int _{\mathbf {R} }dx_{n}\,p_{n,V}(x_{1},x_{2},\dots ,x_{n}).} Its integral over a Borel set B R {\displaystyle B\subset \mathbf {R} } gives the expected number of eigenvalues contained in B {\displaystyle B} : B R n , V ( 1 ) ( x ) d x = E ( # { eigenvalues in  B } ) . {\displaystyle \int _{B}R_{n,V}^{(1)}(x)dx=\mathbf {E} \left(\#\{{\text{eigenvalues in }}B\}\right).}

The following result expresses these correlation functions as determinants of the matrices formed from evaluating the appropriate integral kernel at the pairs ( x i , x j ) {\displaystyle (x_{i},x_{j})} of points appearing within the correlator.

Theorem For any k {\displaystyle k} , 1 k n {\displaystyle 1\leq k\leq n} the k {\displaystyle k} -point correlation function R n , V ( k ) {\displaystyle R_{n,V}^{(k)}} can be written as a determinant R n , V ( k ) ( x 1 , x 2 , , x k ) = det 1 i , j k ( K n , V ( x i , x j ) ) , {\displaystyle R_{n,V}^{(k)}(x_{1},x_{2},\dots ,x_{k})=\det _{1\leq i,j\leq k}\left(K_{n,V}(x_{i},x_{j})\right),} where K n , V ( x , y ) {\displaystyle K_{n,V}(x,y)} is the n {\displaystyle n} th Christoffel-Darboux kernel K n , V ( x , y ) := k = 0 n 1 ψ k ( x ) ψ k ( y ) , {\displaystyle K_{n,V}(x,y):=\sum _{k=0}^{n-1}\psi _{k}(x)\psi _{k}(y),} associated to V {\displaystyle V} , written in terms of the quasipolynomials ψ k ( x ) = 1 h k p k ( z ) e V ( z ) / 2 , {\displaystyle \psi _{k}(x)={1 \over {\sqrt {h_{k}}}}\,p_{k}(z)\,e^{-V(z)/2},} where { p k ( x ) } k N {\displaystyle \{p_{k}(x)\}_{k\in \mathbf {N} }} is a complete sequence of monic polynomials, of the degrees indicated, satisfying the orthogonilty conditions R ψ j ( x ) ψ k ( x ) d x = δ j k . {\displaystyle \int _{\mathbf {R} }\psi _{j}(x)\psi _{k}(x)dx=\delta _{jk}.}

Other classes of random matrices

Wishart matrices

Main article: Wishart distribution

Wishart matrices are n × n random matrices of the form H = X X, where X is an n × m random matrix (m ≥ n) with independent entries, and X is its conjugate transpose. In the important special case considered by Wishart, the entries of X are identically distributed Gaussian random variables (either real or complex).

The limit of the empirical spectral measure of Wishart matrices was found by Vladimir Marchenko and Leonid Pastur.

Random unitary matrices

Main article: Circular ensembles

Non-Hermitian random matrices

Main article: Circular law

Selected bibliography

Books

  • Mehta, M.L. (2004). Random Matrices. Amsterdam: Elsevier/Academic Press. ISBN 0-12-088409-7.
  • Anderson, G.W.; Guionnet, A.; Zeitouni, O. (2010). An introduction to random matrices. Cambridge: Cambridge University Press. ISBN 978-0-521-19452-5.
  • Akemann, G.; Baik, J.; Di Francesco, P. (2011). The Oxford Handbook of Random Matrix Theory. Oxford: Oxford University Press. ISBN 978-0-19-957400-1.
  • Potters, Marc; Bouchaud, Jean-Philippe (2020-11-30). A First Course in Random Matrix Theory: for Physicists, Engineers and Data Scientists. Cambridge University Press. doi:10.1017/9781108768900. ISBN 978-1-108-76890-0.

Survey articles

Historic works

References

  1. ^ Wigner, Eugene P. (1955). "Characteristic Vectors of Bordered Matrices With Infinite Dimensions". Annals of Mathematics. 62 (3): 548–564. doi:10.2307/1970079. ISSN 0003-486X. JSTOR 1970079.
  2. ^ Block, R. C.; Good, W. M.; Harvey, J. A.; Schmitt, H. W.; Trammell, G. T., eds. (1957-07-01). Conference on Neutron Physics by Time-Of-Flight Held at Gatlinburg, Tennessee, November 1 and 2, 1956 (Report ORNL-2309). Oak Ridge, Tennessee: Oak Ridge National Lab. doi:10.2172/4319287. OSTI 4319287.
  3. ^ Bohigas, O.; Giannoni, M.J.; Schmit, Schmit (1984). "Characterization of Chaotic Quantum Spectra and Universality of Level Fluctuation Laws". Phys. Rev. Lett. 52 (1): 1–4. Bibcode:1984PhRvL..52....1B. doi:10.1103/PhysRevLett.52.1.
  4. ^ Mehta 2004
  5. Aaronson, Scott; Arkhipov, Alex (2013). "The computational complexity of linear optics". Theory of Computing. 9: 143–252. doi:10.4086/toc.2013.v009a004.
  6. Russell, Nicholas; Chakhmakhchyan, Levon; O'Brien, Jeremy; Laing, Anthony (2017). "Direct dialling of Haar random unitary matrices". New J. Phys. 19 (3): 033007. arXiv:1506.06220. Bibcode:2017NJPh...19c3007R. doi:10.1088/1367-2630/aa60ed. S2CID 46915633.
  7. Verbaarschot JJ, Wettig T (2000). "Random Matrix Theory and Chiral Symmetry in QCD". Annu. Rev. Nucl. Part. Sci. 50: 343–410. arXiv:hep-ph/0003017. Bibcode:2000ARNPS..50..343V. doi:10.1146/annurev.nucl.50.1.343. S2CID 119470008.
  8. Franchini F, Kravtsov VE (October 2009). "Horizon in random matrix theory, the Hawking radiation, and flow of cold atoms". Phys. Rev. Lett. 103 (16): 166401. arXiv:0905.3533. Bibcode:2009PhRvL.103p6401F. doi:10.1103/PhysRevLett.103.166401. PMID 19905710. S2CID 11122957.
  9. Sánchez D, Büttiker M (September 2004). "Magnetic-field asymmetry of nonlinear mesoscopic transport". Phys. Rev. Lett. 93 (10): 106802. arXiv:cond-mat/0404387. Bibcode:2004PhRvL..93j6802S. doi:10.1103/PhysRevLett.93.106802. PMID 15447435. S2CID 11686506.
  10. Rychkov VS, Borlenghi S, Jaffres H, Fert A, Waintal X (August 2009). "Spin torque and waviness in magnetic multilayers: a bridge between Valet-Fert theory and quantum approaches". Phys. Rev. Lett. 103 (6): 066602. arXiv:0902.4360. Bibcode:2009PhRvL.103f6602R. doi:10.1103/PhysRevLett.103.066602. PMID 19792592. S2CID 209013.
  11. Callaway DJE (April 1991). "Random matrices, fractional statistics, and the quantum Hall effect". Phys. Rev. B. 43 (10): 8641–8643. Bibcode:1991PhRvB..43.8641C. doi:10.1103/PhysRevB.43.8641. PMID 9996505.
  12. Janssen M, Pracz K (June 2000). "Correlated random band matrices: localization-delocalization transitions". Phys. Rev. E. 61 (6 Pt A): 6278–86. arXiv:cond-mat/9911467. Bibcode:2000PhRvE..61.6278J. doi:10.1103/PhysRevE.61.6278. PMID 11088301. S2CID 34140447.
  13. Zumbühl DM, Miller JB, Marcus CM, Campman K, Gossard AC (December 2002). "Spin-orbit coupling, antilocalization, and parallel magnetic fields in quantum dots". Phys. Rev. Lett. 89 (27): 276803. arXiv:cond-mat/0208436. Bibcode:2002PhRvL..89A6803Z. doi:10.1103/PhysRevLett.89.276803. PMID 12513231. S2CID 9344722.
  14. Bahcall SR (December 1996). "Random Matrix Model for Superconductors in a Magnetic Field". Phys. Rev. Lett. 77 (26): 5276–5279. arXiv:cond-mat/9611136. Bibcode:1996PhRvL..77.5276B. doi:10.1103/PhysRevLett.77.5276. PMID 10062760. S2CID 206326136.
  15. Wishart 1928
  16. Tropp, J. (2011). "User-Friendly Tail Bounds for Sums of Random Matrices". Foundations of Computational Mathematics. 12 (4): 389–434. arXiv:1004.4389. doi:10.1007/s10208-011-9099-z. S2CID 17735965.
  17. Pennington, Jeffrey; Bahri, Yasaman (2017). "Geometry of Neural Network Loss Surfaces via Random Matrix Theory". ICML'17: Proceedings of the 34th International Conference on Machine Learning. 70. S2CID 39515197.
  18. Yang, Greg (2022). "Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer". arXiv:2203.03466v2 .
  19. von Neumann & Goldstine 1947
  20. Edelman & Rao 2005
  21. Keating, Jon (1993). "The Riemann zeta-function and quantum chaology". Proc. Internat. School of Phys. Enrico Fermi. CXIX: 145–185. doi:10.1016/b978-0-444-81588-0.50008-0. ISBN 9780444815880.
  22. Mingo, James A.; Speicher, Roland (2017): Free Probability and Random Matrices. Fields Institute Monographs, Vol. 35, Springer, New York
  23. Voiculescu, Dan (1991): "Limit laws for random matrices and free products". Inventiones mathematicae 104.1: 201-220
  24. Sompolinsky, H.; Crisanti, A.; Sommers, H. (July 1988). "Chaos in Random Neural Networks". Physical Review Letters. 61 (3): 259–262. Bibcode:1988PhRvL..61..259S. doi:10.1103/PhysRevLett.61.259. PMID 10039285. S2CID 16967637.
  25. Rajan, Kanaka; Abbott, L. (November 2006). "Eigenvalue Spectra of Random Matrices for Neural Networks". Physical Review Letters. 97 (18): 188104. Bibcode:2006PhRvL..97r8104R. doi:10.1103/PhysRevLett.97.188104. PMID 17155583.
  26. Wainrib, Gilles; Touboul, Jonathan (March 2013). "Topological and Dynamical Complexity of Random Neural Networks". Physical Review Letters. 110 (11): 118101. arXiv:1210.5082. Bibcode:2013PhRvL.110k8101W. doi:10.1103/PhysRevLett.110.118101. PMID 25166580. S2CID 1188555.
  27. Timme, Marc; Wolf, Fred; Geisel, Theo (February 2004). "Topological Speed Limits to Network Synchronization". Physical Review Letters. 92 (7): 074101. arXiv:cond-mat/0306512. Bibcode:2004PhRvL..92g4101T. doi:10.1103/PhysRevLett.92.074101. PMID 14995853. S2CID 5765956.
  28. Muir, Dylan; Mrsic-Flogel, Thomas (2015). "Eigenspectrum bounds for semirandom matrices with modular and spatial structure for neural networks" (PDF). Phys. Rev. E. 91 (4): 042808. Bibcode:2015PhRvE..91d2808M. doi:10.1103/PhysRevE.91.042808. PMID 25974548.
  29. Vergani, Alberto A.; Martinelli, Samuele; Binaghi, Elisabetta (July 2019). "Resting state fMRI analysis using unsupervised learning algorithms". Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization. 8 (3). Taylor&Francis: 2168–1171. doi:10.1080/21681163.2019.1636413.
  30. Burda, Z; Kornelsen, J; Nowak, MA; Porebski, B; Sboto-Frankenstein, U; Tomanek, B; Tyburczyk, J (2013). "Collective Correlations of Brodmann Areas fMRI Study with RMT-Denoising". Acta Physica Polonica B. 44 (6): 1243. arXiv:1306.3825. Bibcode:2013AcPPB..44.1243B. doi:10.5506/APhysPolB.44.1243.
  31. Chow, Gregory P. (1976). Analysis and Control of Dynamic Economic Systems. New York: Wiley. ISBN 0-471-15616-7.
  32. Turnovsky, Stephen (1974). "The stability properties of optimal economic policies". American Economic Review. 64 (1): 136–148. JSTOR 1814888.
  33. Soize, C. (2005-04-08). "Random matrix theory for modeling uncertainties in computational mechanics" (PDF). Computer Methods in Applied Mechanics and Engineering. 194 (12–16): 1333–1366. Bibcode:2005CMAME.194.1333S. doi:10.1016/j.cma.2004.06.038. ISSN 1879-2138. S2CID 58929758.
  34. Bohigas, Oriol; Weidenmuller, Hans (2015). Akemann, Gernot; Baik, Jinho; Di Francesco, Philippe (eds.). "History – an overview". academic.oup.com. pp. 15–40. doi:10.1093/oxfordhb/9780198744191.013.2. ISBN 978-0-19-874419-1. Retrieved 2024-04-22.
  35. Chiani M (2014). "Distribution of the largest eigenvalue for real Wishart and Gaussian random matrices and a simple approximation for the Tracy-Widom distribution". Journal of Multivariate Analysis. 129: 69–81. arXiv:1209.3394. doi:10.1016/j.jmva.2014.04.002. S2CID 15889291.
  36. Porter, C. E.; Rosenzweig, N. (1960-01-01). "STATISTICAL PROPERTIES OF ATOMIC AND NUCLEAR SPECTRA". Ann. Acad. Sci. Fennicae. Ser. A VI. 44. OSTI 4147616.
  37. Livan, Giacomo; Novaes, Marcel; Vivo, Pierpaolo (2018), Livan, Giacomo; Novaes, Marcel; Vivo, Pierpaolo (eds.), "Classified Material", Introduction to Random Matrices: Theory and Practice, SpringerBriefs in Mathematical Physics, vol. 26, Cham: Springer International Publishing, pp. 15–21, doi:10.1007/978-3-319-70885-0_3, ISBN 978-3-319-70885-0, retrieved 2023-05-17
  38. Meckes, Elizabeth (2021-01-08). "The Eigenvalues of Random Matrices". arXiv:2101.02928 .
  39. ^ .Marčenko, V A; Pastur, L A (1967). "Distribution of eigenvalues for some sets of random matrices". Mathematics of the USSR-Sbornik. 1 (4): 457–483. Bibcode:1967SbMat...1..457M. doi:10.1070/SM1967v001n04ABEH001994.
  40. Pastur 1973
  41. Pastur, L.; Shcherbina, M. (1995). "On the Statistical Mechanics Approach in the Random Matrix Theory: Integrated Density of States". J. Stat. Phys. 79 (3–4): 585–611. Bibcode:1995JSP....79..585D. doi:10.1007/BF02184872. S2CID 120731790.
  42. Johansson, K. (1998). "On fluctuations of eigenvalues of random Hermitian matrices". Duke Math. J. 91 (1): 151–204. doi:10.1215/S0012-7094-98-09108-6.
  43. Pastur, L.A. (2005). "A simple approach to the global regime of Gaussian ensembles of random matrices". Ukrainian Math. J. 57 (6): 936–966. doi:10.1007/s11253-005-0241-4. S2CID 121531907.
  44. Harnad, John (15 July 2013). Random Matrices, Random Processes and Integrable Systems. Springer. pp. 263–266. ISBN 978-1461428770.
  45. Erdős, László; Schlein, Benjamin; Yau, Horng-Tzer (April 2009). "Local Semicircle Law and Complete Delocalization for Wigner Random Matrices". Communications in Mathematical Physics. 287 (2): 641–655. arXiv:0803.0542. Bibcode:2009CMaPh.287..641E. doi:10.1007/s00220-008-0636-9. ISSN 0010-3616.
  46. ^ Bourgade, Paul; Yau, Horng-Tzer; Yin, Jun (2014-08-01). "Local circular law for random matrices". Probability Theory and Related Fields. 159 (3): 545–595. arXiv:1206.1449. doi:10.1007/s00440-013-0514-z. ISSN 1432-2064.
  47. Pastur, L.; Shcherbina, M. (1997). "Universality of the local eigenvalue statistics for a class of unitary invariant random matrix ensembles". Journal of Statistical Physics. 86 (1–2): 109–147. Bibcode:1997JSP....86..109P. doi:10.1007/BF02180200. S2CID 15117770.
  48. Deift, P.; Kriecherbauer, T.; McLaughlin, K.T.-R.; Venakides, S.; Zhou, X. (1997). "Asymptotics for polynomials orthogonal with respect to varying exponential weights". International Mathematics Research Notices. 1997 (16): 759–782. doi:10.1155/S1073792897000500.
  49. Erdős, L.; Péché, S.; Ramírez, J.A.; Schlein, B.; Yau, H.T. (2010). "Bulk universality for Wigner matrices". Communications on Pure and Applied Mathematics. 63 (7): 895–925. arXiv:0905.4176. doi:10.1002/cpa.20317.
  50. Tao, Terence; Vu, Van H. (2010). "Random matrices: universality of local eigenvalue statistics up to the edge". Communications in Mathematical Physics. 298 (2): 549–572. arXiv:0908.1982. Bibcode:2010CMaPh.298..549T. doi:10.1007/s00220-010-1044-5. S2CID 16594369.
  51. Rider, B (2003-03-28). "A limit theorem at the edge of a non-Hermitian random matrix ensemble". Journal of Physics A: Mathematical and General. 36 (12): 3401–3409. Bibcode:2003JPhA...36.3401R. doi:10.1088/0305-4470/36/12/331. ISSN 0305-4470.

External links

Matrix classes
Explicitly constrained entries
Constant
Conditions on eigenvalues or eigenvectors
Satisfying conditions on products or inverses
With specific applications
Used in statistics
Used in graph theory
Used in science and engineering
Related terms
Categories: