Misplaced Pages

Gram matrix: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editContent deleted Content addedVisualWikitext
Revision as of 01:10, 1 April 2021 edit200.7.90.139 (talk) Gram determinant: - Gram determinant of n m-dimensional vectors measures the n-dimensional volume of the paralletope spanned by the vectors. Article incorrectly said m-dimensional volume.← Previous edit Latest revision as of 20:38, 8 January 2025 edit undo104.138.208.182 (talk) Add a mention of the role of the Gram determinant as an inner product on the exterior product space, and the interpretation as a Pythagorean Theorem. 
(66 intermediate revisions by 30 users not shown)
Line 1: Line 1:
{{short description|Matrix of inner products of a set of vectors}} {{short description|Matrix of inner products of a set of vectors}}
In ], the '''Gram matrix''' (or '''Gramian matrix''', '''Gramian''') of a set of vectors <math>v_1,\dots, v_n</math> in an ] is the ] of ]s, whose entries are given by <math>G_{ij} = \left\langle v_i, v_j \right\rangle</math>.<ref name="HJ-7.2.10">{{harvnb|Horn|Johnson|2013|p=441}}, p.441, Theorem 7.2.10</ref> If the vectors <math>v_1,\dots, v_n</math> are real and the columns of matrix <math>X</math>, then the Gram matrix is <math>X^\top X</math>. In ], the '''Gram matrix''' (or '''Gramian matrix''', '''Gramian''') of a set of vectors <math>v_1,\dots, v_n</math> in an ] is the ] of ]s, whose entries are given by the ] <math>G_{ij} = \left\langle v_i, v_j \right\rangle</math>.<ref name="HJ-7.2.10">{{harvnb|Horn|Johnson|2013|p=441}}, p.441, Theorem 7.2.10</ref> If the vectors <math>v_1,\dots, v_n</math> are the columns of matrix <math>X</math> then the Gram matrix is <math>X^\dagger X</math> in the general case that the vector coordinates are complex numbers, which simplifies to <math>X^\top X</math> for the case that the vector coordinates are real numbers.


An important application is to compute ]: a set of vectors are linearly independent if and only if the ] (the ] of the Gram matrix) is non-zero. An important application is to compute ]: a set of vectors are linearly independent if and only if the ] (the ] of the Gram matrix) is non-zero.
Line 7: Line 7:


==Examples== ==Examples==
For finite-dimensional real vectors in <math>\mathbb{R}^n</math> with the usual Euclidean ], the Gram matrix is simply <math>G = V^\top V</math>, where <math>V</math> is a matrix whose columns are the vectors <math>v_k</math>. For ] vectors in <math>\mathbb{C}^n</math>, <math>G = V^* V</math>, where <math>V^*</math> is the ] of <math>V</math>. For finite-dimensional real vectors in <math>\mathbb{R}^n</math> with the usual Euclidean ], the Gram matrix is <math>G = V^\top V</math>, where <math>V</math> is a matrix whose columns are the vectors <math>v_k</math> and <math>V^\top</math> is its ] whose rows are the vectors <math>v_k^\top</math>. For ] vectors in <math>\mathbb{C}^n</math>, <math>G = V^\dagger V</math>, where <math>V^\dagger</math> is the ] of <math>V</math>.


Given ] <math>\{\ell_i(\cdot),\, i = 1,\dots,n\}</math> on the interval <math>\left</math>, the Gram matrix <math>G = \left</math> is: Given ]s <math>\{\ell_i(\cdot),\, i = 1,\dots,n\}</math> on the interval <math>\left</math>, the Gram matrix <math>G = \left</math> is:


: <math>G_{ij} = \int_{t_0}^{t_f} \ell_i(\tau)\ell_j^*(\tau)\, d\tau. </math> : <math>G_{ij} = \int_{t_0}^{t_f} \ell_i^*(\tau)\ell_j(\tau)\, d\tau. </math>
where <math>\ell_i^*(\tau)</math> is the ] of <math>\ell_i(\tau)</math>.


For any ] <math>B</math> on a ] ] over any ] we can define a Gram matrix <math>G</math> attached to a set of vectors <math>v_1,\dots, v_n</math> by <math>G_{ij} = B\left(v_i, v_j\right)</math>. The matrix will be symmetric if the bilinear form <math>B</math> is symmetric. For any ] <math>B</math> on a ] ] over any ] we can define a Gram matrix <math>G</math> attached to a set of vectors <math>v_1, \dots, v_n</math> by <math>G_{ij} = B\left(v_i, v_j\right)</math>. The matrix will be symmetric if the bilinear form <math>B</math> is symmetric.


===Applications=== ===Applications===
* In ], given an embedded <math>k</math>-dimensional ] <math>M\subset \mathbb{R}^n</math> and a parametrization <math>\phi: U\to M</math> for {{nowrap|<math>(x_1, \ldots, x_k)\in U\subset\mathbb{R}^k</math>,}} the volume form <math>\omega</math> on <math>M</math> induced by the embedding may be computed using the Gramian of the coordinate tangent vectors: <math display="block">\omega = \sqrt{\det G}\ dx_1 \cdots dx_k,\quad G = \left.</math> This generalizes the classical surface integral of a parametrized surface <math>\phi:U\to S\subset \mathbb{R}^3</math> for <math>(x, y)\in U\subset\mathbb{R}^2</math>: <math display="block">\int_S f\ dA = \iint_U f(\phi(x, y))\, \left|\frac{\partial\phi}{\partial x}\,{\times}\,\frac{\partial\phi}{\partial y}\right|\, dx\, dy.</math>
{{bulleted list
* If the vectors are centered ]s, the Gramian is approximately proportional to the ''']''', with the scaling determined by the number of elements in the vector.
| In ], given an embedded <math>k</math>-dimensional Riemannian manifold <math>M\subset \mathbb{R}^n</math> and a coordinate chart <math>\phi: U\to M</math> for <math>(x_1, \ldots, x_k)\in U\subset\mathbb{R}^k</math>, the volume form <math>\omega</math> on <math>M</math> induced by the embedding may be computed using the Gramian of the coordinate tangent vectors:
* In ], the Gram matrix of a set of ] is the ''']'''.
: <math>\omega = \sqrt{\det G}\ dx_1 \cdots dx_k,\quad G = \left.</math>
* In ] (or more generally ]), the ''']''' and ''']''' determine properties of a linear system.

* Gramian matrices arise in covariance structure model fitting (see e.g., Jamshidian and Bentler, 1993, Applied Psychological Measurement, Volume 18, pp.&nbsp;79–94).
This generalizes the classical surface integral of a parametrized surface <math>\phi:U\to
* In the ], the Gram matrix arises from approximating a function from a finite dimensional space; the Gram matrix entries are then the inner products of the basis functions of the finite dimensional subspace.
S\subset \mathbb{R}^3</math> for <math>(x, y)\in U\subset\mathbb{R}^2</math>:
* In ], ]s are often represented as Gram matrices.<ref>{{cite journal |last1=Lanckriet |first1=G. R. G. |first2=N. |last2=Cristianini |first3=P. |last3=Bartlett |first4=L. E. |last4=Ghaoui |first5=M. I. |last5=Jordan |title=Learning the kernel matrix with semidefinite programming |journal=Journal of Machine Learning Research |volume=5 |year=2004 |pages=27–72 |url=https://dl.acm.org/citation.cfm?id=894170 }}</ref> (Also see ])
: <math>\int_S f\ dA = \iint_U f(\phi(x, y))\, \left|\frac{\partial\phi}{\partial x}\,{\times}\,\frac{\partial\phi}{\partial y}\right|\, dx\, dy.</math>
* Since the Gram matrix over the reals is a ], it is ] and its ] are non-negative. The diagonalization of the Gram matrix is the ].
| If the vectors are centered ]s, the Gramian is approximately proportional to the ''']''', with the scaling determined by the number of elements in the vector.
| In ], the Gram matrix of a set of ] is the ''']'''.
| In ] (or more generally ]), the ''']''' and ''']''' determine properties of a linear system.
| Gramian matrices arise in covariance structure model fitting (see e.g., Jamshidian and Bentler, 1993, Applied Psychological Measurement, Volume 18, pp.&nbsp;79–94).
| In the ], the Gram matrix arises from approximating a function from a finite dimensional space; the Gram matrix entries are then the inner products of the basis functions of the finite dimensional subspace.
| In ], ]s are often represented as Gram matrices.<ref>{{cite journal |last=Lanckriet |first=G. R. G. |first2=N. |last2=Cristianini |first3=P. |last3=Bartlett |first4=L. E. |last4=Ghaoui |first5=M. I. |last5=Jordan |title=Learning the kernel matrix with semidefinite programming |journal=Journal of Machine Learning Research |volume=5 |year=2004 |pages=27–72 |url=https://dl.acm.org/citation.cfm?id=894170 }}</ref>
| Since the Gram matrix over the reals is a ], it is ] and its ] are non-negative. The diagonalization of the Gram matrix is the ].
}}


==Properties== ==Properties==
===Positive-semidefiniteness=== ===Positive-semidefiniteness===
The Gram matrix is ] in the case the real product is real-valued; it is ] in the general, complex case by definition of an ]. The Gram matrix is ] in the case the inner product is real-valued; it is ] in the general, complex case by definition of an ].


The Gram matrix is ], and every positive semidefinite matrix is the Gramian matrix for some set of vectors. The fact that the Gramian matrix is positive-semidefinite can be seen from the following simple derivation: The Gram matrix is ], and every positive semidefinite matrix is the Gramian matrix for some set of vectors. The fact that the Gramian matrix is positive-semidefinite can be seen from the following simple derivation:
: <math> : <math>
x^\textsf{T} \mathbf{G} x = x^\dagger \mathbf{G} x =
\sum_{i,j}x_i x_j\left\langle v_i, v_j \right\rangle = \sum_{i,j}x_i^* x_j\left\langle v_i, v_j \right\rangle =
\sum_{i,j}\left\langle x_i v_i, x_j v_j \right\rangle = \sum_{i,j}\left\langle x_i v_i, x_j v_j \right\rangle =
\left\langle \sum_i x_i v_i, \sum_j x_j v_j \right\rangle = \biggl\langle \sum_i x_i v_i, \sum_j x_j v_j \biggr\rangle =
\left\| \sum_i x_i v_i \right\|^2 \geq 0 . \biggl\| \sum_i x_i v_i \biggr\|^2 \geq 0 .
</math> </math>


The first equality follows from the definition of matrix multiplication, the second and third from the bi-linearity of the ], and the last from the positive definiteness of the inner product. The first equality follows from the definition of matrix multiplication, the second and third from the bi-linearity of the ], and the last from the positive definiteness of the inner product.
Note that this also shows that the Gramian matrix is positive definite if and only if the vectors <math> v_i </math> are linearly independent (that is, <math>\textstyle\sum_i x_i v_i \neq 0</math> for all <math>x</math>).<ref name="HJ-7.2.10"/> Note that this also shows that the Gramian matrix is positive definite if and only if the vectors <math> v_i </math> are linearly independent (that is, <math display="inline">\sum_i x_i v_i \neq 0</math> for all <math>x</math>).<ref name="HJ-7.2.10"/>


===Finding a vector realization=== ===Finding a vector realization===
{{See also|Positive definite matrix#Decomposition}} {{See also|Positive definite matrix#Decomposition}}
Given any positive semidefinite matrix <math>M</math>, one can decompose it as: Given any positive semidefinite matrix <math>M</math>, one can decompose it as:
: <math>M = B^* B</math>, : <math>M = B^\dagger B</math>,


where <math>B^*</math> is the ] of <math>B</math> (or <math>M = B^\textsf{T} B</math> in the real case). where <math>B^\dagger</math> is the ] of <math>B</math> (or <math>M = B^\textsf{T} B</math> in the real case).


Here <math>B</math> is a <math>k \times n</math> matrix, where <math>k</math> is the ] of <math>M</math>. Various ways to obtain such a decomposition include computing the ] or taking the ] of <math>M</math>. Here <math>B</math> is a <math>k \times n</math> matrix, where <math>k</math> is the ] of <math>M</math>. Various ways to obtain such a decomposition include computing the ] or taking the ] of <math>M</math>.


The columns <math>b^{(1)}, \dots, b^{(n)}</math> of <math>B</math> can be seen as ''n'' vectors in <math>\mathbb{C}^k</math> (or ''k''-dimensional Euclidean space <math>\mathbb{R}^k</math>, in the real case). Then The columns <math>b^{(1)}, \dots, b^{(n)}</math> of <math>B</math> can be seen as ''n'' vectors in <math>\mathbb{C}^k</math> (or ''k''-dimensional Euclidean space <math>\mathbb{R}^k</math>, in the real case). Then
: <math>M_{ij} = b^{(i)} \cdot b^{(j)}</math> : <math>M_{ij} = b^{(i)} \cdot b^{(j)}</math>


where the ] <math>a \cdot b = \sum_{\ell=1}^k a_\ell^* b_\ell</math> is the usual inner product on <math>\mathbb{C}^k</math>. where the ] <math display="inline">a \cdot b = \sum_{\ell=1}^k a_\ell^* b_\ell</math> is the usual inner product on <math>\mathbb{C}^k</math>.


Thus a ] <math>M</math> is positive semidefinite if and only if it is the ] of some vectors <math>b^{(1)}, \dots, b^{(n)}</math>. Such vectors are called a '''vector realization''' of <math>M</math>. The infinite-dimensional analog of this statement is ]. Thus a ] <math>M</math> is positive semidefinite if and only if it is the Gram matrix of some vectors <math>b^{(1)}, \dots, b^{(n)}</math>. Such vectors are called a '''vector realization''' of {{nowrap|<math>M</math>.}} The infinite-dimensional analog of this statement is ].


===Uniqueness of vector realizations=== ===Uniqueness of vector realizations===
If <math>M</math> is the Gram matrix of vectors <math>v_1,\dots,v_n</math> in <math>\mathbb{R}^k</math>, If <math>M</math> is the Gram matrix of vectors <math>v_1,\dots,v_n</math> in <math>\mathbb{R}^k</math> then applying any rotation or reflection of <math>\mathbb{R}^k</math> (any ], that is, any ] preserving 0) to the sequence of vectors results in the same Gram matrix. That is, for any <math>k \times k</math> ] <math>Q</math>, the Gram matrix of <math>Q v_1,\dots, Q v_n</math> is also {{nowrap|<math>M</math>.}}
then applying any rotation or reflection of <math>\mathbb{R}^k</math> (any ], that is, any ] preserving 0) to the sequence of vectors results in the same Gram matrix. That is, for any <math>k \times k</math> ] <math>Q</math>, the Gram matrix of <math>Q v_1,\dots, Q v_n</math> is also <math>M</math>.


This is the only way in which two real vector realizations of <math>M</math> can differ: the vectors <math>v_1,\dots,v_n</math> are unique up to ]s. In other words, the dot products <math>v_i \cdot v_j</math> and <math>w_i \cdot w_j</math> are equal if and only if some rigid transformation of <math>\mathbb{R}^k</math> transforms the vectors <math>v_1,\dots,v_n</math> to <math>w_1, \dots, w_n</math> and 0 to 0. This is the only way in which two real vector realizations of <math>M</math> can differ: the vectors <math>v_1,\dots,v_n</math> are unique up to ]s. In other words, the dot products <math>v_i \cdot v_j</math> and <math>w_i \cdot w_j</math> are equal if and only if some rigid transformation of <math>\mathbb{R}^k</math> transforms the vectors <math>v_1,\dots,v_n</math> to <math>w_1, \dots, w_n</math> and 0 to 0.


The same holds in the complex case, with ]s in place of orthogonal ones. The same holds in the complex case, with ]s in place of orthogonal ones.
That is, if the Gram matrix of vectors <math>v_1, \dots, v_n</math> is equal to the Gram matrix of vectors <math>w_1, \dots, w_n</math> in <math>\mathbb{C}^k</math>, then there is a ] <math>k \times k</math> matrix <math>U</math> (meaning <math>U^* U = I</math>) such that <math>v_i = U w_i</math> for <math>i = 1, \dots, n</math>.<ref>{{harvtxt|Horn|Johnson|2013}}, p. 452, Theorem 7.3.11</ref> That is, if the Gram matrix of vectors <math>v_1, \dots, v_n</math> is equal to the Gram matrix of vectors <math>w_1, \dots, w_n</math> in <math>\mathbb{C}^k</math> then there is a ] <math>k \times k</math> matrix <math>U</math> (meaning <math>U^\dagger U = I</math>) such that <math>v_i = U w_i</math> for <math>i = 1, \dots, n</math>.<ref>{{harvtxt|Horn|Johnson|2013}}, p. 452, Theorem 7.3.11</ref>


===Other properties=== ===Other properties===
* Because <math>G = G^\dagger</math>, it is necessarily the case that <math>G</math> and <math>G^\dagger</math> commute. That is, a real or complex Gram matrix <math>G</math> is also a ].
* The Gram matrix of any ] is the identity matrix.
* The Gram matrix of any ] is the identity matrix. Equivalently, the Gram matrix of the rows or the columns of a real ] is the identity matrix. Likewise, the Gram matrix of the rows or columns of a ] is the identity matrix.
* The rank of the Gram matrix of vectors in <math>\mathbb{R}^k</math> or <math>\mathbb{C}^k</math> equals the dimension of the space ] by these vectors.<ref name="HJ-7.2.10"/> * The rank of the Gram matrix of vectors in <math>\mathbb{R}^k</math> or <math>\mathbb{C}^k</math> equals the dimension of the space ] by these vectors.<ref name="HJ-7.2.10"/>


==Gram determinant== ==Gram determinant==
The '''Gram determinant''' or '''Gramian''' is the determinant of the Gram matrix: The '''Gram determinant''' or '''Gramian''' is the determinant of the Gram matrix:
: <math>G(x_1, \dots, x_n) = \begin{vmatrix} <math display=block>\bigl|G(v_1, \dots, v_n)\bigr| = \begin{vmatrix}
\langle x_1,x_1\rangle & \langle x_1,x_2\rangle &\dots & \langle x_1,x_n\rangle \\ \langle v_1,v_1\rangle & \langle v_1,v_2\rangle &\dots & \langle v_1,v_n\rangle \\
\langle x_2,x_1\rangle & \langle x_2,x_2\rangle &\dots & \langle x_2,x_n\rangle \\ \langle v_2,v_1\rangle & \langle v_2,v_2\rangle &\dots & \langle v_2,v_n\rangle \\
\vdots&\vdots&\ddots&\vdots \\ \vdots & \vdots & \ddots & \vdots \\
\langle x_n,x_1\rangle & \langle x_n,x_2\rangle &\dots & \langle x_n,x_n\rangle \langle v_n,v_1\rangle & \langle v_n,v_2\rangle &\dots & \langle v_n,v_n\rangle
\end{vmatrix}.</math> \end{vmatrix}.</math>


If <math>x_1, \cdots, x_n</math> are vectors in <math>\mathbb{R}^m</math>, then it is the square of the ''n''-dimensional volume of the ] formed by the vectors. In particular, the vectors are ] if and only if the parallelotope has nonzero ''n''-dimensional volume, if and only if Gram determinant is nonzero, if and only if the Gram matrix is ]. When m=n, This reduces to the standard theorem that determinant of n n-dimensional vectors is the n dimensional volume. If <math>v_1, \dots, v_n</math> are vectors in <math>\mathbb{R}^m</math> then it is the square of the ''n''-dimensional volume of the ] formed by the vectors. In particular, the vectors are ] ] the parallelotope has nonzero ''n''-dimensional volume, if and only if Gram determinant is nonzero, if and only if the Gram matrix is ]. When {{nowrap|''n'' > ''m''}} the determinant and volume are zero. When {{nowrap|1=''n'' = ''m''}}, this reduces to the standard theorem that the absolute value of the determinant of ''n'' ''n''-dimensional vectors is the ''n''-dimensional volume. The Gram determinant is also useful for computing the volume of the ] formed by the vectors; its volume is {{math|Volume(parallelotope) / ''n''!}}.


The Gram determinant can also be expressed in terms of the ] of vectors by The Gram determinant can also be expressed in terms of the ] of vectors by
:<math>G(x_1, \dots, x_n) = \| x_1 \wedge \cdots \wedge x_n\|^2.</math> :<math>\bigl|G(v_1, \dots, v_n)\bigr| = \| v_1 \wedge \cdots \wedge v_n\|^2.</math>

The Gram determinant therefore supplies an ] for the space {{tmath|{\textstyle\bigwedge}^{\!n}(V)}}. If an ] ''e''<sub>''i''</sub>, {{nowrap|1=''i'' = 1, 2, ..., ''n''}} on {{tmath|V}} is given, the vectors
: <math> e_{i_1} \wedge \cdots \wedge e_{i_n},\quad i_1 < \cdots < i_n, </math>
will constitute an orthonormal basis of ''n''-dimensional volumes on the space {{tmath|{\textstyle\bigwedge}^{\!n}(V)}}. Then the Gram determinant <math>\bigl|G(v_1, \dots, v_n)\bigr|</math> amounts to an ''n''-dimensional ] for the volume of the parallelotope formed by the vectors <math>v_1 \wedge \cdots \wedge v_n</math> in terms of its projections onto the basis volumes <math>e_{i_1} \wedge \cdots \wedge e_{i_n}</math>.

When the vectors <math>v_1, \ldots, v_n \in \mathbb{R}^m</math> are defined from the positions of points <math>p_1, \ldots, p_n</math> relative to some reference point <math>p_{n+1}</math>,
:<math display=block>(v_1, v_2, \ldots, v_n) = (p_1 - p_{n+1}, p_2 - p_{n+1}, \ldots, p_n - p_{n+1})\,,</math>
then the Gram determinant can be written as the difference of two Gram determinants,
:<math display=block>
\bigl|G(v_1, \dots, v_n)\bigr| = \bigl|G((p_1, 1), \dots, (p_{n+1}, 1))\bigr| - \bigl|G(p_1, \dots, p_{n+1})\bigr|\,,
</math>
where each <math>(p_j, 1)</math> is the corresponding point <math>p_j</math> supplemented with the coordinate value of 1 for an <math>(m+1)</math>-st dimension.{{Citation needed|reason=This relation between Gram matrices is apparently true but needs a citation to support its ].|date=February 2022}} Note that in the common case that {{math|1=''n'' = ''m''}}, the second term on the right-hand side will be zero.

==Constructing an orthonormal basis==

Given a set of linearly independent vectors <math>\{v_i\}</math> with Gram matrix <math>G</math> defined by <math>G_{ij}:= \langle v_i,v_j\rangle</math>, one can construct an orthonormal basis
:<math>u_i := \sum_j \bigl(G^{-1/2}\bigr)_{ji} v_j.</math>
In matrix notation, <math>U = V G^{-1/2} </math>, where <math>U</math> has orthonormal basis vectors <math>\{u_i\}</math> and the matrix <math>V</math> is composed of the given column vectors <math>\{v_i\}</math>.

The matrix <math>G^{-1/2}</math> is guaranteed to exist. Indeed, <math>G</math> is Hermitian, and so can be decomposed as <math>G=UDU^\dagger</math> with <math>U</math> a unitary matrix and <math>D</math> a real diagonal matrix. Additionally, the <math>v_i</math> are linearly independent if and only if <math>G</math> is positive definite, which implies that the diagonal entries of <math>D</math> are positive. <math>G^{-1/2}</math> is therefore uniquely defined by <math>G^{-1/2}:=UD^{-1/2}U^\dagger</math>. One can check that these new vectors are orthonormal:
:<math>\begin{align}
\langle u_i,u_j \rangle
&= \sum_{i'} \sum_{j'} \Bigl\langle \bigl(G^{-1/2}\bigr)_{i'i} v_{i'},\bigl(G^{-1/2}\bigr)_{j'j} v_{j'} \Bigr\rangle \\
&= \sum_{i'} \sum_{j'} \bigl(G^{-1/2}\bigr)_{ii'} G_{i'j'} \bigl(G^{-1/2}\bigr)_{j'j} \\
&= \bigl(G^{-1/2} G G^{-1/2}\bigr)_{ij} = \delta_{ij}
\end{align}</math>
where we used <math>\bigl(G^{-1/2}\bigr)^\dagger=G^{-1/2} </math>.


==See also== ==See also==
Line 97: Line 118:
==References== ==References==
{{reflist}} {{reflist}}
* {{Cite book | last1=Horn | first1=Roger A. | last2=Johnson | first2=Charles R. | title=Matrix Analysis | publisher=] | isbn=978-0-521-54823-6 | year=2013 |ref=harv |edition=2nd }} * {{Cite book | last1=Horn | first1=Roger A. | last2=Johnson | first2=Charles R. | title=Matrix Analysis | publisher=] | isbn=978-0-521-54823-6 | year=2013 |edition=2nd }}


==External links== ==External links==
* {{springer|title=Gram matrix|id=p/g044750}} * {{springer|title=Gram matrix|id=p/g044750}}
* '''' by Frank Jones * '''' by Frank Jones


{{Matrix classes}} {{Matrix classes}}
Line 110: Line 131:
] ]
] ]

]

Latest revision as of 20:38, 8 January 2025

Matrix of inner products of a set of vectors

In linear algebra, the Gram matrix (or Gramian matrix, Gramian) of a set of vectors v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} in an inner product space is the Hermitian matrix of inner products, whose entries are given by the inner product G i j = v i , v j {\displaystyle G_{ij}=\left\langle v_{i},v_{j}\right\rangle } . If the vectors v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} are the columns of matrix X {\displaystyle X} then the Gram matrix is X X {\displaystyle X^{\dagger }X} in the general case that the vector coordinates are complex numbers, which simplifies to X X {\displaystyle X^{\top }X} for the case that the vector coordinates are real numbers.

An important application is to compute linear independence: a set of vectors are linearly independent if and only if the Gram determinant (the determinant of the Gram matrix) is non-zero.

It is named after Jørgen Pedersen Gram.

Examples

For finite-dimensional real vectors in R n {\displaystyle \mathbb {R} ^{n}} with the usual Euclidean dot product, the Gram matrix is G = V V {\displaystyle G=V^{\top }V} , where V {\displaystyle V} is a matrix whose columns are the vectors v k {\displaystyle v_{k}} and V {\displaystyle V^{\top }} is its transpose whose rows are the vectors v k {\displaystyle v_{k}^{\top }} . For complex vectors in C n {\displaystyle \mathbb {C} ^{n}} , G = V V {\displaystyle G=V^{\dagger }V} , where V {\displaystyle V^{\dagger }} is the conjugate transpose of V {\displaystyle V} .

Given square-integrable functions { i ( ) , i = 1 , , n } {\displaystyle \{\ell _{i}(\cdot ),\,i=1,\dots ,n\}} on the interval [ t 0 , t f ] {\displaystyle \left} , the Gram matrix G = [ G i j ] {\displaystyle G=\left} is:

G i j = t 0 t f i ( τ ) j ( τ ) d τ . {\displaystyle G_{ij}=\int _{t_{0}}^{t_{f}}\ell _{i}^{*}(\tau )\ell _{j}(\tau )\,d\tau .}

where i ( τ ) {\displaystyle \ell _{i}^{*}(\tau )} is the complex conjugate of i ( τ ) {\displaystyle \ell _{i}(\tau )} .

For any bilinear form B {\displaystyle B} on a finite-dimensional vector space over any field we can define a Gram matrix G {\displaystyle G} attached to a set of vectors v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} by G i j = B ( v i , v j ) {\displaystyle G_{ij}=B\left(v_{i},v_{j}\right)} . The matrix will be symmetric if the bilinear form B {\displaystyle B} is symmetric.

Applications

  • In Riemannian geometry, given an embedded k {\displaystyle k} -dimensional Riemannian manifold M R n {\displaystyle M\subset \mathbb {R} ^{n}} and a parametrization ϕ : U M {\displaystyle \phi :U\to M} for ( x 1 , , x k ) U R k {\displaystyle (x_{1},\ldots ,x_{k})\in U\subset \mathbb {R} ^{k}} , the volume form ω {\displaystyle \omega } on M {\displaystyle M} induced by the embedding may be computed using the Gramian of the coordinate tangent vectors: ω = det G   d x 1 d x k , G = [ ϕ x i , ϕ x j ] . {\displaystyle \omega ={\sqrt {\det G}}\ dx_{1}\cdots dx_{k},\quad G=\left.} This generalizes the classical surface integral of a parametrized surface ϕ : U S R 3 {\displaystyle \phi :U\to S\subset \mathbb {R} ^{3}} for ( x , y ) U R 2 {\displaystyle (x,y)\in U\subset \mathbb {R} ^{2}} : S f   d A = U f ( ϕ ( x , y ) ) | ϕ x × ϕ y | d x d y . {\displaystyle \int _{S}f\ dA=\iint _{U}f(\phi (x,y))\,\left|{\frac {\partial \phi }{\partial x}}\,{\times }\,{\frac {\partial \phi }{\partial y}}\right|\,dx\,dy.}
  • If the vectors are centered random variables, the Gramian is approximately proportional to the covariance matrix, with the scaling determined by the number of elements in the vector.
  • In quantum chemistry, the Gram matrix of a set of basis vectors is the overlap matrix.
  • In control theory (or more generally systems theory), the controllability Gramian and observability Gramian determine properties of a linear system.
  • Gramian matrices arise in covariance structure model fitting (see e.g., Jamshidian and Bentler, 1993, Applied Psychological Measurement, Volume 18, pp. 79–94).
  • In the finite element method, the Gram matrix arises from approximating a function from a finite dimensional space; the Gram matrix entries are then the inner products of the basis functions of the finite dimensional subspace.
  • In machine learning, kernel functions are often represented as Gram matrices. (Also see kernel PCA)
  • Since the Gram matrix over the reals is a symmetric matrix, it is diagonalizable and its eigenvalues are non-negative. The diagonalization of the Gram matrix is the singular value decomposition.

Properties

Positive-semidefiniteness

The Gram matrix is symmetric in the case the inner product is real-valued; it is Hermitian in the general, complex case by definition of an inner product.

The Gram matrix is positive semidefinite, and every positive semidefinite matrix is the Gramian matrix for some set of vectors. The fact that the Gramian matrix is positive-semidefinite can be seen from the following simple derivation:

x G x = i , j x i x j v i , v j = i , j x i v i , x j v j = i x i v i , j x j v j = i x i v i 2 0. {\displaystyle x^{\dagger }\mathbf {G} x=\sum _{i,j}x_{i}^{*}x_{j}\left\langle v_{i},v_{j}\right\rangle =\sum _{i,j}\left\langle x_{i}v_{i},x_{j}v_{j}\right\rangle ={\biggl \langle }\sum _{i}x_{i}v_{i},\sum _{j}x_{j}v_{j}{\biggr \rangle }={\biggl \|}\sum _{i}x_{i}v_{i}{\biggr \|}^{2}\geq 0.}

The first equality follows from the definition of matrix multiplication, the second and third from the bi-linearity of the inner-product, and the last from the positive definiteness of the inner product. Note that this also shows that the Gramian matrix is positive definite if and only if the vectors v i {\displaystyle v_{i}} are linearly independent (that is, i x i v i 0 {\textstyle \sum _{i}x_{i}v_{i}\neq 0} for all x {\displaystyle x} ).

Finding a vector realization

See also: Positive definite matrix § Decomposition

Given any positive semidefinite matrix M {\displaystyle M} , one can decompose it as:

M = B B {\displaystyle M=B^{\dagger }B} ,

where B {\displaystyle B^{\dagger }} is the conjugate transpose of B {\displaystyle B} (or M = B T B {\displaystyle M=B^{\textsf {T}}B} in the real case).

Here B {\displaystyle B} is a k × n {\displaystyle k\times n} matrix, where k {\displaystyle k} is the rank of M {\displaystyle M} . Various ways to obtain such a decomposition include computing the Cholesky decomposition or taking the non-negative square root of M {\displaystyle M} .

The columns b ( 1 ) , , b ( n ) {\displaystyle b^{(1)},\dots ,b^{(n)}} of B {\displaystyle B} can be seen as n vectors in C k {\displaystyle \mathbb {C} ^{k}} (or k-dimensional Euclidean space R k {\displaystyle \mathbb {R} ^{k}} , in the real case). Then

M i j = b ( i ) b ( j ) {\displaystyle M_{ij}=b^{(i)}\cdot b^{(j)}}

where the dot product a b = = 1 k a b {\textstyle a\cdot b=\sum _{\ell =1}^{k}a_{\ell }^{*}b_{\ell }} is the usual inner product on C k {\displaystyle \mathbb {C} ^{k}} .

Thus a Hermitian matrix M {\displaystyle M} is positive semidefinite if and only if it is the Gram matrix of some vectors b ( 1 ) , , b ( n ) {\displaystyle b^{(1)},\dots ,b^{(n)}} . Such vectors are called a vector realization of M {\displaystyle M} . The infinite-dimensional analog of this statement is Mercer's theorem.

Uniqueness of vector realizations

If M {\displaystyle M} is the Gram matrix of vectors v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} in R k {\displaystyle \mathbb {R} ^{k}} then applying any rotation or reflection of R k {\displaystyle \mathbb {R} ^{k}} (any orthogonal transformation, that is, any Euclidean isometry preserving 0) to the sequence of vectors results in the same Gram matrix. That is, for any k × k {\displaystyle k\times k} orthogonal matrix Q {\displaystyle Q} , the Gram matrix of Q v 1 , , Q v n {\displaystyle Qv_{1},\dots ,Qv_{n}} is also M {\displaystyle M} .

This is the only way in which two real vector realizations of M {\displaystyle M} can differ: the vectors v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} are unique up to orthogonal transformations. In other words, the dot products v i v j {\displaystyle v_{i}\cdot v_{j}} and w i w j {\displaystyle w_{i}\cdot w_{j}} are equal if and only if some rigid transformation of R k {\displaystyle \mathbb {R} ^{k}} transforms the vectors v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} to w 1 , , w n {\displaystyle w_{1},\dots ,w_{n}} and 0 to 0.

The same holds in the complex case, with unitary transformations in place of orthogonal ones. That is, if the Gram matrix of vectors v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} is equal to the Gram matrix of vectors w 1 , , w n {\displaystyle w_{1},\dots ,w_{n}} in C k {\displaystyle \mathbb {C} ^{k}} then there is a unitary k × k {\displaystyle k\times k} matrix U {\displaystyle U} (meaning U U = I {\displaystyle U^{\dagger }U=I} ) such that v i = U w i {\displaystyle v_{i}=Uw_{i}} for i = 1 , , n {\displaystyle i=1,\dots ,n} .

Other properties

  • Because G = G {\displaystyle G=G^{\dagger }} , it is necessarily the case that G {\displaystyle G} and G {\displaystyle G^{\dagger }} commute. That is, a real or complex Gram matrix G {\displaystyle G} is also a normal matrix.
  • The Gram matrix of any orthonormal basis is the identity matrix. Equivalently, the Gram matrix of the rows or the columns of a real rotation matrix is the identity matrix. Likewise, the Gram matrix of the rows or columns of a unitary matrix is the identity matrix.
  • The rank of the Gram matrix of vectors in R k {\displaystyle \mathbb {R} ^{k}} or C k {\displaystyle \mathbb {C} ^{k}} equals the dimension of the space spanned by these vectors.

Gram determinant

The Gram determinant or Gramian is the determinant of the Gram matrix: | G ( v 1 , , v n ) | = | v 1 , v 1 v 1 , v 2 v 1 , v n v 2 , v 1 v 2 , v 2 v 2 , v n v n , v 1 v n , v 2 v n , v n | . {\displaystyle {\bigl |}G(v_{1},\dots ,v_{n}){\bigr |}={\begin{vmatrix}\langle v_{1},v_{1}\rangle &\langle v_{1},v_{2}\rangle &\dots &\langle v_{1},v_{n}\rangle \\\langle v_{2},v_{1}\rangle &\langle v_{2},v_{2}\rangle &\dots &\langle v_{2},v_{n}\rangle \\\vdots &\vdots &\ddots &\vdots \\\langle v_{n},v_{1}\rangle &\langle v_{n},v_{2}\rangle &\dots &\langle v_{n},v_{n}\rangle \end{vmatrix}}.}

If v 1 , , v n {\displaystyle v_{1},\dots ,v_{n}} are vectors in R m {\displaystyle \mathbb {R} ^{m}} then it is the square of the n-dimensional volume of the parallelotope formed by the vectors. In particular, the vectors are linearly independent if and only if the parallelotope has nonzero n-dimensional volume, if and only if Gram determinant is nonzero, if and only if the Gram matrix is nonsingular. When n > m the determinant and volume are zero. When n = m, this reduces to the standard theorem that the absolute value of the determinant of n n-dimensional vectors is the n-dimensional volume. The Gram determinant is also useful for computing the volume of the simplex formed by the vectors; its volume is Volume(parallelotope) / n!.

The Gram determinant can also be expressed in terms of the exterior product of vectors by

| G ( v 1 , , v n ) | = v 1 v n 2 . {\displaystyle {\bigl |}G(v_{1},\dots ,v_{n}){\bigr |}=\|v_{1}\wedge \cdots \wedge v_{n}\|^{2}.}

The Gram determinant therefore supplies an inner product for the space ⁠ n ( V ) {\displaystyle {\textstyle \bigwedge }^{\!n}(V)} ⁠. If an orthonormal basis ei, i = 1, 2, ..., n on ⁠ V {\displaystyle V} ⁠ is given, the vectors

e i 1 e i n , i 1 < < i n , {\displaystyle e_{i_{1}}\wedge \cdots \wedge e_{i_{n}},\quad i_{1}<\cdots <i_{n},}

will constitute an orthonormal basis of n-dimensional volumes on the space ⁠ n ( V ) {\displaystyle {\textstyle \bigwedge }^{\!n}(V)} ⁠. Then the Gram determinant | G ( v 1 , , v n ) | {\displaystyle {\bigl |}G(v_{1},\dots ,v_{n}){\bigr |}} amounts to an n-dimensional Pythagorean Theorem for the volume of the parallelotope formed by the vectors v 1 v n {\displaystyle v_{1}\wedge \cdots \wedge v_{n}} in terms of its projections onto the basis volumes e i 1 e i n {\displaystyle e_{i_{1}}\wedge \cdots \wedge e_{i_{n}}} .

When the vectors v 1 , , v n R m {\displaystyle v_{1},\ldots ,v_{n}\in \mathbb {R} ^{m}} are defined from the positions of points p 1 , , p n {\displaystyle p_{1},\ldots ,p_{n}} relative to some reference point p n + 1 {\displaystyle p_{n+1}} ,

( v 1 , v 2 , , v n ) = ( p 1 p n + 1 , p 2 p n + 1 , , p n p n + 1 ) , {\displaystyle (v_{1},v_{2},\ldots ,v_{n})=(p_{1}-p_{n+1},p_{2}-p_{n+1},\ldots ,p_{n}-p_{n+1})\,,}

then the Gram determinant can be written as the difference of two Gram determinants,

| G ( v 1 , , v n ) | = | G ( ( p 1 , 1 ) , , ( p n + 1 , 1 ) ) | | G ( p 1 , , p n + 1 ) | , {\displaystyle {\bigl |}G(v_{1},\dots ,v_{n}){\bigr |}={\bigl |}G((p_{1},1),\dots ,(p_{n+1},1)){\bigr |}-{\bigl |}G(p_{1},\dots ,p_{n+1}){\bigr |}\,,}

where each ( p j , 1 ) {\displaystyle (p_{j},1)} is the corresponding point p j {\displaystyle p_{j}} supplemented with the coordinate value of 1 for an ( m + 1 ) {\displaystyle (m+1)} -st dimension. Note that in the common case that n = m, the second term on the right-hand side will be zero.

Constructing an orthonormal basis

Given a set of linearly independent vectors { v i } {\displaystyle \{v_{i}\}} with Gram matrix G {\displaystyle G} defined by G i j := v i , v j {\displaystyle G_{ij}:=\langle v_{i},v_{j}\rangle } , one can construct an orthonormal basis

u i := j ( G 1 / 2 ) j i v j . {\displaystyle u_{i}:=\sum _{j}{\bigl (}G^{-1/2}{\bigr )}_{ji}v_{j}.}

In matrix notation, U = V G 1 / 2 {\displaystyle U=VG^{-1/2}} , where U {\displaystyle U} has orthonormal basis vectors { u i } {\displaystyle \{u_{i}\}} and the matrix V {\displaystyle V} is composed of the given column vectors { v i } {\displaystyle \{v_{i}\}} .

The matrix G 1 / 2 {\displaystyle G^{-1/2}} is guaranteed to exist. Indeed, G {\displaystyle G} is Hermitian, and so can be decomposed as G = U D U {\displaystyle G=UDU^{\dagger }} with U {\displaystyle U} a unitary matrix and D {\displaystyle D} a real diagonal matrix. Additionally, the v i {\displaystyle v_{i}} are linearly independent if and only if G {\displaystyle G} is positive definite, which implies that the diagonal entries of D {\displaystyle D} are positive. G 1 / 2 {\displaystyle G^{-1/2}} is therefore uniquely defined by G 1 / 2 := U D 1 / 2 U {\displaystyle G^{-1/2}:=UD^{-1/2}U^{\dagger }} . One can check that these new vectors are orthonormal:

u i , u j = i j ( G 1 / 2 ) i i v i , ( G 1 / 2 ) j j v j = i j ( G 1 / 2 ) i i G i j ( G 1 / 2 ) j j = ( G 1 / 2 G G 1 / 2 ) i j = δ i j {\displaystyle {\begin{aligned}\langle u_{i},u_{j}\rangle &=\sum _{i'}\sum _{j'}{\Bigl \langle }{\bigl (}G^{-1/2}{\bigr )}_{i'i}v_{i'},{\bigl (}G^{-1/2}{\bigr )}_{j'j}v_{j'}{\Bigr \rangle }\\&=\sum _{i'}\sum _{j'}{\bigl (}G^{-1/2}{\bigr )}_{ii'}G_{i'j'}{\bigl (}G^{-1/2}{\bigr )}_{j'j}\\&={\bigl (}G^{-1/2}GG^{-1/2}{\bigr )}_{ij}=\delta _{ij}\end{aligned}}}

where we used ( G 1 / 2 ) = G 1 / 2 {\displaystyle {\bigl (}G^{-1/2}{\bigr )}^{\dagger }=G^{-1/2}} .

See also

References

  1. ^ Horn & Johnson 2013, p. 441, p.441, Theorem 7.2.10
  2. Lanckriet, G. R. G.; Cristianini, N.; Bartlett, P.; Ghaoui, L. E.; Jordan, M. I. (2004). "Learning the kernel matrix with semidefinite programming". Journal of Machine Learning Research. 5: 27–72 .
  3. Horn & Johnson (2013), p. 452, Theorem 7.3.11

External links

Matrix classes
Explicitly constrained entries
Constant
Conditions on eigenvalues or eigenvectors
Satisfying conditions on products or inverses
With specific applications
Used in statistics
Used in graph theory
Used in science and engineering
Related terms
Categories: