Misplaced Pages

Heat kernel signature

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

A heat kernel signature (HKS) is a feature descriptor for use in deformable shape analysis and belongs to the group of spectral shape analysis methods. For each point in the shape, HKS defines its feature vector representing the point's local and global geometric properties. Applications include segmentation, classification, structure discovery, shape matching and shape retrieval.

HKS was introduced in 2009 by Jian Sun, Maks Ovsjanikov and Leonidas Guibas. It is based on heat kernel, which is a fundamental solution to the heat equation. HKS is one of the many recently introduced shape descriptors which are based on the Laplace–Beltrami operator associated with the shape.

Overview

Shape analysis is the field of automatic digital analysis of shapes, e.g., 3D objects. For many shape analysis tasks (such as shape matching/retrieval), feature vectors for certain key points are used instead of using the complete 3D model of the shape. An important requirement of such feature descriptors is for them to be invariant under certain transformations. For rigid transformations, commonly used feature descriptors include shape context, spin images, integral volume descriptors and multiscale local features, among others. HKS allows isometric transformations which generalizes rigid transformations.

HKS is based on the concept of heat diffusion over a surface. Given an initial heat distribution u 0 ( x ) {\displaystyle u_{0}(x)} over the surface, the heat kernel h t ( x , y ) {\displaystyle h_{t}(x,y)} relates the amount of heat transferred from x {\displaystyle x} to y {\displaystyle y} after time t {\displaystyle t} . The heat kernel is invariant under isometric transformations and stable under small perturbations to the isometry. In addition, the heat kernel fully characterizes shapes up to an isometry and represents increasingly global properties of the shape with increasing time. Since h t ( x , y ) {\displaystyle h_{t}(x,y)} is defined for a pair of points over a temporal domain, using heat kernels directly as features would lead to a high complexity. HKS instead restricts itself to just the temporal domain by considering only h t ( x , x ) {\displaystyle h_{t}(x,x)} . HKS inherits most of the properties of heat kernels under certain conditions.

Technical details

The heat diffusion equation over a compact Riemannian manifold M {\displaystyle M} (possibly with a boundary) is given by,

( Δ t ) u ( x , t ) = 0 {\displaystyle \left(\Delta -{\frac {\partial }{\partial t}}\right)u(x,t)=0}

where Δ {\displaystyle \Delta } is the Laplace–Beltrami operator and u ( x , t ) {\displaystyle u(x,t)} is the heat distribution at a point x {\displaystyle x} at time t {\displaystyle t} . The solution to this equation can be expressed as,

u ( x , t ) = h t ( x , y ) u 0 ( y ) d y . {\displaystyle u(x,t)=\int h_{t}(x,y)u_{0}(y)dy.}

The eigen decomposition of the heat kernel is expressed as,

h t ( x , y ) = i = 0 exp ( λ i t ) ϕ i ( x ) ϕ i ( y ) {\displaystyle h_{t}(x,y)=\sum _{i=0}^{\infty }\exp(-\lambda _{i}t)\phi _{i}(x)\phi _{i}(y)}

where λ i {\displaystyle \lambda _{i}} and ϕ i {\displaystyle \phi _{i}} are the i t h {\displaystyle i^{th}} eigenvalue and eigenfunction of Δ {\displaystyle \Delta } . The heat kernel fully characterizes a surface up to an isometry: For any surjective map T : M N {\displaystyle T:M\rightarrow N} between two Riemannian manifolds M {\displaystyle M} and N {\displaystyle N} , if h t ( x , y ) = h t ( T ( x ) , T ( y ) ) {\displaystyle h_{t}(x,y)=h_{t}(T(x),T(y))} then T {\displaystyle T} is an isometry, and vice versa. For a concise feature descriptor, HKS restricts the heat kernel only to the temporal domain,

h t ( x , x ) = i = 0 exp ( λ i t ) ϕ i 2 ( x ) . {\displaystyle h_{t}(x,x)=\sum _{i=0}^{\infty }\exp(-\lambda _{i}t)\phi _{i}^{2}(x).}

HKS, similar to the heat kernel, characterizes surfaces under the condition that the eigenvalues of Δ {\displaystyle \Delta } for M {\displaystyle M} and N {\displaystyle N} are non-repeating. The terms exp ( λ i t ) {\displaystyle \exp(-\lambda _{i}t)} can be intuited as a bank of low-pass filters, with λ i {\displaystyle \lambda _{i}} determining the cutoff frequencies.

Practical considerations

Since h t ( x , x ) {\displaystyle h_{t}(x,x)} is, in general, a non-parametric continuous function, HKS is in practice represented as a discrete sequence of { h t 1 ( x , x ) , , h t n ( x , x ) } {\displaystyle \{h_{t_{1}}(x,x),\ldots ,h_{t_{n}}(x,x)\}} values sampled at times t 1 , , t n {\displaystyle t_{1},\ldots ,t_{n}} .

In most applications, the underlying manifold for an object is not known. The HKS can be computed if a mesh representation of the manifold is available, by using a discrete approximation to Δ {\displaystyle \Delta } and using the discrete analogue of the heat equation. In the discrete case, the Laplace–Beltrami operator is a sparse matrix and can be written as,

L = A 1 W {\displaystyle L=A^{-1}W}

where A {\displaystyle A} is a positive diagonal matrix with entries A ( i , i ) {\displaystyle A(i,i)} corresponding to the area of the triangles in the mesh sharing the vertex i {\displaystyle i} , and W {\displaystyle W} is a symmetric semi-definite weighting matrix. L {\displaystyle L} can be decomposed into L = Φ Λ Φ T A {\displaystyle L=\Phi \Lambda \Phi ^{T}A} , where Λ {\displaystyle \Lambda } is a diagonal matrix of the eigenvalues of L {\displaystyle L} arranged in the ascending order, and Φ {\displaystyle \Phi } is the matrix with the corresponding orthonormal eigenvectors. The discrete heat kernel is the matrix given by,

K t = Φ exp ( t Λ ) Φ T . {\displaystyle K_{t}=\Phi \exp(-t\Lambda )\Phi ^{T}.}

The elements k t ( i , j ) {\displaystyle k_{t}(i,j)} represents the heat diffusion between vertices i {\displaystyle i} and j {\displaystyle j} after time t {\displaystyle t} . The HKS is then given by the diagonal entries of this matrix, sampled at discrete time intervals. Similar to the continuous case, the discrete HKS is robust to noise.

Limitations

Non-repeating eigenvalues

The main property that characterizes surfaces using HKS up to an isometry holds only when the eigenvalues of the surfaces are non-repeating. There are certain surfaces (especially those with symmetry) where this condition is violated. A sphere is a simple example of such a surface.

Time parameter selection

The time parameter in the HKS is closely related to the scale of global information. However, there is no direct way to choose the time discretization. The existing method chooses time samples logarithmically which is a heuristic with no guarantees

Time complexity

The discrete heat kernel requires eigendecomposition of a matrix of size n × n {\displaystyle n\times n} , where n {\displaystyle n} is the number of vertices in the mesh representation of the manifold. Computing the eigendecomposition is an expensive operation, especially as n {\displaystyle n} increases. Note, however, that because of the inverse exponential dependence on the eigenvalue, typically only a small (less than 100) eigenvectors are sufficient to obtain a good approximation of the HKS.

Non-isometric transformations

The performance guarantees for HKS only hold for truly isometric transformations. However, deformations for real shapes are often not isometric. A simple example of such transformation is closing of the fist by a person, where the geodesic distances between two fingers changes.

Relation with other methods

Source:

Curvature

The (continuous) HKS at a point x {\displaystyle x} , h t ( x , x ) {\displaystyle h_{t}(x,x)} on the Riemannian manifold is related to the scalar curvature s ( x ) {\displaystyle s(x)} by,

h t ( x , x ) = 1 4 π t + s ( x ) 12 π + O ( t ) . {\displaystyle h_{t}(x,x)={\frac {1}{4\pi t}}+{\frac {s(x)}{12\pi }}+O(t).}

Hence, HKS can as be interpreted as the curvature of x {\displaystyle x} at scale t {\displaystyle t} .

Wave kernel signature (WKS)

The WKS follows a similar idea to the HKS, replacing the heat equation with the Schrödinger wave equation,

( i Δ + t ) ψ ( x , t ) = 0 {\displaystyle \left(i\Delta +{\frac {\partial }{\partial t}}\right)\psi (x,t)=0}

where ψ ( x , t ) {\displaystyle \psi (x,t)} is the complex wave function. The average probability of measuring the particle at a point x {\displaystyle x} is given by,

p ( x ) = i = 0 f 2 ( λ i ) ϕ i 2 ( x ) {\displaystyle p(x)=\sum _{i=0}^{\infty }f^{2}(\lambda _{i})\phi _{i}^{2}(x)}

where f {\displaystyle f} is the initial energy distribution. By fixing a family of these energy distributions f i ( x ) {\displaystyle f_{i}(x)} , the WKS can be obtained as a discrete sequence { p f 1 ( x ) , , p f n ( x ) } {\displaystyle \{p_{f_{1}}(x),\ldots ,p_{f_{n}}(x)\}} . Unlike HKS, the WKS can be intuited as a set of band-pass filters leading to better feature localization. However, the WKS does not represent large-scale features well (as they are filtered out) yielding poor performance at shape matching applications.

Global point signature (GPS)

Similar to the HKS, the GPS is based on the Laplace-Beltrami operator. GPS at a point x {\displaystyle x} is a vector of scaled eigenfunctions of the Laplace–Beltrami operator computed at x {\displaystyle x} . The GPS is a global feature whereas the scale of the HKS can be varied by varying the time parameter for heat diffusion. Hence, the HKS can be used in partial shape matching applications whereas the GPS cannot.

Spectral graph wavelet signature (SGWS)

SGWS provides a general form for spectral descriptors, where one can obtain HKS by specifying the filter function. SGWS is a multiresolution local descriptor that is not only isometric invariant, but also compact, easy to compute and combines the advantages of both band-pass and low-pass filters.

Extensions

Scale invariance

Even though the HKS represents the shape at multiple scales, it is not inherently scale invariant. For example, the HKS for a shape and its scaled version are not the same without pre-normalization. A simple way to ensure scale invariance is by pre-scaling each shape to have the same surface area (e.g. 1). Using the notation above, this means:

s = j A j A = A / s λ i = s λ i  for each  i ϕ i = s ϕ i  for each  i {\displaystyle {\begin{aligned}s&=\sum _{j}A_{j}\\A&=A/s\\\lambda _{i}&=s\lambda _{i}{\text{ for each }}i\\\phi _{i}&={\sqrt {s}}\phi _{i}{\text{ for each }}i\\\end{aligned}}}

Alternatively, scale-invariant version of the HKS can also be constructed by generating a Scale space representation. In the scale-space, the HKS of a scaled shape corresponds to a translation up to a multiplicative factor. The Fourier transform of this HKS changes the time-translation into the complex plane, and the dependency on translation can be eliminated by considering the modulus of the transform. Demo of Scale-invariant HKS on YouTube. An alternative scale invariant HKS can be established by working out its construction through a scale invariant metric, as defined in.

Volumetric HKS

The HKS is defined for a boundary surface of a 3D shape, represented as a 2D Riemannian manifold. Instead of considering only the boundary, the entire volume of the 3D shape can be considered to define the volumetric version of the HKS. The Volumetric HKS is defined analogous to the normal HKS by considering the heat equation over the entire volume (as a 3-submanifold) and defining a Neumann boundary condition over the 2-manifold boundary of the shape. Volumetric HKS characterizes transformations up to a volume isometry, which represent the transformation for real 3D objects more faithfully than boundary isometry.

Shape Search

The scale-invariant HKS features can be used in the bag-of-features model for shape retrieval applications. The features are used to construct geometric words by taking into account their spatial relations, from which shapes can be constructed (analogous to using features as words and shapes as sentences). Shapes themselves are represented using compact binary codes to form an indexed collection. Given a query shape, similar shapes in the index with possibly isometric transformations can be retrieved by using the Hamming distance of the code as the nearness-measure.

References

  1. ^ Sun, J. and Ovsjanikov, M. and Guibas, L. (2009). "A Concise and Provably Informative Multi-Scale Signature-Based on Heat Diffusion". Computer Graphics Forum. Vol. 28. pp. 1383–1392.{{cite conference}}: CS1 maint: multiple names: authors list (link)
  2. ^ Alexander M. Bronstein (2011). "Spectral descriptors for deformable shapes". arXiv:1110.5015. Bibcode:2011arXiv1110.5015B. {{cite journal}}: Cite journal requires |journal= (help)
  3. Grigor'yan, Alexander (2006). "Heat kernels on weighted manifolds and applications". The ubiquitous heat kernel. Contemporary Mathematics. Vol. 398. Providence, RI: American Mathematical Society. pp. 93–191. doi:10.1090/conm/398/07486. ISBN 978-0-8218-3698-9. MR 2218016.
  4. ^ Aubry, M. and Schlickewei, U. and Cremers, D. (2011). "The Wave Kernel Signature—A Quantum Mechanical Approach to Shape Analysis". IEEE International Conference on Computer Vision (ICCV) - Workshop on Dynamic Shape Capture and Analysis (4DMOD).{{cite conference}}: CS1 maint: multiple names: authors list (link)
  5. Rustamov, R.M. (2007). "Laplace–Beltrami eigenfunctions for deformation invariant shape representation". Proceedings of the fifth Eurographics symposium on Geometry processing. Eurographics Association. pp. 225–233.
  6. C. Li; A. Ben Hamza (2013). "A multiresolution descriptor for deformable 3D shape retrieval". The Visual Computer. 29 (6–8): 513–524. doi:10.1007/s00371-013-0815-3. S2CID 10125228.
  7. Bronstein, M.M.; Kokkinos, I. (2010). "Scale-invariant heat kernel signatures for non-rigid shape recognition". Computer Vision and Pattern Recognition (CVPR), 2010. IEEE. pp. 1704–1711.
  8. Aflalo, Yonathan; Kimmel, Ron; Raviv, Dan (2013). "Scale Invariant Geometry for Nonrigid Shapes". SIAM Journal on Imaging Sciences. 6 (3): 1579–1597. CiteSeerX 10.1.1.406.3701. doi:10.1137/120888107.
  9. ^ Raviv, D. and Bronstein, M.M. and Bronstein, A.M. and Kimmel, R. (2010). "Volumetric heat kernel signatures". Proceedings of the ACM workshop on 3D object retrieval. ACM. pp. 30–44.{{cite conference}}: CS1 maint: multiple names: authors list (link)
  10. Bronstein, A.M. and Bronstein, M.M. and Guibas, L.J. and Ovsjanikov, M. (2011). "Shape google: Geometric words and expressions for invariant shape retrieval". ACM Transactions on Graphics. 30 (1). doi:10.1145/1899404.1899405. S2CID 7964594.{{cite journal}}: CS1 maint: multiple names: authors list (link)
Categories: