
SigSpec


SigSpec (acronym of SIGnificance SPECtrum) is a statistical technique for assessing the reliability of periodicities in a measured (noisy and not necessarily equidistant) time series. It relies on the amplitude spectrum obtained by the discrete Fourier transform (DFT) and assigns a quantity called the spectral significance (frequently abbreviated as “sig”) to each amplitude. This quantity is a logarithmic measure of the probability that the given amplitude level would be seen in white noise, in the sense of a type I error. It answers the question, “What would be the chance of obtaining an amplitude like the measured one or higher, if the analysed time series were random?”

SigSpec may be considered a formal extension of the Lomb-Scargle periodogram that properly accounts for the time series being averaged to zero before the DFT is applied, as is done in many practical applications. When a zero-mean corrected dataset is compared statistically with a random sample, the sample mean (rather than only the population mean) has to be zero.

Probability density function (pdf) of white noise in Fourier space

Considering a time series to be represented by a set of $K$ pairs $(t_k, x_k)$, the amplitude pdf of white noise in Fourier space, depending on frequency and phase angle, may be described in terms of three parameters, $\alpha_0$, $\beta_0$, $\theta_0$, defining the “sampling profile”, according to

$$\tan 2\theta_0 = \frac{K\sum_{k=0}^{K-1}\sin 2\omega t_k - 2\left(\sum_{k=0}^{K-1}\cos\omega t_k\right)\left(\sum_{k=0}^{K-1}\sin\omega t_k\right)}{K\sum_{k=0}^{K-1}\cos 2\omega t_k - \left(\sum_{k=0}^{K-1}\cos\omega t_k\right)^{2} + \left(\sum_{k=0}^{K-1}\sin\omega t_k\right)^{2}},$$
$$\alpha_0 = \sqrt{\frac{2}{K^{2}}\left(K\sum_{k=0}^{K-1}\cos^{2}\left(\omega t_k - \theta_0\right) - \left[\sum_{l=0}^{K-1}\cos\left(\omega t_l - \theta_0\right)\right]^{2}\right)},$$
$$\beta_0 = \sqrt{\frac{2}{K^{2}}\left(K\sum_{k=0}^{K-1}\sin^{2}\left(\omega t_k - \theta_0\right) - \left[\sum_{l=0}^{K-1}\sin\left(\omega t_l - \theta_0\right)\right]^{2}\right)}.$$
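
The three sampling-profile parameters can be evaluated directly from the observation times. The following is a minimal sketch rather than the reference SigSpec implementation: the function name `sampling_profile` is hypothetical, and the use of `atan2` to pick one branch of $\tan 2\theta_0$ is an assumption made here for illustration.

```python
import math

def sampling_profile(times, omega):
    """Return (theta0, alpha0, beta0) for the given sampling times.

    `times` is a sequence of observation times t_k and `omega` is the
    angular frequency. Hypothetical helper, not part of any SigSpec release.
    """
    K = len(times)
    c = sum(math.cos(omega * t) for t in times)
    s = sum(math.sin(omega * t) for t in times)
    c2 = sum(math.cos(2 * omega * t) for t in times)
    s2 = sum(math.sin(2 * omega * t) for t in times)

    # tan(2*theta0) = numerator / denominator; atan2 selects one branch
    # (assumption: the text does not say which branch should be used).
    theta0 = 0.5 * math.atan2(K * s2 - 2 * c * s, K * c2 - c * c + s * s)

    cos_sq = sum(math.cos(omega * t - theta0) ** 2 for t in times)
    cos_sum = sum(math.cos(omega * t - theta0) for t in times)
    sin_sq = sum(math.sin(omega * t - theta0) ** 2 for t in times)
    sin_sum = sum(math.sin(omega * t - theta0) for t in times)

    # max(0, ...) guards against tiny negative values from rounding.
    alpha0 = math.sqrt(max(0.0, 2.0 / K**2 * (K * cos_sq - cos_sum**2)))
    beta0 = math.sqrt(max(0.0, 2.0 / K**2 * (K * sin_sq - sin_sum**2)))
    return theta0, alpha0, beta0

# Evenly spaced sampling covering exactly one cycle.
theta0, alpha0, beta0 = sampling_profile(range(8), 2 * math.pi / 8)
print(alpha0, beta0)
```

For equidistant sampling that covers an integer number of cycles, all four trigonometric sums vanish, so both $\alpha_0$ and $\beta_0$ come out as approximately 1 and the sampling profile has no preferred phase.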

In terms of the phase angle in Fourier space, $\theta$, with

$$\tan\theta = \frac{\sum_{k=0}^{K-1}\sin\omega t_k}{\sum_{k=0}^{K-1}\cos\omega t_k},$$

the probability density of amplitudes is given by

$$\phi(A) = \frac{KA\cdot\operatorname{sock}}{2\langle x^{2}\rangle}\exp\left(-\frac{KA^{2}}{4\langle x^{2}\rangle}\cdot\operatorname{sock}\right),$$

where the sock function is defined by

$$\operatorname{sock}(\omega,\theta) = \left[\frac{\cos^{2}\left(\theta-\theta_0\right)}{\alpha_0^{2}} + \frac{\sin^{2}\left(\theta-\theta_0\right)}{\beta_0^{2}}\right]$$

and $\langle x^{2}\rangle$ denotes the variance of the dependent variable $x_k$.
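
As an illustration of the two definitions above, the sock function and the amplitude pdf can be written as short functions. This is a sketch under stated assumptions: the names `sock` and `amplitude_pdf` and their argument order are chosen here for illustration, and the sampling-profile parameters are taken as already-computed inputs.

```python
import math

def sock(theta, theta0, alpha0, beta0):
    """Sock function for phase angle theta and a given sampling profile."""
    d = theta - theta0
    return (math.cos(d) / alpha0) ** 2 + (math.sin(d) / beta0) ** 2

def amplitude_pdf(A, K, var_x, sock_value):
    """Probability density of amplitude A for white noise.

    K is the number of data points, var_x the variance <x^2> of the
    measurements, and sock_value the sock function at the frequency and
    phase of interest.
    """
    scale = K * sock_value / (4.0 * var_x)
    # Equivalent to (K*A*sock / (2*var_x)) * exp(-K*A^2*sock / (4*var_x)).
    return 2.0 * scale * A * math.exp(-scale * A * A)

# Sanity check: the density should integrate to about 1 over A >= 0.
step = 0.001
total = sum(amplitude_pdf((i + 0.5) * step, 50, 2.0, 1.0) * step
            for i in range(3000))
print(total)
```

Written this way, the pdf has the form of a Rayleigh distribution whose width shrinks as $K$ or the sock value grows, which is why the midpoint sum over a finite range is enough to recover nearly all of the probability mass.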

False-alarm probability and spectral significance

Integration of the pdf yields the false-alarm probability that white noise in the time domain produces an amplitude of at least $A$,

$$\Phi_{\operatorname{FA}}(A) = \exp\left(-\frac{KA^{2}}{4\langle x^{2}\rangle}\cdot\operatorname{sock}\right).$$

The sig is defined as the negative logarithm of the false-alarm probability and evaluates to

$$\operatorname{sig}(A) = \frac{KA^{2}\log e}{4\langle x^{2}\rangle}\cdot\operatorname{sock}.$$

With the logarithm taken to base 10, a sig of $s$ means that, on average, one in $10^{s}$ random time series would produce an amplitude exceeding $A$ at the given frequency and phase.
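
The relation between the false-alarm probability and the sig can be checked numerically. The sketch below assumes that the logarithm in the definition is the base-10 logarithm, as the factor $\log e$ suggests; the function names and the sample values are hypothetical.

```python
import math

def false_alarm_probability(A, K, var_x, sock_value):
    """Probability that white noise yields an amplitude of at least A."""
    return math.exp(-K * A * A * sock_value / (4.0 * var_x))

def sig(A, K, var_x, sock_value):
    """Spectral significance, assuming a base-10 logarithm."""
    return K * A * A * math.log10(math.e) * sock_value / (4.0 * var_x)

# Illustrative values only: 100 points, unit variance, sock = 1.
fap = false_alarm_probability(0.5, 100, 1.0, 1.0)
s = sig(0.5, 100, 1.0, 1.0)
print(fap, s)             # s is roughly 2.71
print(-math.log10(fap))   # agrees with s up to rounding
```

In this example the exponent is $100 \cdot 0.25 / 4 = 6.25$, so the false-alarm probability is $e^{-6.25}$ and the sig is $6.25 \log_{10} e$, which is the same number as the negative base-10 logarithm of the false-alarm probability.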

Applications

SigSpec is primarily used in asteroseismology to identify variable stars and to classify stellar pulsation (see references below). The fact that this method incorporates the properties of the time-domain sampling appropriately makes it a valuable tool for typical astronomical measurements containing data gaps.

References

  1. P. Reegen (2007). "SigSpec - I. Frequency- and phase-resolved significance in Fourier space". Astronomy and Astrophysics. 467 (3): 1353–1371. arXiv:physics/0703160. Bibcode:2007A&A...467.1353R. doi:10.1051/0004-6361:20066597. S2CID 15076973.
  2. N. R. Lomb (1976). "Least-squares frequency analysis of unequally spaced data". Astrophysics and Space Science. 39 (2): 447–462. Bibcode:1976Ap&SS..39..447L. doi:10.1007/BF00648343. S2CID 2671466.
  3. J. D. Scargle (1982). "Studies in astronomical time series analysis. II. Statistical aspects of spectral analysis of unevenly spaced data". The Astrophysical Journal. 263: 835–853. Bibcode:1982ApJ...263..835S. doi:10.1086/160554.
