Misplaced Pages

Tjøstheim's coefficient

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Measure of spatial correlation

Tjøstheim's coefficient is a measure of spatial association that attempts to quantify the degree to which two spatial data sets are related. Developed by Norwegian statistician Dag Tjøstheim. It is similar to rank correlation coefficients like Spearman's rank correlation coefficient and the Kendall rank correlation coefficient but also explicitly considers the spatial relationship between variables.

Consider two variables, F ( x , y ) {\displaystyle F(x,y)} and G ( x , y ) {\displaystyle G(x,y)} , observed at the same set of N {\displaystyle N} spatial locations with co-ordinates x i {\displaystyle x_{i}} and y i {\displaystyle y_{i}} . The Rank of F {\displaystyle F} at ( x i , y i ) {\displaystyle (x_{i},y_{i})} is

R F ( x i , y i ) = i N θ ( F ( x i , y i ) F ( x j , y j ) ) {\displaystyle R_{F}(x_{i},y_{i})=\sum _{i}^{N}\theta (F(x_{i},y_{i})-F(x_{j},y_{j}))}

with a similar definition for G {\displaystyle G} . Here θ {\displaystyle \theta } is a step function and this formula counts how many values F ( x j , y j ) {\displaystyle F(x_{j},y_{j})} are less than or equal to the value at the target point F ( x i , y i ) {\displaystyle F(x_{i},y_{i})} .

Now define

X F ( i ) = j N x j δ ( i , R F ( x j , y j ) ) {\displaystyle X_{F}(i)=\sum _{j}^{N}x_{j}\delta (i,R_{F}(x_{j},y_{j}))}

where δ {\displaystyle \delta } is the Kronecker delta. This is the x {\displaystyle x} coordinate of the i th {\displaystyle i^{\text{th}}} ranked F {\displaystyle F} value. The quantities Y F ( i ) , X G ( i ) {\displaystyle Y_{F}(i),X_{G}(i)} and Y G ( i ) {\displaystyle Y_{G}(i)} can be defined similarly.

Tjøstheim's coefficient is defined by

A = i N ( X F ( i ) X ¯ F ) ( X G ( i ) X ¯ G ) + ( Y F ( i ) Y ¯ F ) ( Y G ( i ) Y ¯ G ) ( i N [ ( X F ( i ) X ¯ F ) 2 + ( Y F ( i ) Y ¯ F ) 2 ] i N [ ( X G ( i ) X ¯ G ) 2 + ( Y G ( i ) Y ¯ G ) 2 ] ) 1 / 2 {\displaystyle A={\frac {\sum _{i}^{N}(X_{F}(i)-{\bar {X}}_{F})(X_{G}(i)-{\bar {X}}_{G})+(Y_{F}(i)-{\bar {Y}}_{F})(Y_{G}(i)-{\bar {Y}}_{G})}{\left(\sum _{i}^{N}\left\sum _{i}^{N}\left\right)^{1/2}}}}

Under the assumptions that F {\displaystyle F} and G {\displaystyle G} are independent and identically distributed random variables and are independent of each other it can be shown that E [ A ] = 0 {\displaystyle E=0} and

v a r ( A ) = ( i N x i 2 ) 2 + 2 ( i N x i y i ) 2 + ( i N y i 2 ) 2 ( N 1 ) ( i N x i 2 + i N y i 2 ) 2 {\displaystyle var(A)={\frac {\left(\sum _{i}^{N}x_{i}^{2}\right)^{2}+2\left(\sum _{i}^{N}x_{i}y_{i}\right)^{2}+\left(\sum _{i}^{N}y_{i}^{2}\right)^{2}}{(N-1)\left(\sum _{i}^{N}x_{i}^{2}+\sum _{i}^{N}y_{i}^{2}\right)^{2}}}}

The maximum variance of 1 / ( N 1 ) {\displaystyle 1/(N-1)} occurs when all points are on a straight line and the minimum variance of 1 / ( 2 ( N 1 ) ) {\displaystyle 1/(2(N-1))} occurs for a symmetric cross pattern where x i y i = 0 {\displaystyle x_{i}y_{i}=0} and i N x i 2 = i N y i 2 {\displaystyle \sum _{i}^{N}x_{i}^{2}=\sum _{i}^{N}y_{i}^{2}} .

Tjøstheim's coefficient is implemented as cor.spatial in the R package SpatialPack. Numerical simulations suggest that A {\displaystyle A} is an effective measure of correlation between variables but is sensitive to the degree of autocorrelation in F {\displaystyle F} and G {\displaystyle G} .

See also

References

  1. D. Tjøstheim (1978). "A measure of association for spatial variables". Biometrika. 65 (1): 109–114. doi:10.1093/biomet/65.1.109. JSTOR 2335284.
  2. Vallejos, Ronny; Osorio, Feilpe; Bevilacqua, Moreno (2020). Spatial relationships between two georeferenced variables: With applications in R. Springer Cham. doi:10.1007/978-3-030-56681-4. ISBN 978-3-030-56681-4.
  3. ^ B.J. Glick (1982). "A Spatial Rank-Order Correlation Measure". Geographical Analysis. 14 (2): 177–181. doi:10.1111/j.1538-4632.1982.tb00066.x.
  4. http://spatialpack.mat.utfsm.cl
Categories: