Misplaced Pages

S-estimator

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

The goal of S-estimators is to have a simple high-breakdown regression estimator, which share the flexibility and nice asymptotic properties of M-estimators. The name "S-estimators" was chosen as they are based on estimators of scale.

We will consider estimators of scale defined by a function ρ {\displaystyle \rho } , which satisfy

  • R1 – ρ {\displaystyle \rho } is symmetric, continuously differentiable and ρ ( 0 ) = 0 {\displaystyle \rho (0)=0} .
  • R2 – there exists c > 0 {\displaystyle c>0} such that ρ {\displaystyle \rho } is strictly increasing on [ c , ] {\displaystyle }

For any sample { r 1 , . . . , r n } {\displaystyle \{r_{1},...,r_{n}\}} of real numbers, we define the scale estimate s ( r 1 , . . . , r n ) {\displaystyle s(r_{1},...,r_{n})} as the solution of

1 n i = 1 n ρ ( r i / s ) = K {\textstyle {\frac {1}{n}}\sum _{i=1}^{n}\rho (r_{i}/s)=K} ,

where K {\displaystyle K} is the expectation value of ρ {\displaystyle \rho } for a standard normal distribution. (If there are more solutions to the above equation, then we take the one with the smallest solution for s; if there is no solution, then we put s ( r 1 , . . . , r n ) = 0 {\displaystyle s(r_{1},...,r_{n})=0} .)

Definition:

Let ( x 1 , y 1 ) , . . . , ( x n , y n ) {\displaystyle (x_{1},y_{1}),...,(x_{n},y_{n})} be a sample of regression data with p-dimensional x i {\displaystyle x_{i}} . For each vector θ {\displaystyle \theta } , we obtain residuals s ( r 1 ( θ ) , . . . , r n ( θ ) ) {\displaystyle s(r_{1}(\theta ),...,r_{n}(\theta ))} by solving the equation of scale above, where ρ {\displaystyle \rho } satisfy R1 and R2. The S-estimator θ ^ {\displaystyle {\hat {\theta }}} is defined by

θ ^ = min θ s ( r 1 ( θ ) , . . . , r n ( θ ) ) {\displaystyle {\hat {\theta }}=\min _{\theta }\,s(r_{1}(\theta ),...,r_{n}(\theta ))}

and the final scale estimator σ ^ {\displaystyle {\hat {\sigma }}} is then

σ ^ = s ( r 1 ( θ ^ ) , . . . , r n ( θ ^ ) ) {\displaystyle {\hat {\sigma }}=s(r_{1}({\hat {\theta }}),...,r_{n}({\hat {\theta }}))} .

References

  1. P. Rousseeuw and V. Yohai, Robust Regression by Means of S-estimators, from the book: Robust and nonlinear time series analysis, pages 256–272, 1984
Categories: