Kushner equation - Misplaced Pages

In filtering theory the Kushner equation (after Harold Kushner) is an equation for the conditional probability density of the state of a stochastic non-linear dynamical system, given noisy measurements of the state. It therefore provides the solution of the nonlinear filtering problem in estimation theory. The equation is sometimes referred to as the Stratonovich–Kushner (or Kushner–Stratonovich) equation. However, the correct equation in terms of Itō calculus was first derived by Kushner although a more heuristic Stratonovich version of it appeared already in Stratonovich's works in late fifties. However, the derivation in terms of Itō calculus is due to Richard Bucy.

Overview

Assume the state of the system evolves according to

dx=f(x,t)\,dt+\sigma \,dw

and a noisy measurement of the system state is available:

dz=h(x,t)\,dt+\eta \,dv

where w, v are independent Wiener processes. Then the conditional probability density p(x, t) of the state at time t is given by the Kushner equation:

dp(x,t)=Ldt+p(x,t){\big (}h(x,t)-E_{t}h(x,t){\big )}^{\top }\eta ^{-\top }\eta ^{-1}{\big (}dz-E_{t}h(x,t)dt{\big )}.

where

L:=-\sum {\frac {\partial (f_{i}p)}{\partial x_{i}}}+{\frac {1}{2}}\sum (\sigma \sigma ^{\top })_{i,j}{\frac {\partial ^{2}p}{\partial x_{i}\partial x_{j}}}

is the Kolmogorov forward operator and

dp(x,t)=p(x,t+dt)-p(x,t)

is the variation of the conditional probability.

The term $dz-E_{t}h(x,t)dt$ is the innovation, i.e. the difference between the measurement and its expected value.

Kalman–Bucy filter

One can use the Kushner equation to derive the Kalman–Bucy filter for a linear diffusion process. Suppose we have $f(x,t)=Ax$ and $h(x,t)=Cx$ . The Kushner equation will be given by

dp(x,t)=Ldt+p(x,t){\big (}Cx-C\mu (t){\big )}^{\top }\eta ^{-\top }\eta ^{-1}{\big (}dz-C\mu (t)dt{\big )},

where $\mu (t)$ is the mean of the conditional probability at time $t$ . Multiplying by $x$ and integrating over it, we obtain the variation of the mean

d\mu (t)=A\mu (t)dt+\Sigma (t)C^{\top }\eta ^{-\top }\eta ^{-1}{\big (}dz-C\mu (t)dt{\big )}.

Likewise, the variation of the variance $\Sigma (t)$ is given by

{\tfrac {d}{dt}}\Sigma (t)=A\Sigma (t)+\Sigma (t)A^{\top }+\sigma ^{\top }\sigma -\Sigma (t)C^{\top }\eta ^{-\top }\eta ^{-1}C\,\Sigma (t).

The conditional probability is then given at every instant by a normal distribution ${\mathcal {N}}(\mu (t),\Sigma (t))$ .

References

Kushner, H. J. (1964). "On the differential equations satisfied by conditional probability densities of Markov processes, with applications". Journal of the Society for Industrial and Applied Mathematics, Series A: Control. 2 (1): 106–119. doi:10.1137/0302009.
Stratonovich, R.L. (1959). Optimum nonlinear systems which bring about a separation of a signal with constant parameters from noise. Radiofizika, 2:6, pp. 892–901.
Stratonovich, R.L. (1959). On the theory of optimal non-linear filtering of random functions. Theory of Probability and Its Applications, 4, pp. 223–225.
Stratonovich, R.L. (1960) Application of the Markov processes theory to optimal filtering. Radio Engineering and Electronic Physics, 5:11, pp. 1–19.
Stratonovich, R.L. (1960). Conditional Markov Processes. Theory of Probability and Its Applications, 5, pp. 156–178.
Bucy, R. S. (1965). "Nonlinear filtering theory". IEEE Transactions on Automatic Control. 10 (2): 198. doi:10.1109/TAC.1965.1098109.

Categories:

Overview

Kalman–Bucy filter

See also

References