Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license.
Give it a read and then ask your questions in the chat.
We can research this topic together.
This article, Innovation method, has recently been created via the Articles for creation process. Please check to see if the reviewer has accidentally left this template after accepting the draft and take appropriate action as necessary.
Reviewer tools:Inform author
The innovation estimator for SDEs is defined in the framework of continuous-discrete state space models . These models arise as natural mathematical representation of the temporal evolution of continuous random phenomena and their measurements in a succession of time instants. In the simplest formulation, these continuous-discrete models are expressed in term
of a SDE of the form
describing the time evolution of state variables of the phenomenon for all time instant , and an observation equation
Once the dynamics of a phenomenon is described by a state equation as (1) and the way of measurement the state variables specified by a observation equation as (2), the inference problem to solve is the following : given partial and noisy observations of the stochastic process on the observation times , estimate the unobserved state variable of and the unknown parameters in (1) that better fit to the given observations.
Discrete-time innovation process
Let be the sequence of observation times of the states of (1), and the time series of partial and noisy measurements of described by the observation equation (2).
defines the discrete-time innovation process , where is proved to be an independent normally distributed random vector with zero mean and variance
for small enough , with . In practice , this distribution for the discrete-time innovation is valid when, with a suitable selection of both, the number of observations and the time distance between consecutive observations, the time series of observations of the SDE contains the main information about the continuous-time process . That is, when the sampling of the continuous-time process has low distortion (aliasing) and when there is a suitable signal-noise ratio.
Innovation estimator
The innovation estimator for the parameters of the SDE (1) is the one that maximizes the likelihood function of the discrete-time innovation process with respect to the parameters . More precisely, given measurements of the state space model (1)-(2) with on the innovation estimator for the parameters of (1) is defined by
where
being the discrete-time innovation (3) and the innovation variance (4) of the model (1)-(2) at , for all In the above expression for the conditional mean and variance are computed by the continuous-discrete filtering algorithm for the evolution of the moments (Section 6.4 in ), for all
Differences with the maximum likelihood estimator
The maximum likelihood estimator of the parameters in the model (1)-(2) involves the evaluation of the - usually unknown - transition density function between the states and of the diffusion process for all the observation times and . instead of this, the innovation estimator (5) is obtained by maximizing the likelihood of the discrete-time innovation process taking into account that are Gaussian and independent random vectors. Remarkably, whereas the transition density function changes when the SDE for does, the transition density function for the innovation process remains Gaussian independently of the SDEs for . Only in the case that the diffusion is described by a linear SDE with additive noise, the density function is Gaussian and equal to and so the maximum likelihood and the innovation estimator coincide . Otherwise , the innovation estimator is an approximation to the maximum likelihood estimator and, in this sense, the innovation estimator is a Quasi-Maximum Likelihood estimator. In addition, the innovation method is a particular instance of the Prediction Error method according to the definition given in . Therefore, the asymptotic results obtained in for that general class of estimators are valid for the innovation estimators . Intuitively, by following the typical control engineering viewpoint, it is expected that the innovation process - viewed as a measure of the prediction errors of the fitted model - be approximately a white noise process when the models fit the data , which can be used as a practical tool for designing of models and for optimal experimental design .
Properties
The innovation estimator (5) has a number of important attributes:
For smooth enough function , nonlinear observation equations of the form
can be transformed to the simpler one (2), and the innovation estimator (5) can be applied .
Approximate Innovation estimators
In practice, close form expressions for computing and in (5) are only available for a few models (1)-(2). Therefore, approximate filtering algorithms as the following are used in applications.
Given measurements and the initial filter estimates , , the approximate Linear Minimum Variance (LMV) filter for the model (1)-(2) is iteratively defined at each observation time by the prediction estimates
and
with initial conditions and , and the filter estimates
and
with filter gain
for all , where is an approximation to the solution of (1) on the observation times .
Given measurements of the state space model (1)-(2) with on , the approximate innovation estimator for the parameters of (1) is defined by
where
being
and
approximations to the discrete-time innovation (3) and innovation variance (4), respectively, resulting from the filtering algorithm (7)-(8).
For models with complete observations free of noise (i.e, with and in (2), the approximate innovation estimator (9) reduces to the known Quasi-Maximum Likelihood estimators for SDEs .
Main conventional-type estimators
Conventional-type innovation estimators are those (9) derived from conventional-type continuous-discrete or
discrete-discrete approximate filtering algorithms. With approximate continuous-discrete filters there are the innovation estimators based on Local Linearization (LL) filters , on the extended Kalman filter , and on the second order filters . Approximate innovation estimators based on discrete-discrete filters result from the discretization of the SDE (1) by means of a numerical scheme . Typically, the effectiveness of these innovation estimators is directly related to the stability of the involved filtering algorithms.
A shared drawback of these approximate innovation estimators is that, once the observations are given, the error between the approximate and the exact innovation process is fixed and completely settled by the time distance between observations . This might sets a large bias of the approximate estimators in some applications, bias that can not be corrected by increasing the number of observations. However, they are useful in many practical situations for which only medium or low accuracy for the parameter estimation is required .
Order-β innovation estimators
Let us consider the finer time discretization of the time interval satisfying the condition . Further, let be the approximate value of obtained from a discretization of the equation (1) for all , and
for all
a continuous-time approximation to .
A order-LMV filter is an approximate LMV filter for which is an order-weak approximation to satisfying (10) and the weak convergence condition
for all and any times continuously differentiable functions for which and all its partial derivatives up to order have polynomial growth, being a positive constant. This order- LMV filter converges with rate to the exact LMV filter as goes to zero , where is the maximum stepsize of the time
discretization on which the approximation to is defined.
A order-innovation estimator is an approximate innovation estimator (9) for which the approximations to the discrete-time innovation (3) and innovation variance (4), respectively, resulting from an order- LMV filter.
Approximations of any kind converging to in a weak sense (as, e.g., those in ) can be used to design an order- LMV filter and, consequently, an order- innovation estimator. These order- innovation estimators are intended for the recurrent practical situation in which a diffusion process should be identified from a reduced number of observations distant in time or when high accuracy for the estimated parameters is required.
Properties
An order-innovation estimator has a number of important properties :
For each given data of observations, converges to the exact innovation estimator as the maximum stepsize of the time discretization goes to zero.
For finite samples of observations, the expected value of converges to the expected value of the exact innovation estimator as goes to zero.
For an increasing number of observations, is asymptotically normal distributed and its bias decreases when goes to zero.
Likewise to the convergence of the order- LMV filter to the exact LMV filter, for the convergence and asymptotic properties of there are no constraints on the time distance between two consecutive observations and , nor on the time discretization
Approximations for the Akaike or Bayesian information criterion and confidence limits are directly obtained by replacing the exact estimator by its approximation . These approximations converge to the corresponding exact one when the maximum stepsize of the time discretization goes to zero.
The distribution of the approximate fitting-innovation process measures the goodness of fit of the model to the data, which is also used as a practical tool for designing of models and for optimal experimental design.
For smooth enough function , nonlinear observation equations of the form (6) can be transformed to the simpler one (2), and the order- innovation estimator can be applied.
Figure 1 presents the histograms of the differences and between the exact innovation estimator with the conventional and order- innovation estimators for the parameters and of the equation
obtained from 100 time series of noisy observations
of on the observation times , , with and . The classical and the order- Local Linearization filters of the innovation
estimators and are defined as in , respectively, on the uniform time discretizations and , with . The number of stochastic simulations of the order- Local Linearization filter is estimated via an adaptive sampling algorithm with moderate tolerance. The Figure illustrates the convergence of the order- innovation estimator to the exact innovation estimators as decreases, which substantially improves the estimation provided by the conventional innovation estimator .
Deterministic approximations
The order- innovation estimators overcome the drawback of the conventional-type innovation estimators concerning the impossibility of reducing bias . However, the viable bias reduction of an order- innovation estimators might eventually require that the associated order- LMV filter performs a large number of stochastic simulations . In situations where only low or medium precision approximate estimators are needed, an alternative deterministic filter algorithm - called deterministic order- LMV filter - can be obtained by tracking the first two conditional moments and of the order- weak approximation at all the time instants in between two consecutive observation times and . That is, the value of the predictions and in the filtering algorithm are computed from the recursive formulas
and with
and with . The approximate innovation estimators defined with these deterministic order- LMV filters not longer converge to the exact innovation estimator, but allow a significant bias reduction in the estimated parameters for a given finite sample with a lower computational cost.
Figure 2 presents the histograms and the confidence limits of the approximate innovation estimators and for the parameters and of the Van der Pol oscillator with random frequency
obtained from 100 time series of partial and noisy observations
of on the observation times , , with and . The deterministic order- Local Linearization filter of the innovation estimators and is defined , respectively, on uniform time discretizations , with and adaptive time-stepping discretization with moderate relative and absolute tolerances. Observe the bias reduction of the estimated parameter as decreases.
Software
A Matlab implementation of various approximate innovation estimators is provided by the SdeEstimation toolbox . This toolbox contains various implementations of Local Linearization filters for the state estimation and, consequently, of the Innovation Estimators for the parameters. This includes deterministic and stochastic filters with fixed step sizes and number of samples, with adaptive time stepping algorithms, with adaptive sampling algorithms, as well as local and global optimization algorithms for computing the innovation estimators. For models with complete observations free of noise, various approximations to the Quasi-Maximum Likelihood estimator are implemented in R .
Referencias
^ Ozaki T. (1994) "The local linearization filter with application to nonlinear system identification". In: Bozdogan H.(ed.) Proceedings of the first US/Japan Conference on the Frontiers of Statistical Modeling: An Informational Approach. 217-240. Kluwer Academic Publishers. https://doi.org/10.1007/978-94-011-0854-6_10
^ Jazwinski A.H., Stochastic Processes and Filtering Theory, Academic Press, New York, 1970.
^ Nielsen J.N., Vestergaard M., Madsen H. (2000) "Estimation in continuous-time stochastic volatility models using nonlinear filters", Int. J. Theor. Appl. Finance, 3, 279–308. https://doi.org/10.1142/S0219024900000139
Kailath T., Lectures on Wiener and Kalman Filtering. New York: Springer-Verlag, 1981.
^ Jimenez J.C., Ozaki T. (2006) "An approximate innovation method for the estimation of diffusion processes from discrete data", J. Time Series Analysis, 27, 77-97. http://dx.doi.org/10.1111/j.1467-9892.2005.00454.x
^ Jimenez J.C., Yoshimoto A., Miwakeichi F. (2021) "State and parameter estimation of stochastic physical systems from uncertain and indirect measurements", Eur. Phys. J. Plus, 136, 869. https://doi.org/10.1140/epjp/s13360-021-01859-1
Ljung L., System Identification, Theory for the User (2nd edn). Englewood Cliffs: Prentice Hall, 1999.
Ljung L., Caines P.E. (1979) "Asymptotic normality of prediction error estimators for approximate system models", Stochastics 3, 29-46. https://doi.org/10.1080/17442507908833135
^ Nolsoe K., Nielsen, J.N., Madsen H. (2000) "Prediction-based estimating function for diffusion processes with measurement noise", Technical Reports 2000, No. 10, Informatics and Mathematical Modelling, Technical University of Denmark.
^ Ozaki T., Jimenez J.C., Haggan V. (2000) "Role of the likelihood function in the estimation of chaos models", J. Time Ser. Anal., 21, 363-387. http://dx.doi.org/10.1111/1467-9892.00189
^ Jimenez J.C. (2020) "Bias reduction in the estimation of diffusion processes from discrete observations", IMA J. Math. Control. Inform., 37, 1468-1505. https://doi.org/10.1093/imamci/dnaa021
^ Jimenez J.C. (2019) "Approximate linear minimum variance filters for continuous-discrete state space models: convergence and practical adaptive algorithms", IMA J. Math. Control Inform., 36, 341-378. http://dx.doi.org/10.1093/imamci/dnx047
Shoji I. (1998) "A comparative study of maximum likelihood estimators for nonlinear dynamical systems", Int. J. Control, 71, 391-404. https://doi.org/10.1080/002071798221731
Nielsen, J. N., Madsen, H. (2001) "Applying the EKF to stochastic differential equations with level effects", Automatica, 37, 107-112. https://doi.org/10.1016/S0005-1098(00)00128-X
^ Singer H. (2002) "Parameter estimation of nonlinear stochastic differential equations: Simulated maximum likelihood versus extended Kalman filter and Ito-Taylor expansion", J. Comput. Graph. Stat., 11, 972-995. https://doi.org/10.1198/106186002808
Ozaki T., Iino M. (2001) "An innovation approach to non-Gaussian time series analysis", J. Appl. Prob., 38A, 78-92. https://doi.org/10.1239/jap/1085496593
Peng H., Ozaki T., Jimenez J.C. (2002) "Modeling and control for foreign exchange based on a continuous time stochastic microstructure model", Proceedings of the 41st IEEE Conference on Decision and Control, LasVegas, Nevada USA, December 2002 IEEE, 4, 4440-4445. http://dx.doi.org/10.1109/CDC.2002.1185071
Kloeden P.E., Platen E., Numerical Solution of Stochastic Differential Equations, 3rd edn. Berlin: Springer, 1999.