Decomposition of time series

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.

The decomposition of time series is a statistical task that deconstructs a time series into several components, each representing one of the underlying categories of patterns. There are two principal types of decomposition, which are outlined below.

Decomposition based on rates of change

This is an important technique for all types of time series analysis, especially for seasonal adjustment. It seeks to construct, from an observed time series, a number of component series (which can be added or multiplied together to reconstruct the original), each with a certain characteristic or type of behavior. For example, time series are usually decomposed into:

  • $T_t$, the trend component at time t, which reflects the long-term progression of the series (secular variation). A trend exists when there is a persistent increasing or decreasing direction in the data. The trend component does not have to be linear.
  • $C_t$, the cyclical component at time t, which reflects repeated but non-periodic fluctuations. The duration of these fluctuations depends on the nature of the time series.
  • $S_t$, the seasonal component at time t, reflecting seasonality (seasonal variation). A seasonal pattern exists when a time series is influenced by seasonal factors. Seasonality occurs over a fixed and known period (e.g., the quarter of the year, the month, or the day of the week).
  • $I_t$, the irregular component (or "noise") at time t, which describes random, irregular influences. It represents the residuals or remainder of the time series after the other components have been removed.

Hence a time series using an additive model can be thought of as

$$y_t = T_t + C_t + S_t + I_t,$$

whereas a multiplicative model would be

$$y_t = T_t \times C_t \times S_t \times I_t.$$

An additive model would be used when the variations around the trend do not vary with the level of the time series, whereas a multiplicative model would be appropriate if the variations around the trend are proportional to the level of the time series.
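The two models above can be illustrated with a classical moving-average decomposition. The following is a minimal sketch rather than a reference implementation: the function names `centered_moving_average` and `decompose` are hypothetical, the trend and cyclical components are estimated together as a single moving-average series, the seasonal period is assumed to be known, and the series is assumed to cover at least two full periods.

```python
from statistics import mean


def centered_moving_average(y, period):
    """Estimate the trend with a centered moving average.

    Returns a list as long as y, with None where the window does not fit.
    For an even period the two end points get half weight, so the window
    is centered on an observation.
    """
    n = len(y)
    half = period // 2
    trend = [None] * n
    for t in range(half, n - half):
        if period % 2 == 1:
            trend[t] = sum(y[t - half:t + half + 1]) / period
        else:
            inner = sum(y[t - half + 1:t + half])
            trend[t] = (0.5 * y[t - half] + inner + 0.5 * y[t + half]) / period
    return trend


def decompose(y, period, model="additive"):
    """Split y into trend, seasonal and irregular lists (illustrative only)."""
    if model not in ("additive", "multiplicative"):
        raise ValueError("model must be 'additive' or 'multiplicative'")
    mult = model == "multiplicative"
    trend = centered_moving_average(y, period)

    # Remove the trend: divide (multiplicative) or subtract (additive).
    detrended = [
        None if tr is None else (obs / tr if mult else obs - tr)
        for obs, tr in zip(y, trend)
    ]

    # Average the detrended values at each position within the period.
    raw = [
        mean(d for d in detrended[k::period] if d is not None)
        for k in range(period)
    ]

    # Normalise: effects average to 1 (multiplicative) or to 0 (additive).
    centre = mean(raw)
    effects = [s / centre if mult else s - centre for s in raw]
    seasonal = [effects[t % period] for t in range(len(y))]

    # The irregular component is whatever remains.
    irregular = [
        None if tr is None else (obs / (tr * s) if mult else obs - tr - s)
        for obs, tr, s in zip(y, trend, seasonal)
    ]
    return trend, seasonal, irregular


# Synthetic quarterly data (made up for this sketch): linear trend plus
# a fixed seasonal pattern that sums to zero, with no noise.
period = 4
pattern = [3.0, -1.0, -4.0, 2.0]
y = [10.0 + 0.5 * t + pattern[t % period] for t in range(24)]
trend, seasonal, irregular = decompose(y, period, model="additive")
```

Wherever the trend estimate is defined, the three returned components recombine to the input series by addition or by multiplication, according to the chosen model. A multiplicative series with strictly positive values could alternatively be handled by taking logarithms and applying the additive version.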

Sometimes the trend and cyclical components are grouped into one, called the trend-cycle component. The trend-cycle component is often referred to simply as the "trend" component, even though it may contain cyclical behavior. For example, a seasonal decomposition of time series by Loess (STL) splits a time series into seasonal, trend and irregular components using loess and plots the components separately; any cyclical component present in the data is included in the "trend" component plot.
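Written out, the grouping reduces each model to three components. The symbol $TC_t$ for the trend-cycle component is introduced here only for illustration. In the additive case, with $TC_t = T_t + C_t$,

$$y_t = TC_t + S_t + I_t,$$

and in the multiplicative case, with $TC_t = T_t \times C_t$,

$$y_t = TC_t \times S_t \times I_t.$$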

Decomposition based on predictability

The theory of time series analysis makes use of the idea of decomposing a time series into deterministic and non-deterministic components (or predictable and unpredictable components). See Wold's theorem and Wold decomposition.
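As a brief sketch of what this looks like, Wold's theorem states that a covariance-stationary series can be written as

$$y_t = \sum_{j=0}^{\infty} b_j \varepsilon_{t-j} + \eta_t, \qquad b_0 = 1, \qquad \sum_{j=0}^{\infty} b_j^2 < \infty,$$

where $\varepsilon_t$ is a white-noise sequence (zero mean, constant variance, serially uncorrelated) and $\eta_t$ is a deterministic component that can be predicted exactly from its own past and is uncorrelated with $\varepsilon_t$. The symbols $b_j$, $\varepsilon_t$ and $\eta_t$ follow a common textbook notation and are not taken from this article.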

Examples

Figure: An example of multiplicative decomposition used to forecast biohydrogen production.

Kendall shows an example of a decomposition into smooth, seasonal and irregular factors for a set of data containing values of the monthly aircraft miles flown by UK airlines.

In policy analysis, forecasts of future biofuel production are key inputs for making better decisions. Statistical time series models have recently been developed to forecast renewable energy sources, including a multiplicative decomposition method designed to forecast future production of biohydrogen. In that method, the optimum length of the moving average (the seasonal length) and the start point at which the averages are placed were chosen according to the best agreement between the forecast and the actual values.
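The idea of choosing the moving-average length by comparing forecasts with actual values can be sketched as follows. This is not the procedure of the cited study: the helper names (`seasonal_indices`, `linear_fit`, `forecast`, `select_period`) are hypothetical, the trend is assumed to be linear, agreement is measured by the mean absolute percentage error on a held-out tail of the series, and the search over start points is omitted.

```python
from statistics import mean


def seasonal_indices(y, period):
    """Ratio-to-moving-average seasonal indices for a multiplicative model."""
    half = period // 2
    ratios = [[] for _ in range(period)]
    for t in range(half, len(y) - half):
        if period % 2 == 1:
            ma = sum(y[t - half:t + half + 1]) / period
        else:
            # Half weight on both end points keeps the window centered.
            ma = (0.5 * y[t - half] + sum(y[t - half + 1:t + half])
                  + 0.5 * y[t + half]) / period
        ratios[t % period].append(y[t] / ma)
    raw = [mean(r) for r in ratios]
    centre = mean(raw)
    return [r / centre for r in raw]  # indices average to 1


def linear_fit(values):
    """Least-squares line through the points (0, v0), (1, v1), ..."""
    n = len(values)
    t_bar = (n - 1) / 2
    v_bar = mean(values)
    slope = (sum((t - t_bar) * (v - v_bar) for t, v in enumerate(values))
             / sum((t - t_bar) ** 2 for t in range(n)))
    return v_bar - slope * t_bar, slope


def forecast(train, period, horizon):
    """Forecast as (extrapolated linear trend) x (seasonal index)."""
    idx = seasonal_indices(train, period)
    deseasonalised = [v / idx[t % period] for t, v in enumerate(train)]
    intercept, slope = linear_fit(deseasonalised)
    n = len(train)
    return [(intercept + slope * t) * idx[t % period]
            for t in range(n, n + horizon)]


def select_period(y, candidates, holdout):
    """Return the candidate length with the lowest error on the held-out tail."""
    train, actual = y[:-holdout], y[-holdout:]
    scores = {}
    for p in candidates:
        predicted = forecast(train, p, holdout)
        scores[p] = mean(abs(a - f) / abs(a) for a, f in zip(actual, predicted))
    return min(scores, key=scores.get), scores


# Synthetic data (made up for this sketch) with a true seasonal length of 4.
factors = [1.3, 0.8, 0.7, 1.2]
series = [(50.0 + 2.0 * t) * factors[t % 4] for t in range(40)]
best, scores = select_period(series, candidates=[3, 4, 5, 6], holdout=8)
```

Holding out the last observations mimics the comparison between forecast and actual values described above; a different error measure or a rolling evaluation could be substituted without changing the structure.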

Software

An example of statistical software for this type of decomposition is the program BV4.1, which is based on the Berlin procedure. The R statistical software also offers many tools for time series decomposition, such as the built-in stl function and the packages seasonal, stlplus, and bfast. Bayesian methods are also available; one example is the BEAST method, implemented in the Rbeast package for R, Matlab, and Python.

References

  1. "6.1 Time series components | OTexts". www.otexts.org. Retrieved 2016-05-14.
  2. Dodge, Y. (2003). The Oxford Dictionary of Statistical Terms. New York: Oxford University Press. ISBN 0-19-920613-9.
  3. "6.1 Time series components | OTexts". www.otexts.org. Retrieved 2016-05-18.
  4. "6.5 STL decomposition | OTexts". www.otexts.org. Retrieved 2016-05-18.
  5. Asadi, Nooshin; Karimi Alavijeh, Masih; Zilouei, Hamid (2016). "Development of a mathematical methodology to investigate biohydrogen production from regional and national agricultural crop residues: A case study of Iran". International Journal of Hydrogen Energy. doi:10.1016/j.ijhydene.2016.10.021.
  6. Kendall, M. G. (1976). Time-Series (Second ed.). Charles Griffin. (Fig. 5.1). ISBN 0-85264-241-5.
  7. Sax, Christoph. "seasonal: R Interface to X-13-ARIMA-SEATS".
  8. Hafen, Ryan. "stlplus: Enhanced Seasonal Decomposition of Time Series by Loess".
  9. Li, Yang; Zhao, Kaiguang; Hu, Tongxi; Zhang, Xuesong. "BEAST: A Bayesian Ensemble Algorithm for Change-Point Detection and Time Series Decomposition".
