Unit 3: Time Series-I

Table of Contents

3.1 Introduction to Time Series Data

A time series is a set of statistical observations arranged in chronological (time) order.

It's a sequence of data points, measured typically at successive, uniform time intervals (e.g., hourly, daily, monthly, quarterly, yearly).

Examples:

The goal of Time Series Analysis is to study this past data to identify patterns (like trends or seasonality) and use them to make forecasts or predictions about the future.

3.2 Application of Time Series in Various Fields

Time series analysis is crucial in many fields:

3.3 Components of a Time Series

The "classical" approach to time series assumes that the data's value (Y) at any time (t) is a combination of four distinct components:

  1. Secular Trend (T):
    • The long-term, smooth movement of the series in one general direction (up, down, or constant).
    • It represents the underlying growth or decline over many years.
    • Example: The general increase in a country's population over several decades.
  2. Seasonal Variation (S):
    • Regular, periodic fluctuations that occur within one year and repeat each year.
    • Caused by weather (e.g., ice cream sales peak in summer) or social customs (e.g., retail sales peak before festivals).
    • The time period is less than one year (e.g., monthly or quarterly).
  3. Cyclical Variation (C):
    • Long-term, "wave-like" oscillations or swings around the trend line.
    • The time period is more than one year (often 3-10 years).
    • This component is often related to the "business cycle" (boom, recession, depression, recovery).
    • It is less regular and harder to predict than seasonality.
  4. Irregular/Random Variation (I):
    • The unpredictable, "noise" component. It's what's left over after the other three components are accounted for.
    • Caused by random, unforeseen events like strikes, natural disasters, or wars.
    • It is non-repeating and short-term.

3.4 Decomposition of Time Series

Decomposition is the process of breaking down an observed time series (Y) into its constituent components (T, S, C, and I).

The goal is to isolate and study each component. For example, to make a good forecast, we might first find the trend (T) and then add back the seasonal effect (S).

There are two main models for decomposition:

1. Additive Model

Y = T + S + C + I

This model is used when the components are assumed to be independent of each other. The magnitude of the seasonal variation is constant, regardless of the trend level.

2. Multiplicative Model

Y = T × S × C × I

This model is used when the components are inter-dependent. The magnitude of the seasonal variation is proportional to the trend.

The multiplicative model is more common in practice. It can be "linearized" by taking the logarithm:
log(Y) = log(T) + log(S) + log(C) + log(I)
This transforms the multiplicative model into an additive one, which is often easier to work with.