Include Top

ARIMA – Multiple Seasonal Decomposition (MSD) Forecast

  1. Open Monthly Airline Passengers - Series G.xlsx (Sheet 1 tab). This is the Series G data from Box and Jenkins, monthly total international airline passengers for January 1949 to December 1960. See the Run Chart, ACF/PACF Plots, Spectral Density Plot and Seasonal Trend Decomposition Plots for this data. The Multiple Seasonal Decomposition (MSD) option is not necessary for this data, but by way of introduction, we will use this to compare to the previous ARIMA analysis.
  2. Click SigmaXL > Time Series Forecasting > ARIMA Forecast > Multiple Seasonal Decomposition Forecast. Ensure that the entire data table is selected. If not, check Use Entire Data Table. Click Next.
  3. Select Monthly Airline Passengers, click Numeric Time Series Data (Y) >>. Select Date, click Optional Time Axis Labels >>. Check Display ACF/PACF/LB Plots and Display Residual Plots. Check Specify Model Periods. Set Withhold Periods = 24. Select Withhold Forecast Type: Multi-Step-Ahead with Prediction Interval at Start of Withhold. Select Seasonal Frequency Specify and enter 12. Check Box-Cox Transformation and select Rounded Lambda. We will use the default Prediction Interval = 95.0 %.


    ARIMAMSDFORECASTStep3
    Seasonal Frequency can have multiple entries however, we recommend no more than 3 values.

  4. Click Model Options.

    ARIMAMSDFORECASTStep4
  5. We will use the default Automatic Model Selection with AICc as the Model Selection Criterion. Click OK to return to the ARIMA MSD Forecast dialog. Click OK. The ARIMA forecast report is given:

    ARIMAMSDFORECASTStep5
  6. Scroll down to view the ARIMA Model header:

    ARIMAMSDFORECASTStep6
  7. The ARIMA Model Summary is given as:

    ARIMAMSDFORECASTStep7
    This is a summary of the model information for the deseasonalized data: ARIMA (0,1,1) with a constant. Seasonal Frequency = 12 using Decomposition and Model Selection Criterion = “AICc”. There are no seasonal terms in the model. The Box-Cox Transformation is “Rounded Lambda” with Lambda = 0 (Ln transformation).

  8. The Parameter Estimates for the deseasonalized Airline Passenger data are:

    ARIMAMSDFORECASTStep8
  9. The ARIMA Model Statistics are:

    ARIMAMSDFORECASTStep9
    • The number of observations, n = 144 – 24 (withhold) = 120

    • Degrees of freedom (DF) = 120 (n) – 3 (2 terms in the model, 1 nonseasonal difference) = 117

    • Note that the model statistics are based on the Ln transformed data, not the original data.

    • Comparing to the Exponential Smoothing MSD Model Statistics, we see that the StDev and Variance are approximately equal, but the Log-Likelihood, AICc, AIC and BIC are very different. This is due to different formulas being used in the Likelihood function. You cannot use Information Criteria to compare ARIMA and Exponential Smooth models to determine which model has the best fit.

  10. The Forecast Accuracy metrics are:

    ARIMAMSDFORECASTStep10
    Comparing to Exponential Smoothing MSD Forecast Accuracy metrics, we see that the ARIMA MSD results are approximately the same.

    Forecast Accuracy metrics are calculated using the actual raw data versus inverse transformed forecast as displayed in the Forecast Chart and Table, so allow comparison across all model types and transformations.

  11. Click on the ARIMA MSD ACF PACF LB sheet to view the ACF/PACF/LB Plots:

    ARIMAMSDFORECASTStep11A

    ARIMAMSDFORECASTStep11B
    The ARIMA MSD ACF/PACF Residuals Plots are similar to the Exponential Smoothing MSD ACF/PACF/LB Plots and indicate that the autocorrelation has been accounted for in the model. All of the Ljung-Box P-Values are blue (i.e., not significant), with P-Values > .05.

  12. Click on the ARIMA MSD Residuals sheet to view the Residual Plots:

    ARIMAMSDFORECAST12
    The residuals are approximately normally distributed, with a roughly straight line on the normal probability plot. There are no obvious extreme outliers or patterns in the charts. The residual plots are approximately the same as those given previously in Exponential Smoothing MSD Residual Plots.

    Since a Box-Cox transformation was used, the residuals are in Ln transformed units.

  13. Open Half-Hourly Multiple Seasonal Electricity Demand - Taylor.xlsx (Sheet 1 tab). This is half-hourly electricity demand (MW) in England and Wales from Monday, June 5, 2000 to Sunday, August 27, 2000 (taylor, R forecast). This data has multiple seasonality with frequency = 48 (observations per day) and 336 (observations per week), with a total of 4032 observations. See the Run Chart, ACF/PACF Plots, Spectral Density Plot and Seasonal Trend Decomposition Plots for this data.
  14. Click SigmaXL > Time Series Forecasting > ARIMA Forecast > Multiple Seasonal Decomposition Forecast. Ensure that the entire data table is selected. If not, check Use Entire Data Table. Click Next.
  15. Select Demand, click Numeric Time Series Data (Y) >>. Check Display ACF/PACF/LB Plots and Display Residual Plots. Check Specify Model Periods. Set Withhold Periods = 96. Select Withhold Forecast Type: Multi-Step-Ahead with Prediction Interval at Start of Withhold. Check Seasonal Frequency with Specify = 48 336. Leave Box-Cox Transformation unchecked. We will use the default Prediction Interval = 95.0 %.

    ARIMAMSDFORECASTStep15
    Withhold Periods is 2*dominant seasonal frequency (48). Dominant frequency is obtained from the Spectral Density Plot. Start Model at Period = 1 is always greyed out for MSD.

  16. Click Model Options.

    ARIMAMSDFORECASTStep16
  17. We will use the default Automatic Model Selection with AICc as the Model Selection Criterion. Click OK to return to the ARIMA MSD Forecast dialog. Click OK. The ARIMA forecast report is given:

    ARIMAMSDFORECASTStep17
  18. We will want to zoom in on the last 3 days, i.e., 144 half-hourly time periods, using chart scrolling. Click SigmaXL Chart Tools > Enable Scrolling

    ARIMACCStep8
    You may be prompted with a warning message that custom formatting on the chart will be cleared. You can avoid seeing this warning by checking Save this choice as default and do not show this form again.

  19. Click OK. The scroll dialog appears allowing you to specify the Start Period and Window Width. Enter Start Period = 3888 and Window Width = 144:

    ARIMAMSDFORECASTStep19
    At any point, you can click Restore/Show All Data Points or Freeze Chart. Freezing the chart will remove the scroll and unload the dialog. The scroll dialog will also unload if you change worksheets. To restore the dialog, click SigmaXL Chart Tools > Enable Scrolling.

  20. Click OK. A scroll bar appears beneath the forecast chart. You can also change the Start Subgroup and Window Width and Update.

    ARIMAMSDFORECASTStep20
    You can scroll through by clicking to the right or left, with the specified window width of 144.

    The blank dots are the data values in the withhold sample with a multi-step forecast and prediction intervals displayed at the start of the withhold sample. The model does quite well at predicting the withhold 96 half-hour demand values. Note that the prediction error increases the further out we predict. The ARIMA MSD Forecast Chart looks approximately the same as the previous Exponential Smoothing MSD Forecast Chart.

    Click Cancel to exit the scroll dialog.

  21. Scroll down to view the ARIMA Model header:

    ARIMAMSDFORECASTStep21
  22. The ARIMA Model Summary is given as:

    ARIMAMSDFORECASTStep22
    This is a summary of the model information for the deseasonalized data: ARIMA (2,1,0) without a constant. Seasonal Frequency = 48, 336 using Decomposition and Model Selection Criterion = “AICc”. There are no seasonal terms in the model.

  23. The Parameter Estimates for the deseasonalized Demand data are:

    ARIMAMSDFORECASTStep23
    ARIMA Parameter Estimates include significance tests; P-Values < .05 are significant and highlighted in red. This may be useful for model refinement with multiple predictors but for AR/MA model order selection, minimum AICc should be used, rather than significance tests (see Kostenko, A.V. and Hyndman, R.J. ).

  24. The ARIMA Model Statistics are:

    ARIMAMSDFORECASTStep24
    • The number of observations, n = 4032 – 96 (withhold) = 3936

    • Degrees of freedom (DF) = 3936 (n) – 3 (2 terms in the model, 1 nonseasonal difference) = 3933

    • Note that the model statistics are calculated using the deseasonalized data.

  25. The Forecast Accuracy metrics are:

    ARIMAMSDFORECASTStep25
    As expected, the Out-of Sample (Withhold) Multi-Step-Ahead Forecast errors are larger than the In-Sample (Estimation) One-Step-Ahead Forecast errors. Note, if we were primarily interested in a short term one-step ahead forecast, then we would have selected Withhold Forecast Type: One-Step-Ahead and the above table would show Out-of-Sample (Withhold) One-Step-Ahead Forecast errors.

    Comparing to the previous Exponential Smoothing MSD Forecast Accuracy metrics, we see that the ARIMA MSD results are approximately the same.

    Forecast Accuracy metrics are calculated using the actual raw data versus forecast as displayed in the Forecast Chart and Table so, unlike the model statistics above, allows comparison across all forecast model types and transformations.

  26. The Forecast Table is given as:

    ARIMAMSDFORECASTStep26
    These are the same forecast and prediction interval values displayed in the Forecast Chart, but provided for further analysis or charting (e.g., a run chart of the forecast errors). The Withhold Data is also displayed.

  27. Click on the ARIMA MSD ACF PACF LB sheet to view the ACF/PACF/LB Plots:

    ARIMAMSDFORECASTStep2A7

    ARIMAMSDFORECASTStep2AB
    The ACF/PACF Residuals Plots indicate that much of the autocorrelation has been accounted for in the model, but the Ljung-Box plot shows that some significant autocorrelation still remains (the red P-Values are significant at alpha=.05) - so the model can potentially be improved. This does not mean that the model is a bad model, it can still be very useful for prediction purposes, but the prediction intervals may not provide accurate coverage.

  28. Click on the ARIMA MSD Residuals sheet to view the Residual Plots:

    ARIMAMSDFORECASTStep28
    Similar to the Exponential Smoothing MSD Residual Plots, the residuals are not normally distributed and there are extreme outliers. These should be investigated with a control chart on the residuals. Outliers in Electricity Demand are often explained by Temperature.


    ARIMA does not have a theoretical frequency limit, but for computational efficiency and to minimize the potential loss of observations through differencing, we recommend using ARIMA – Multiple Seasonal Decomposition (MSD) for seasonal frequency greater than 52 (or with multiple frequencies).

    The seasonal component is first removed through decomposition, a nonseasonal ARIMA model fitted to the remainder (+trend), and then the seasonal component is added back in. For forecasting, a naïve seasonal forecast is used on the seasonal component. Note that the prediction intervals are derived from the ARIMA model and do not include uncertainty in the seasonal component.

    As the name implies, Multiple Seasonal Decomposition (MSD) also accommodates multiple seasonality, for example the half-hourly data with a seasonal frequency of 48 observations per day and 336 observations per week. When using MSD, it is recommended to limit the forecast period to 2*dominant seasonal frequency.


Define, Measure, Analyze, Improve, Control

Lean Six Sigma Software Excel Add-in

Simulate, Optimize,
Realize

Lean Six Sigma Software Excel Add-in

Web Demos

Our CTO and Co-Founder, John Noguera, regularly hosts free Web Demos featuring SigmaXL and DiscoverSim
Click here to view some now!

Contact Us

Phone: 1.888.SigmaXL (744.6295)

Support: Support@SigmaXL.com

Sales: Sales@SigmaXL.com

Information: Information@SigmaXL.com