# ARIMA – Multiple Seasonal Decomposition (MSD) Forecast

ARIMA does not have a theoretical frequency limit, but for computational efficiency and to minimize the potential loss of observations through differencing, we recommend using ARIMA – Multiple Seasonal Decomposition (MSD) for seasonal frequency greater than 52 (or with multiple frequencies).The seasonal component is first removed through decomposition, a nonseasonal ARIMA model fitted to the remainder (+trend), and then the seasonal component is added back in. For forecasting, a naïve seasonal forecast is used on the seasonal component. Note that the prediction intervals are derived from the ARIMA model and do not include uncertainty in the seasonal component.

As the name implies, Multiple Seasonal Decomposition (MSD) also accommodates multiple seasonality, for example the half-hourly data with a seasonal frequency of 48 observations per day and 336 observations per week. When using MSD, it is recommended to limit the forecast period to 2*dominant seasonal frequency.

- Open
**Monthly Airline Passengers - Series G.xlsx**(**Sheet 1**tab). This is the Series G data from Box and Jenkins, monthly total international airline passengers for January 1949 to December 1960. See the Run Chart, ACF/PACF Plots, Spectral Density Plot and Seasonal Trend Decomposition Plots for this data. The Multiple Seasonal Decomposition (MSD) option is not necessary for this data, but by way of introduction, we will use this to compare to the previous ARIMA analysis. - Click
**SigmaXL > Time Series Forecasting > ARIMA Forecast > Multiple Seasonal Decomposition Forecast**. Ensure that the entire data table is selected. If not, check Use**Entire Data Table**. Click**Next**. - Select
*Monthly Airline Passengers*, click**Numeric Time Series Data (Y) >>**. Select*Date*, click**Optional Time Axis Labels >>**. Check**Display ACF/PACF/LB Plots**and**Display Residual Plots**. Check**Specify Model Periods**. Set W**ithhold Periods**= 24. Select**Withhold Forecast Type: Multi-Step-Ahead with Prediction Interval at Start of Withhold**. Select**Seasonal Frequency Specify**and enter**12**. Check**Box-Cox Transformation**and select**Rounded Lambda**. We will use the default**Prediction Interval**= 95.0 %.

•**Seasonal Frequency**can have multiple entries however, we recommend no more than 3 values.

- Click
**Model Options**.

- We will use the default
**Automatic Model Selection**with**AICc**as the**Model Selection Criterion**. Click**OK**to return to the ARIMA MSD Forecast dialog. Click**OK**. The ARIMA forecast report is given:

- Scroll down to view the ARIMA Model
header:

- The ARIMA Model Summary is given as:

This is a summary of the model information for the deseasonalized data: ARIMA (0,1,1) with a constant. Seasonal Frequency = 12 using Decomposition and Model Selection Criterion = “AICc”. There are no seasonal terms in the model. The Box-Cox Transformation is “Rounded Lambda” with Lambda = 0 (Ln transformation).

- The Parameter Estimates for the
deseasonalized Airline Passenger data are:

- The ARIMA Model Statistics are:

• The number of observations, n = 144 – 24 (*withhold*) = 120

• Degrees of freedom (DF) = 120 (*n*) – 3 (2 terms in the model, 1 nonseasonal difference) = 117

• Note that the model statistics are based on the Ln transformed data, not the original data.

• Comparing to the Exponential Smoothing MSD Model Statistics, we see that the StDev and Variance are approximately equal, but the Log-Likelihood, AICc, AIC and BIC are very different. This is due to different formulas being used in the Likelihood function. You cannot use Information Criteria to compare ARIMA and Exponential Smooth models to determine which model has the best fit.

- The Forecast Accuracy metrics are:

Comparing to Exponential Smoothing MSD Forecast Accuracy metrics, we see that the ARIMA MSD results are approximately the same.

Forecast Accuracy metrics are calculated using the actual raw data versus inverse transformed forecast as displayed in the Forecast Chart and Table, so allow comparison across all model types and transformations.

- Click on the
**ARIMA MSD ACF PACF LB**sheet to view the ACF/PACF/LB Plots:

The ARIMA MSD ACF/PACF Residuals Plots are similar to the Exponential Smoothing MSD ACF/PACF/LB Plots and indicate that the autocorrelation has been accounted for in the model. All of the Ljung-Box P-Values are blue (i.e., not significant), with P-Values > .05.

- Click on the
**ARIMA MSD Residuals**sheet to view the Residual Plots:

The residuals are approximately normally distributed, with a roughly straight line on the normal probability plot. There are no obvious extreme outliers or patterns in the charts. The residual plots are approximately the same as those given previously in Exponential Smoothing MSD Residual Plots.

Since a Box-Cox transformation was used, the residuals are in Ln transformed units.

- Open
**Half-Hourly Multiple Seasonal Electricity Demand - Taylor.xlsx**(**Sheet 1**tab). This is half-hourly electricity demand (MW) in England and Wales from Monday, June 5, 2000 to Sunday, August 27, 2000 (taylor, R forecast). This data has multiple seasonality with frequency = 48 (observations per day) and 336 (observations per week), with a total of 4032 observations. See the Run Chart, ACF/PACF Plots, Spectral Density Plot and Seasonal Trend Decomposition Plots for this data.

- Click
**SigmaXL > Time Series Forecasting > ARIMA Forecast > Multiple Seasonal Decomposition Forecast**. Ensure that the entire data table is selected. If not, check**Use Entire Data Table**. Click**Next**.

- Select
*Demand*, click**Numeric Time Series Data (Y) >>**. Check**Display ACF/PACF/LB Plots**and**Display Residual Plots**. Check**Specify Model Periods**. Set**Withhold Periods**= 96. Select**Withhold Forecast Type: Multi-Step-Ahead with Prediction Interval at Start of Withhold**. Check**Seasonal Frequency**with**Specify**= 48 336. Leave**Box-Cox Transformation**unchecked. We will use the default**Prediction Interval**= 95.0 %.

**Withhold Periods**is 2*dominant seasonal frequency (48). Dominant frequency is obtained from the Spectral Density Plot. Start Model at Period = 1 is always greyed out for MSD.

- Click
**Model Options**.

- We will use the default
**Automatic Model Selection**with**AICc**as the**Model Selection Criterion**. Click**OK**to return to the ARIMA MSD Forecast dialog. Click**OK**. The ARIMA forecast report is given: - We will want
to zoom in on the last 3 days, i.e., 144 half-hourly time
periods, using chart scrolling. Click
**SigmaXL Chart Tools > Enable Scrolling**

You may be prompted with a warning message that custom formatting on the chart will be cleared. You can avoid seeing this warning by checking**Save this choice as default and do not show this form again**.

- Click
**OK**. The scroll dialog appears allowing you to specify the**Start Period**and**Window Width**. Enter**Start Period**= 3888 and**Window Width**= 144:

At any point, you can click**Restore/Show All Data Points**or**Freeze Chart**. Freezing the chart will remove the scroll and unload the dialog. The scroll dialog will also unload if you change worksheets. To restore the dialog, click**SigmaXL Chart Tools > Enable Scrolling**. -
Click
**OK**. A scroll bar appears beneath the forecast chart. You can also change the**Start Subgroup**and**Window Width**and**Update**.

You can scroll through by clicking to the right or left, with the specified window width of 144.

The blank dots are the data values in the withhold sample with a multi-step forecast and prediction intervals displayed at the start of the withhold sample. The model does quite well at predicting the withhold 96 half-hour demand values. Note that the prediction error increases the further out we predict. The ARIMA MSD Forecast Chart looks approximately the same as the previous Exponential Smoothing MSD Forecast Chart.

Click**Cancel**to exit the scroll dialog.

- Scroll down to view the ARIMA Model
header:

- The ARIMA Model Summary is given as:

This is a summary of the model information for the deseasonalized data: ARIMA (2,1,0) without a constant. Seasonal Frequency = 48, 336 using Decomposition and Model Selection Criterion = “AICc”. There are no seasonal terms in the model.

- The Parameter Estimates for the
deseasonalized Demand data are:

ARIMA Parameter Estimates include significance tests; P-Values < .05 are significant and highlighted in red. This may be useful for model refinement with multiple predictors but for AR/MA model order selection, minimum AICc should be used, rather than significance tests (see Kostenko, A.V. and Hyndman, R.J. ).

- The ARIMA Model Statistics are:

• The number of observations, n = 4032 – 96 (withhold) = 3936

• Degrees of freedom (DF) = 3936 (n) – 3 (2 terms in the model, 1 nonseasonal difference) = 3933

• Note that the model statistics are calculated using the deseasonalized data.

- The Forecast Accuracy metrics are:

As expected, the**Out-of Sample (Withhold) Multi-Step-Ahead Forecast**errors are larger than the**In-Sample (Estimation) One-Step-Ahead Forecast**errors. Note, if we were primarily interested in a short term one-step ahead forecast, then we would have selected**Withhold Forecast Type: One-Step-Ahead**and the above table would show**Out-of-Sample (Withhold) One-Step-Ahead Forecast**errors.

Comparing to the previous Exponential Smoothing MSD Forecast Accuracy metrics, we see that the ARIMA MSD results are approximately the same.

Forecast Accuracy metrics are calculated using the actual raw data versus forecast as displayed in the Forecast Chart and Table so, unlike the model statistics above, allows comparison across all forecast model types and transformations.

- The Forecast Table is given as:

These are the same forecast and prediction interval values displayed in the Forecast Chart, but provided for further analysis or charting (e.g., a run chart of the forecast errors). The**Withhold Data**is also displayed.

- Click on the
**ARIMA MSD ACF PACF LB**sheet to view the ACF/PACF/LB Plots:

The ACF/PACF Residuals Plots indicate that much of the autocorrelation has been accounted for in the model, but the Ljung-Box plot shows that some significant autocorrelation still remains (the red P-Values are significant at alpha=.05) - so the model can potentially be improved. This does not mean that the model is a bad model, it can still be very useful for prediction purposes, but the prediction intervals may not provide accurate coverage.

- Click on the
**ARIMA MSD Residuals**sheet to view the Residual Plots:

Similar to the Exponential Smoothing MSD Residual Plots, the residuals are not normally distributed and there are extreme outliers. These should be investigated with a control chart on the residuals. Outliers in Electricity Demand are often explained by Temperature.

# Web Demos

Our CTO and Co-Founder, John Noguera, regularly hosts free Web Demos featuring SigmaXL and DiscoverSim**Click here to view some now!**

# Contact Us

**Ph: **1.888.SigmaXL (744.6295)

**Support: **Support@SigmaXL.com

**Sales: **Sales@SigmaXL.com

**Information: **Information@SigmaXL.com