Holt-Winters Exponential Smoothing - Statistical Modeling and Forecasting (2024)

A super-fast forecasting technique for time series data

Holt-Winters Exponential Smoothing is used for forecasting time series data that exhibits both a trend and a seasonal variation. The Holt-Winters technique is made up of the following four forecasting techniques stacked one over the other:

[Figure: the four techniques stacked in order: weighted averages, exponential smoothing, Holt exponential smoothing, Holt-Winters exponential smoothing]

Weighted Averages: A weighted average is an average of n numbers in which each number is multiplied by a weight and the denominator is the sum of those n weights. The weights are often assigned by a weighting function; common choices are logarithmic, linear, quadratic, cubic and exponential. Used as a time series forecasting technique, averaging smooths out the variation in the historical values while calculating the forecast. By choosing a suitable weighting function, the forecaster determines which historical values are emphasized when calculating future values of the time series.
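To see how the choice of weighting function shifts the emphasis, here is a minimal sketch in plain Python. The function names and the sample numbers are made up for illustration; they are not part of any library.

```python
def weighted_average(values, weights):
    """Weighted average: sum(w_i * x_i) / sum(w_i)."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    return sum(w * x for w, x in zip(weights, values)) / sum(weights)


def linear_weights(n):
    """Weights 1, 2, ..., n: the newest value (last in the list) weighs the most."""
    return [i + 1 for i in range(n)]


def exponential_weights(n, base=2.0):
    """Weights base**0, ..., base**(n-1): emphasis grows geometrically with recency."""
    return [base ** i for i in range(n)]


history = [10.0, 12.0, 11.0, 15.0]  # oldest first, newest last (made-up numbers)

equal = weighted_average(history, [1] * len(history))
linear = weighted_average(history, linear_weights(len(history)))
expo = weighted_average(history, exponential_weights(len(history)))
print(equal, linear, expo)
```

The more steeply the weights rise toward the newest value, the closer the average moves to the most recent observation (15.0 in this toy series).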

Exponential Smoothing: The Exponential Smoothing (ES) technique forecasts the next value using a weighted average of all previous values, where the weights decay exponentially from the most recent to the oldest historical value. When you use ES, you are making the crucial assumption that recent values of the time series are much more important to you than older values. The ES technique has two big shortcomings: it cannot be used when your data exhibits a trend, and it cannot be used when your data exhibits seasonal variation.
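The exponentially decaying weights do not have to be computed explicitly; they fall out of a one-line recurrence. The sketch below assumes the smoothed value is started at the first observation, which is one common convention among several, and the function name is illustrative.

```python
def simple_exponential_smoothing(values, alpha):
    """Return the one-step-ahead forecast after processing all values.

    Each update is s = alpha * x + (1 - alpha) * s, so an observation that
    lies k steps in the past ends up with a weight proportional to
    (1 - alpha) ** k.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    smoothed = values[0]  # assumption: start from the first observation
    for x in values[1:]:
        smoothed = alpha * x + (1.0 - alpha) * smoothed
    return smoothed


history = [10.0, 12.0, 11.0, 15.0]  # made-up numbers, oldest first
es_forecast = simple_exponential_smoothing(history, alpha=0.5)
print(es_forecast)
```

Note that the forecast is a single flat value for every future step, which is why simple ES struggles with trending or seasonal data.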

Holt Exponential Smoothing: The Holt ES technique fixes one of the two shortcomings of the simple ES technique. Holt ES can be used to forecast time series data that has a trend. But Holt ES fails in the presence of seasonal variations in the time series.
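A rough sketch of how Holt ES adds a trend term on top of simple ES follows. The initialization (level set to the first value, trend to the first difference) is an assumption made for brevity, and the function name is hypothetical.

```python
def holt_forecast(values, alpha, beta, steps):
    """Holt's linear-trend smoothing; returns forecasts for the next `steps` points."""
    if len(values) < 2:
        raise ValueError("need at least two observations")
    level = values[0]
    trend = values[1] - values[0]  # assumption: simple starting trend
    for x in values[1:]:
        prev_level = level
        # new level: blend the observation with the previous projection
        level = alpha * x + (1 - alpha) * (prev_level + trend)
        # new trend: blend the latest change in level with the previous trend
        trend = beta * (level - prev_level) + (1 - beta) * trend
    # the forecast extends the last level along the last trend
    return [level + k * trend for k in range(1, steps + 1)]


rising = [2.0, 4.0, 6.0, 8.0, 10.0]  # made-up series with a steady trend of +2
holt_out = holt_forecast(rising, alpha=0.5, beta=0.3, steps=3)
print(holt_out)
```

On this perfectly linear toy series the forecasts continue the line (roughly 12, 14, 16), whereas simple ES would predict a flat value.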

Holt-Winters Exponential Smoothing: The Holt-Winters ES modifies the Holt ES technique so that it can be used in the presence of both trend and seasonality.

To understand how Holt-Winters Exponential Smoothing works, one must understand the following four aspects of a time series:

Level

The concept of level is best understood with an example. The following time series shows the closing stock price of Merck & Co. on NYSE. The horizontal red lines indicate some of the levels in the time series in its up and down journey:

[Figure: closing stock price of Merck & Co. on NYSE, with horizontal red lines marking some of the levels]

Trend

A time series whose level changes in some sort of pattern is said to have a trend. A time series whose level changes randomly around some mean value can be said to exhibit a random trend. Beyond knowing that it is random, a random trend is of little use for forecasting; the concept becomes useful when the trend can be modeled by some function.

Let’s zoom into one particular area of the above stock price chart to illustrate the concept of a positive trend:

[Figure: zoomed-in section of the Merck & Co. stock price chart showing a positive trend]

Some of the commonly observed trends are linear, quadratic, exponential, logarithmic, square root, inverse, and polynomial of degree 3 or higher. These trends can be modeled using the corresponding mathematical function, for example a linear function, x², exp(x) or log(x).

Highly non-linear trends require complex modeling techniques such as artificial neural networks to model them successfully.

A useful way to look at trend is as a rate or as the velocity of the time series at a given level.

This makes trend a vector that has a magnitude (rate of change) and a direction (increasing or decreasing).

Let’s keep this interpretation of trend as a rate or velocity at the back of our minds. We’ll use it when we deconstruct the forecasting equation of Holt-Winters Exponential Smoothing.
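As a small illustration of trend as a rate, the change per time step can be read off consecutive values. The prices below are made up, and `step_rates` is a hypothetical helper.

```python
def step_rates(values):
    """Change per time step between consecutive observations."""
    return [later - earlier for earlier, later in zip(values, values[1:])]


prices = [50.0, 52.0, 55.0, 54.0, 58.0]  # made-up closing prices
rates = step_rates(prices)               # magnitude and sign of each step
average_rate = sum(rates) / len(rates)   # one crude estimate of the trend
print(rates, average_rate)
```

The sign of each rate gives the direction and its size gives the magnitude, which is the vector reading of trend described above.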

Seasonality

Many time series show periodic up and down movements around the current level. This periodic up and down movement is called seasonality. Here is an example of a time series demonstrating a seasonal pattern:

[Figure: a time series with a seasonal pattern]

Noise

Noise is simply the aspect of the time series data that you cannot (or do not want to) explain.

Level, Trend, Seasonality and Noise are considered to interact in an additive or multiplicative manner to produce the final value of the time series that you observe:

Multiplicative combination (with additive trend)

[Equation image: observed value as a multiplicative combination of level, trend, seasonality and noise, with additive trend]

Fully additive combination

[Equation image: observed value as a fully additive combination of level, trend, seasonality and noise]
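The practical difference between the two combinations can be sketched by generating a noise-free value each way. The exact form below (a sine-shaped seasonal pattern, a straight-line level) is an assumption chosen for illustration and is not taken from the equation images.

```python
import math


def seasonal_index(t, m):
    """A smooth seasonal pattern with period m, centred on zero."""
    return math.sin(2 * math.pi * t / m)


def additive_value(t, level0, slope, amplitude, m):
    """Level + trend + seasonality (noise omitted)."""
    return level0 + slope * t + amplitude * seasonal_index(t, m)


def multiplicative_value(t, level0, slope, rel_amplitude, m):
    """(Level + trend) * seasonal factor (noise omitted)."""
    return (level0 + slope * t) * (1.0 + rel_amplitude * seasonal_index(t, m))


m = 12
# Height of the seasonal peak above the trend line, one cycle apart (t = 3 and t = 15)
add_swings = [additive_value(t, 100.0, 5.0, 10.0, m) - (100.0 + 5.0 * t) for t in (3, 15)]
mul_swings = [multiplicative_value(t, 100.0, 5.0, 0.1, m) - (100.0 + 5.0 * t) for t in (3, 15)]
print(add_swings, mul_swings)
```

In the additive case the seasonal swing stays the same size as the level rises; in the multiplicative case it grows in proportion to the level. That visual cue is what you look for in a plot when choosing between the two.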

The Holt-Winters Exponential Smoothing Equation

We are now ready to look at the forecasting equations of the Holt-Winters Exponential Smoothing technique. We’ll first consider the case where the trend adds to the current level, but the seasonality is multiplicative. This is a common situation in real-world time series data.

Since we are specifying the forecasting model’s equations, we’ll leave out the noise term.

[Equation image: Holt-Winters forecast F_(i+k) with additive trend and multiplicative seasonality]

In the above equation, we are forecasting the value of the time series k time steps out into the future starting from some arbitrary step i. The seasonal variation is assumed to have a known period length of m time steps. For example, for monthly data with an annual seasonal pattern, m = 12.
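For reference, the textbook form of this forecast for an additive trend and multiplicative seasonality can be written in the notation used here (L_i for level, B_i for trend, S_i for the seasonal factor, m for the seasonal period). This is a reconstruction from the standard formulation, not a copy of the original equation image:

```latex
F_{i+k} = \left( L_i + k \, B_i \right) \, S_{i+k-m}, \qquad 1 \le k \le m
```

The term L_i + k B_i extends the current level along the trend for k steps, and S_(i+k-m) is the most recent seasonal factor estimated for the same position in the cycle. For k > m, the seasonal index would have to be wrapped back by further multiples of m.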

Let’s see how we can estimate L_i, B_i and S_i.

Let’s start with the estimate of trend B_i at step i:

[Equation image: recurrence relation for the trend estimate B_i]
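The recurrence relations for L_i, B_i and S_i can be sketched in a few lines of Python. The version below uses the widely taught textbook form for additive trend and multiplicative seasonality; it is an assumption that this matches the equations shown in the images (Winters' original paper, for instance, updates the seasonal factor using T_i / L_i). The function name and the toy data are made up.

```python
def holt_winters_forecast(values, m, alpha, beta, gamma,
                          level0, trend0, seasonals0, steps):
    """Additive trend, multiplicative seasonality (textbook recurrences).

    values      observations, oldest first
    level0      level just before values[0]
    trend0      trend just before values[0]
    seasonals0  m starting seasonal factors, one per position in the cycle
    """
    level, trend = level0, trend0
    seasonals = list(seasonals0)
    for t, y in enumerate(values):
        season = seasonals[t % m]  # S_(i-m): last factor for this position
        prev_level, prev_trend = level, trend
        # L_i: deseasonalized observation blended with the previous projection
        level = alpha * (y / season) + (1 - alpha) * (prev_level + prev_trend)
        # B_i: latest change in level blended with the previous trend
        trend = beta * (level - prev_level) + (1 - beta) * prev_trend
        # S_i: ratio of observation to projected level blended with the old factor
        seasonals[t % m] = (gamma * (y / (prev_level + prev_trend))
                            + (1 - gamma) * season)
    n = len(values)
    # F_(i+k) = (L_i + k * B_i) * seasonal factor for the matching position
    return [(level + k * trend) * seasonals[(n + k - 1) % m]
            for k in range(1, steps + 1)]


# Noise-free toy series: levels 11, 12, ..., 16 times factors 0.5, 1.5, 0.5, ...
toy = [5.5, 18.0, 6.5, 21.0, 7.5, 24.0]
toy_forecast = holt_winters_forecast(toy, m=2, alpha=0.4, beta=0.3, gamma=0.2,
                                     level0=10.0, trend0=1.0,
                                     seasonals0=[0.5, 1.5], steps=3)
print(toy_forecast)
```

Because the toy series has no noise and the starting state is exact, the recurrences leave the level, trend and seasonal factors on their true paths, and the forecasts continue the pattern (approximately 8.5, 27.0 and 9.5). On real data the three coefficients control how quickly each component reacts to new observations.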

Estimation of Initial conditions

Since all equations for the Holt-Winters method are recurrence relations, we need to supply a set of initial values to these estimating equations to get the forecasting engine started. Specifically, we need to set the values of L_0, B_0 and S_0.

There are several ways to set these initial values. I’ll explain the technique used by the Python statsmodels library. (We’ll soon use statsmodels for building a Holt-Winters ES estimator and use it to forecast 12 time steps out in the future).

Estimating L_0: Statsmodels sets L_0 to the average of all observed values of the time series that you supply it, lying at indexes 0, m, 2m, 3m and so on, where m is the seasonal period. For example, if you tell statsmodels that your time series exhibits a seasonal period of 12 months, it will calculate L_0 as follows:

L_0 = (T_0 + T_12 + T_24 + T_36 + …) / (number of values averaged)

Note that T_0 is the oldest value in your time series data.

You can use the Holt-Winters forecasting technique even if your time series does not display seasonality. In this case, statsmodels will set L_0 to the first value of the training data set, that is:

L_0 = T_0, when there is no seasonal variation in the data
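The two cases for L_0 can be written as a short helper. This follows the description in this article rather than the statsmodels source code, and the exact initialization may differ between statsmodels versions; `initial_level` is a hypothetical name.

```python
def initial_level(values, m=None):
    """L_0 as described above.

    With a seasonal period m: mean of the values at indexes 0, m, 2m, ...
    Without seasonality (m is None): the first observation T_0.
    """
    if m is None:
        return values[0]
    picked = values[::m]  # T_0, T_m, T_2m, ...
    return sum(picked) / len(picked)


series = [float(t) for t in range(36)]  # made-up data: 0.0, 1.0, ..., 35.0
seasonal_l0 = initial_level(series, m=12)   # mean of 0, 12 and 24
plain_l0 = initial_level(series)            # simply the first value
print(seasonal_l0, plain_l0)
```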

Estimating B_0: If your time series displays an additive trend, i.e. its level changes linearly, statsmodels estimates the initial trend B_0 by calculating the rate of change of the observed value T_i across m time steps and then taking the mean of these rates. For example, if you tell statsmodels that your time series exhibits an additive trend and it has a seasonal period of 12 months, it will calculate B_0 as follows:

[Equation image: estimate of B_0 for an additive trend with m = 12]

If your time series exhibits a multiplicative trend, i.e. the level grows at a rate that is proportional to the current level, statsmodels uses a slightly more complex-looking estimator for B_0. It is best illustrated using the example of annual seasonality (m=12):

[Equation image: estimate of B_0 for a multiplicative trend with m = 12]

But if your time series does not display a seasonal variation, B_0 is simply set to T_1/T_0 if the trend is multiplicative, or to (T_1 - T_0) if the trend is additive.
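Here is a sketch of the B_0 cases that are spelled out in the text. For the seasonal additive case it assumes that the rates are taken over the first m pairs (T_m - T_0, T_(m+1) - T_1, and so on); that detail is an assumption, not something confirmed from the statsmodels source. The seasonal multiplicative estimator is left out because its formula is not reproduced here. `initial_trend` is a hypothetical name.

```python
def initial_trend(values, trend="add", m=None):
    """B_0 for the cases described above."""
    if m is None:
        # no seasonality: use the first two observations
        if trend == "mul":
            return values[1] / values[0]
        return values[1] - values[0]
    if trend != "add":
        raise NotImplementedError("seasonal multiplicative case not sketched here")
    if len(values) < 2 * m:
        raise ValueError("need at least 2 * m observations")
    # rate of change across m time steps, averaged over the first m pairs (assumption)
    rates = [(values[j + m] - values[j]) / m for j in range(m)]
    return sum(rates) / len(rates)


line = [2.0 * t for t in range(24)]  # made-up data rising by 2 per step
b0_seasonal = initial_trend(line, trend="add", m=12)
b0_additive = initial_trend(line, trend="add")
b0_multiplicative = initial_trend([5.0, 10.0, 20.0], trend="mul")
print(b0_seasonal, b0_additive, b0_multiplicative)
```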

Estimating α, β and γ

The weighting coefficients α, β and γ are estimated by giving them initial values and then iteratively optimizing them against some suitable score. Minimization of the mean squared error (MSE) is a commonly used optimization goal. Statsmodels sets the initial α to 1/(2m), the initial β to 1/(20m) and, when there is seasonality, the initial γ to (1 - α)/20.
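The starting values quoted above, together with the idea of tuning a coefficient against the MSE, can be sketched as follows. The coarse grid search over α for simple exponential smoothing is only an illustrative stand-in for the numerical optimizer that statsmodels runs over all coefficients; the function names are made up.

```python
def starting_smoothing_params(m, seasonal=True):
    """Starting values as quoted in the text: 1/(2m), 1/(20m), (1 - alpha)/20."""
    alpha = 1.0 / (2 * m)
    beta = 1.0 / (20 * m)
    gamma = (1.0 - alpha) / 20 if seasonal else None
    return alpha, beta, gamma


def one_step_mse(values, alpha):
    """Mean squared one-step-ahead error of simple exponential smoothing."""
    smoothed = values[0]
    squared_errors = []
    for x in values[1:]:
        squared_errors.append((x - smoothed) ** 2)  # forecast for x is `smoothed`
        smoothed = alpha * x + (1 - alpha) * smoothed
    return sum(squared_errors) / len(squared_errors)


def best_alpha(values, grid):
    """Pick the alpha on the grid with the smallest one-step MSE."""
    return min(grid, key=lambda a: one_step_mse(values, a))


alpha0, beta0, gamma0 = starting_smoothing_params(m=12)
chosen = best_alpha([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], grid=[0.1, 0.5, 0.9])
print(alpha0, beta0, gamma0, chosen)
```

On the steadily rising toy series the largest α on the grid wins, because a smoother that reacts quickly falls less far behind the rising values.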

Once L_0, B_0 and S_0 are estimated, and α, β and γ are set, we can use the recurrence relations for L_i, B_i, S_i, F_i and F_(i+k) to estimate the value of the time series at steps 0, 1, 2, 3,…, i,…,n,n+1,n+2,…,n+k.

If your training data set has n data points, then positions n+1,n+2,…,n+k correspond to the k out-of-sample forecasts that you would generate using the Holt-Winters estimation technique.

Using the Holt-Winters Exponential Smoothing in Python

We’ll estimate 12 future values of the time series of retail sales of used car dealers in the United States using the Holt-Winters Exponential Smoothing technique:

[Figure: retail sales of used car dealers in the United States]

The data set is available for download; its source is listed under Data Set in the Citations and Copyrights section.

Let’s start by importing all the required packages.

import pandas as pd
from matplotlib import pyplot as plt
from statsmodels.tsa.holtwinters import ExponentialSmoothing as HWES

Read the data set into a Pandas data frame. Note that the Date column (column 0) is the index column and it has the format mm-dd-yyyy.

df = pd.read_csv('retail_sales_used_car_dealers_us_1992_2020.csv', header=0, parse_dates=[0], index_col=[0])

Set the index frequency explicitly to month start ('MS') so that statsmodels does not have to try to infer it.

df.index.freq = 'MS'

Plot the data:

df.plot()
plt.show()

We get the following chart:

[Figure: plot of the retail sales of used car dealers time series]

Split the data into training and test sets. The last 12 periods form the test data.

df_train = df.iloc[:-12]
df_test = df.iloc[-12:]

Build and train the model on the training data. In the above chart, the level of the time series seems to be increasing linearly. So we set the trend as additive. However, the seasonal variation around each level seems to be increasing in proportion to the current level. So we set the seasonality to multiplicative.

model = HWES(df_train, seasonal_periods=12, trend='add', seasonal='mul')
fitted = model.fit()

Print out the training summary.

print(fitted.summary())

We get the following output:

[Figure: output of fitted.summary()]

Create an out of sample forecast for the next 12 steps beyond the final data point in the training data set.

sales_forecast = fitted.forecast(steps=12)

Plot the training data, the test data and the forecast on the same plot.

fig = plt.figure()
fig.suptitle('Retail Sales of Used Cars in the US (1992-2020)')
past, = plt.plot(df_train.index, df_train, 'b.-', label='Sales History')
future, = plt.plot(df_test.index, df_test, 'r.-', label='Actual Sales')
predicted_future, = plt.plot(df_test.index, sales_forecast, 'g.-', label='Sales Forecast')
plt.legend(handles=[past, future, predicted_future])
plt.show()

[Figure: training data, test data and the 12-step forecast on the same plot]

Let’s zoom into the last 12 periods. You can see that the forecast lags behind sharp turning points, as is to be expected of any forecasting technique based on moving averages:

[Figure: actual sales versus forecast for the last 12 periods]
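Besides eyeballing the plot, the gap between the forecast and the held-out values can be summarized with an error measure. The sketch below uses made-up numbers and hypothetical helper names; in the script from this article you would presumably pass the values of df_test and sales_forecast instead.

```python
def mean_absolute_error(actual, predicted):
    """Average absolute gap between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mean_absolute_percentage_error(actual, predicted):
    """Average absolute gap as a percentage of the actual value (actuals must be non-zero)."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


actual_sales = [100.0, 120.0, 90.0, 110.0]    # made-up test values
forecast_sales = [95.0, 110.0, 99.0, 110.0]   # made-up forecasts
mae = mean_absolute_error(actual_sales, forecast_sales)
mape = mean_absolute_percentage_error(actual_sales, forecast_sales)
print(mae, mape)
```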

Here is the complete source code:

	import pandas as pd
	from matplotlib import pyplot as plt
	from statsmodels.tsa.holtwinters import ExponentialSmoothing as HWES

	#read the data file. the date column is expected to be in the mm-dd-yyyy format.
	df = pd.read_csv('retail_sales_used_car_dealers_us_1992_2020.csv', header=0, parse_dates=[0], index_col=[0])
	df.index.freq = 'MS'

	#plot the data
	df.plot()
	plt.show()

	#split between the training and the test data sets. The last 12 periods form the test data
	df_train = df.iloc[:-12]
	df_test = df.iloc[-12:]

	#build and train the model on the training data
	model = HWES(df_train, seasonal_periods=12, trend='add', seasonal='mul')
	fitted = model.fit(optimized=True, use_brute=True)

	#print out the training summary
	print(fitted.summary())

	#create an out-of-sample forecast for the next 12 steps beyond the final data point in the training data set
	sales_forecast = fitted.forecast(steps=12)

	#plot the training data, the test data and the forecast on the same plot
	fig = plt.figure()
	fig.suptitle('Retail Sales of Used Cars in the US (1992-2020)')
	past, = plt.plot(df_train.index, df_train, 'b.-', label='Sales History')
	future, = plt.plot(df_test.index, df_test, 'r.-', label='Actual Sales')
	predicted_future, = plt.plot(df_test.index, sales_forecast, 'g.-', label='Sales Forecast')
	plt.legend(handles=[past, future, predicted_future])
	plt.show()


Citations and Copyrights

Data Set

U.S. Census Bureau, Retail Sales: Used Car Dealers [MRTSSM44112USN], retrieved from FRED, Federal Reserve Bank of St. Louis; https://fred.stlouisfed.org/series/MRTSSM44112USN, June 17, 2020, under FRED copyright terms.

SILSO, World Data Center — Sunspot Number and Long-term Solar Observations, Royal Observatory of Belgium, on-line Sunspot Number catalogue: http://www.sidc.be/SILSO/, 1818–2020 (CC-BY-NA)

Merck & Co., Inc. (MRK), NYSE — Historical Adjusted Closing Price. Currency in USD, https://finance.yahoo.com/quote/MRK/history?p=MRK, 23-Jul-2020. Copyright Yahoo Finance and NYSE

Papers

Peter R. Winters, Forecasting Sales by Exponentially Weighted Moving Averages. Management Science 6 (3) 324-342. https://doi.org/10.1287/mnsc.6.3.324

Makridakis, S., Wheelwright, S. C., Hyndman, R. J. Forecasting Methods and Applications. Third Ed. John Wiley & Sons.

Images

All images are copyright Sachin Date under CC-BY-NC-SA, unless a different source and copyright are mentioned underneath the image.
