Demand Forecasting in the Presence of Privileged Information

Open Access
Authors
Publication date 2020
Host editors
  • V. Lemaire
  • S. Malinowski
  • A. Bagnall
  • T. Guyet
  • R. Tavenard
  • G. Ifrim
Book title Advanced Analytics and Learning on Temporal Data
Book subtitle 5th ECML PKDD Workshop, AALTD 2020, Ghent, Belgium, September 18, 2020 : revised selected papers
ISBN
  • 9783030657413
ISBN (electronic)
  • 9783030657420
Series Lecture Notes in Computer Science
Event Advanced Analytics and Learning on Temporal Data
Pages (from-to) 46-62
Publisher Cham: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Predicting the amount of sales in the future is a fundamental problem in the replenishment process of retail companies. Models for forecasting the demand of an item typically rely on influential features and historical sales of the item. However, the values of some influential features (to which we refer as non-plannable features) are only known during model training (for the past), and not for the future at prediction time. Examples of such features include sales in other channels, such as other stores in chain supermarkets. Existing forecasting methods ignore such non-plannable features or wrongly assume that they are also known at prediction time. We identify non-plannable features as privileged information, i.e., information that is available at training time but not at prediction time, and design a neural network to leverage this source of data accordingly. We present a dual branch neural network architecture that incorporates non-plannable features at training time, with a first branch to embed the historical information, and a second branch, the privileged information (PI) branch, to predict demand based on privileged information. Next, we leverage a single branch network at prediction time, which applies a simulation component to mimic the behavior of the PI branch, whose inputs are not available at prediction time. We evaluate our approach on two real-world forecasting datasets, and find that it outperforms state-of-the-art competitors in terms of mean absolute error and symmetric mean absolute percentage error metrics. We further provide visualizations and conduct experiments to validate the contribution of different components in our proposed architecture.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-030-65742-0_4
Downloads
ariannezhad-2020-demand (Accepted author manuscript)
Permalink to this page
Back