Demand Forecasting in the Presence of Privileged Information
| Authors | |
|---|---|
| Publication date | 2020 |
| Host editors |
|
| Book title | Advanced Analytics and Learning on Temporal Data |
| Book subtitle | 5th ECML PKDD Workshop, AALTD 2020, Ghent, Belgium, September 18, 2020 : revised selected papers |
| ISBN |
|
| ISBN (electronic) |
|
| Series | Lecture Notes in Computer Science |
| Event | Advanced Analytics and Learning on Temporal Data |
| Pages (from-to) | 46-62 |
| Publisher | Cham: Springer |
| Organisations |
|
| Abstract |
Predicting the amount of sales in the future is a fundamental problem in
the replenishment process of retail companies. Models for forecasting
the demand of an item typically rely on influential features and
historical sales of the item. However, the values of some influential
features (to which we refer as non-plannable features)
are only known during model training (for the past), and not for the
future at prediction time. Examples of such features include sales in
other channels, such as other stores in chain supermarkets. Existing
forecasting methods ignore such non-plannable features or wrongly assume
that they are also known at prediction time. We identify non-plannable
features as privileged information, i.e., information that is available
at training time but not at prediction time, and design a neural network
to leverage this source of data accordingly. We present a dual branch
neural network architecture that incorporates non-plannable features at
training time, with a first branch to embed the historical information,
and a second branch, the privileged information (PI) branch,
to predict demand based on privileged information. Next, we leverage a
single branch network at prediction time, which applies a simulation
component to mimic the behavior of the PI branch, whose inputs are not
available at prediction time. We evaluate our approach on two real-world
forecasting datasets, and find that it outperforms state-of-the-art
competitors in terms of mean absolute error and symmetric mean absolute
percentage error metrics. We further provide visualizations and conduct
experiments to validate the contribution of different components in our
proposed architecture.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1007/978-3-030-65742-0_4 |
| Downloads |
ariannezhad-2020-demand
(Accepted author manuscript)
|
| Permalink to this page | |
