A regret lower bound for assortment optimization under the capacitated MNL model with arbitrary revenue parameters

Open Access
Authors
Publication date 10-2022
Journal Probability in the Engineering and Informational Sciences
Volume | Issue number 36 | 4
Pages (from-to) 1266-1274
Organisations
  • Faculty of Science (FNWI) - Korteweg-de Vries Institute for Mathematics (KdVI)
  • Faculty of Economics and Business (FEB) - Amsterdam Business School Research Institute (ABS-RI)
Abstract
In this note, we consider dynamic assortment optimization with incomplete information under the capacitated multinomial logit choice model. Recently, it has been shown that the regret (the cumulative expected revenue loss caused by offering suboptimal assortments) that any decision policy endures is bounded from below by a constant times √NT, where Ndenotes the number of products and T denotes the time horizon. This result is shown under the assumption that the product revenues are constant, and thus leaves the question open whether a lower regret rate can be achieved for nonconstant revenue parameters. In this note, we show that this is not the case: we show that, for any vector of product revenues there is a positive constant such that the regret of any policy is bounded from below by this constant times √NT. Our result implies that policies that achieve O(√NT) regret are asymptotically optimal for all product revenue parameters.

Document type Article
Language English
Published at https://doi.org/10.1017/S0269964821000395
Other links https://www.scopus.com/pages/publications/85114320027
Downloads
Permalink to this page
Back