To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions

R. Jagerman; H. Oosterhuis; M. de Rijke

doi:https://doi.org/10.1145/3331184.3331269

To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions

Authors	R. Jagerman H. Oosterhuis M. de Rijke
Publication date	2019
Book title	SIGIR '19
Book subtitle	proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval : July 21-25, 2019, Paris, France
ISBN (electronic)	9781450361729
Event	42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2019
Pages (from-to)	15-24
Number of pages	10
Publisher	New York, New York: The Association for Computing Machinery
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Learning to Rank (LTR) from user interactions is challenging as user feedback often contains high levels of bias and noise. At the moment, two methodologies for dealing with bias prevail in the field of LTR: counterfactual methods that learn from historical data and model user behavior to deal with biases; and online methods that perform interventions to deal with bias but use no explicit user models. For practitioners the decision between either methodology is very important because of its direct impact on end users. Nevertheless, there has never been a direct comparison between these two approaches to unbiased LTR. In this study we provide the first benchmarking of both counterfactual and online LTR methods under different experimental conditions. Our results show that the choice between the methodologies is consequential and depends on the presence of selection bias, and the degree of position bias and interaction noise. In settings with little bias or noise counterfactual methods can obtain the highest ranking performance; however, in other circumstances their optimization can be detrimental to the user experience. Conversely, online methods are very robust to bias and noise but require control over the displayed rankings. Our findings confirm and contradict existing expectations on the impact of model-based and intervention-based methods in LTR, and allow practitioners to make an informed decision between the two methodologies.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1145/3331184.3331269
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions