The Solitude of Relevant Documents in the Pool

A. Lipani; M. Lupu; E. Kanoulas; A. Hanbury

doi:https://doi.org/10.1145/2983323.2983891

The Solitude of Relevant Documents in the Pool

Authors	A. Lipani M. Lupu E. Kanoulas A. Hanbury
Publication date	2016
Book title	CIKM'16
Book subtitle	proceedings of the 2016 ACM Conference on Information and Knowledge Management : October 24-28, 2016, Indianapolis, IN, USA
ISBN	9781450340731
Event	25th ACM International Conference on Information and Knowledge Management
Pages (from-to)	1989-1992
Publisher	New York, NY: Association for Computing Machinery
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Pool bias is a well understood problem of test-collection based benchmarking in information retrieval. The pooling method itself is designed to identify all relevant documents. In practice, 'all' translates to `as many as possible given some budgetary constraints' and the problem persists, albeit mitigated. Recently, methods to address this pool bias for previously created test collections have been proposed, for the evaluation measure precision at cut-off (P@n). Analyzing previous methods, we make the empirical observation that the distribution of the probability of providing new relevant documents to the pool, over the runs, is log-normal (when the pooling strategy is fixed depth at cut-off). We use this observation to calculate a prior probability of providing new relevant documents, which we then use in a pool bias estimator that improves upon previous estimates of precision at cut-off. Through extensive experimental results, covering 15 test collections, we show that the proposed bias correction method is the new state of the art, providing the closest estimates yet when compared to the original pool.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1145/2983323.2983891 (Final published version)
Downloads	p1989-lipani (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

The Solitude of Relevant Documents in the Pool