Score distributions in information retrieval

Authors
Publication date 2009
Host editors
  • L. Azzopardi
  • G. Kazai
  • S. Robertson
  • S. RĂ¼ger
  • M. Shokouhi
  • D. Song
  • E. Yilmaz
Book title Advances in Information Retrieval Theory
Book subtitle Second International Conference on the Theory of Information Retrieval, ICTIR 2009 Cambridge, UK, September 10-12, 2009 : proceedings
ISBN
  • 9783642044168
ISBN (electronic)
  • 9783642044175
Series Lecture Notes in Computer Science
Event Second International Conference on the Theory of Information Retrieval (ICTIR 2009), Cambridge, UK
Pages (from-to) 139-151
Publisher Berlin: Springer
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
We review the history of modeling score distributions, focusing on the mixture of normal-exponential by investigating the theoretical as well as the empirical evidence supporting its use. We discuss previously suggested conditions which valid binary mixture models should satisfy, such as the Recall-Fallout Convexity Hypothesis, and formulate two new hypotheses considering the component distributions under some limiting conditions of parameter values. From all the mixtures suggested in the past, the current theoretical argument points to the two gamma as the most-likely universal model, with the normal-exponential being a usable approximation. Beyond the theoretical contribution, we provide new experimental evidence showing vector space or geometric models, and BM25, as being "friendly" to the normal-exponential, and that the non-convexity problem that the mixture possesses is practically not severe.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-642-04417-5_13
Permalink to this page
Back