Focus and element length for book and Wikipedia retrieval

Authors
Publication date 2011
Host editors
  • S. Geva
  • J. Kamps
  • R. Schenkel
  • A. Trotman
Book title Comparative Evaluation of Focused Retrieval
Book subtitle 9th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2010, Vught, The Netherlands, December 13-15, 2010: revised selected papers
ISBN
  • 9783642235764
ISBN (electronic)
  • 9783642235771
Series Lecture Notes in Computer Science
Event 9th International Workshop of the INitiative for the Evaluation of XML retrieval (INEX 2010), Vught, the Netherlands
Pages (from-to) 140-153
Publisher Heidelberg: Springer
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
In this paper we describe our participation in INEX 2010 in the Ad Hoc Track and the Book Track. In the Ad Hoc Track we investigate the impact of propagated anchor text on article-level precision and the impact of an element length prior on within-document precision and recall. Using the article ranking of a document-level run for both document and focused retrieval techniques, we find that focused retrieval techniques clearly outperform document retrieval, especially for the Focused and Restricted Relevant in Context Tasks, which limit the amount of text that can be returned per topic and per article, respectively. Somewhat surprisingly, an element length prior increases within-document precision even when we restrict the amount of retrieved text to only 1000 characters per topic. The query-independent evidence of the length prior can help locate elements with a large fraction of relevant text. For the Book Track we look at the relative impact of retrieval units based on whole books, individual pages, and multiple pages.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-642-23577-1_12