Focus and element length for book and Wikipedia retrieval
| Authors | |
|---|---|
| Publication date | 2011 |
| Host editors |
|
| Book title | Comparative Evaluation of Focused Retrieval |
| Book subtitle | 9th International Workshop of the Inititative for the Evaluation of XML Retrieval, INEX 2010, Vugh, The Netherlands, December 13-15, 2010 : revised selected papers |
| ISBN |
|
| ISBN (electronic) |
|
| Series | Lecture Notes in Computer Science |
| Event | 9th International Workshop of the INitiative for the Evaluation of XML retrieval (INEX 2010), Vught, the Netherlands |
| Pages (from-to) | 140-153 |
| Publisher | Heidelberg: Springer |
| Organisations |
|
| Abstract |
In this paper we describe our participation in INEX 2010 in the Ad Hoc Track and the Book Track. In the Ad Hoc track we investigate the impact of propagated anchor-text on article level precision and the impact of an element length prior on the within-document precision and recall. Using the article ranking of an document level run for both document and focused retrieval techniques, we find that focused retrieval techniques clearly outperform document retrieval, especially for the Focused and Restricted Relevant in Context Tasks, which limit the amount of text than can be returned per topic and per article respectively. Somewhat surprisingly, an element length prior increases within-document precision even when we restrict the amount of retrieved text to only 1000 characters per topic. The query-independent evidence of the length prior can help locate elements with a large fraction of relevant text. For the Book Track we look at the relative impact of retrieval units based on whole books, individual pages and multiple pages.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1007/978-3-642-23577-1_12 |
| Permalink to this page | |
