The importance of link evidence in Wikipedia

Authors
Publication date 2008
Host editors
  • C. Macdonald
  • I. Ounis
  • V. Plachouras
  • I. Ruthven
  • R.W. White
Book title Advances in Information Retrieval
Book subtitle 30th European Conference on IR Research, ECIR 2008, Glasgow, UK, March 30-April 3, 2008 : proceedings
ISBN
  • 9783540786450
ISBN (electronic)
  • 9783540786467
Series Lecture Notes in Computer Science
Event 30th European Conference on IR Research (ECIR 2008), Glasgow, UK
Pages (from-to) 270-282
Publisher Berlin: Springer
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
Wikipedia is one of the most popular information sources on the Web. The free encyclopedia is densely linked. The link structure in Wikipedia differs from the Web at large: internal links in Wikipedia are typically based on words naturally occurring in a page, and link to another semantically related entry. Our main aim is to find out if Wikipedia’s link structure can be exploited to improve ad hoc information retrieval. We first analyse the relation between Wikipedia links and the relevance of pages. We then experiment with use of link evidence in the focused retrieval of Wikipedia content, based on the test collection of INEX 2006. Our main findings are: First, our analysis of the link structure reveals that the Wikipedia link structure is a (possibly weak) indicator of relevance. Second, our experiments on INEX ad hoc retrieval tasks reveal that if the link evidence is made sensitive to the local context we see a significant improvement of retrieval effectiveness. Hence, in contrast with earlier TREC experiments using crawled Web data, we have shown that Wikipedia’s link structure can help improve the effectiveness of ad hoc retrieval.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-540-78646-7_26
Permalink to this page
Back