Is Wikipedia link structure different?

Authors
Publication date 2010
Host editors
  • R. Baeza-Yates
  • P. Boldi
  • B. Ribeiro-Neto
  • B.B. Cambazoglu
Book title Proceedings of the Second ACM International Conference on Web Search and Data Mining (WSDM 2009): Barcelona, Spain February 9-12, 2009
ISBN
  • 9781605583907
Event Second ACM International Conference on Web Search and Data Mining (WSDM 2009), Barcelona, Spain
Pages (from-to) 232-241
Publisher New York: ACM Press
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
In this paper, we investigate the difference between Wikipedia and Web link structure with respect to their value as indicators of the relevance of a page for a given topic of request. Our experimental evidence is from two IR test-collections: the .GOV collection used at the TREC Web tracks and the Wikipedia XML Corpus used at INEX. We first perform a comparative analysis of Wikipedia and .GOV link structure and then investigate the value of link evidence for improving search on Wikipedia and on the .GOV domain. Our main findings are: First, Wikipedia link structure is similar to the Web, but more densely linked. Second, Wikipedia's outlinks behave similar to inlinks and both are good indicators of relevance, whereas on the Web the inlinks are more important. Third, when incorporating link evidence in the retrieval model, for Wikipedia the global link evidence fails and we have to take the local context into account.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/1498759.1498831
Permalink to this page
Back