Using centrality to rank web snippets

Authors
Publication date 2008
Host editors
  • C. Peters
  • V. Jijkoun
  • T. Mandl
  • H. Müller
  • D.W. Oard
  • A. Peñas
  • V. Petras
  • D. Santos
Book title Advances in multilingual and multimodal information retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007: Revised selected papers
ISBN
  • 9783540857594
Series Lecture Notes in Computer Science, 5152
Pages (from-to) 737-741
Number of pages 922
Publisher Berlin: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We describe our participation in the WebCLEF 2007 task, targeted at snippet retrieval from web data. Our system ranks snippets based on a simple similarity-based centrality, inspired by the web page ranking algorithms. We experimented with retrieval units (sentences and paragraphs) and with the similarity functions used for centrality computations (word overlap and cosine similarity). We found that using paragraphs with the cosine similarity function shows the best performance with precision around 20% and recall around 25% according to human assessments of the first 7,000 bytes of responses for individual topics.
Document type Chapter
Note Gebeurtenis: 8th Workshop of the Cross-Language Evaluation Forum (CLEF 2007), Budapest, Hungary, September 19-21, 2007
Published at https://doi.org/10.1007/978-3-540-85760-0_94
Permalink to this page
Back