Using centrality to rank web snippets
| Authors | |
|---|---|
| Publication date | 2008 |
| Host editors |
|
| Book title | Advances in multilingual and multimodal information retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007: Revised selected papers |
| ISBN |
|
| Series | Lecture Notes in Computer Science, 5152 |
| Pages (from-to) | 737-741 |
| Number of pages | 922 |
| Publisher | Berlin: Springer |
| Organisations |
|
| Abstract |
We describe our participation in the WebCLEF 2007 task, targeted at snippet retrieval from web data. Our system ranks snippets based on a simple similarity-based centrality, inspired by the web page ranking algorithms. We experimented with retrieval units (sentences and paragraphs) and with the similarity functions used for centrality computations (word overlap and cosine similarity). We found that using paragraphs with the cosine similarity function shows the best performance with precision around 20% and recall around 25% according to human assessments of the first 7,000 bytes of responses for individual topics.
|
| Document type | Chapter |
| Note | Gebeurtenis: 8th Workshop of the Cross-Language Evaluation Forum (CLEF 2007), Budapest, Hungary, September 19-21, 2007 |
| Published at | https://doi.org/10.1007/978-3-540-85760-0_94 |
| Permalink to this page | |
