Web directories as topical context
| Authors | |
|---|---|
| Publication date | 2009 |
| Host editors |
|
| Book title | Proceedings of the 9th Dutch-Belgian Information Retrieval Workshop (DIR 2009): February 2-3, 2009, Enschede, the Netherlands |
| Event | 9th Dutch-Belgian Information Retrieval Workshop (DIR 2009), Enschede, The Netherlands |
| Pages (from-to) | 71-78 |
| Publisher | Enschede: University of Twente, Centre for Telematics and Information Technology (CTIT) |
| Organisations |
|
| Abstract |
In this paper we explore whether the Open Directory (or DMOZ) can be used to classify queries into topical categories on different levels and whether we can use this topical context to improve retrieval performance. We have set up a user study to let test persons explicitly classify queries into topical categories. Categories are either chosen freely from DMOZ, or from a list of suggestions created by several automatic topic categorization techniques. The results of this user study show that DMOZ categories are suitable for topic categorization. Either free search or evaluation of a list of suggestions can be used to elicit the topical context. Free search leads to more specific topic categories than the list of suggestions. Different test persons show moderate agreement between their individual judgments, but broadly agree on the initial levels of the chosen categories. When we use the topic categories selected by the free search as topical context, this leads to significant improvements over the baseline retrieval results. The more general topic categories selected from the suggestions list, and top level categories do not lead to significant improvements.
|
| Document type | Conference contribution |
| Published at | http://dir2009.cs.utwente.nl/dir2009proceedings.pdf |
| Permalink to this page | |
