A syntax-aware re-ranker for microblog retrieval

Authors
Publication date 2014
Book title SIGIR '14
Book subtitle proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval: July 6-11 2014, Gold Coast, Queensland, Australia
ISBN
  • 9781450322577
ISBN (electronic)
  • 9781450322591
Event SIGIR '14: 37th international ACM SIGIR conference on Research and development in information retrieval
Pages (from-to) 1067-1070
Publisher New York, NY: ACM
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We tackle the problem of improving microblog retrieval algorithms by proposing a robust structural representation of (query, tweet) pairs. We employ these structures in a principled kernel learning framework that automatically extracts and learns highly discriminative features. We test the generalization power of our approach on the TREC Microblog 2011 and 2012 tasks. We find that relational syntactic features generated by structural kernels are effective for learning to rank (L2R) and can easily be combined with those of other existing systems to boost their accuracy. In particular, the results show that our L2R approach improves on almost all the participating systems at TREC, only using their raw scores as a single feature. Our method yields an average increase of 5\% in retrieval effectiveness and 7 positions in system ranks.
Document type Conference contribution
Note Short paper
Language English
Published at https://doi.org/10.1145/2600428.2609511
Permalink to this page
Back