Adaptation of Statistical Machine Translation Model for Cross-Lingual Information Retrieval in a Service Context

Open Access
Authors
  • V. Nikoulina
  • B. Kovachev
  • N. Lagos
  • C. Monz
Publication date 2012
Host editors
  • W. Daelemans
Book title EACL 2012: 13th Conference of the European Chapter of the Association for Computational Linguistics
Book subtitle proceedings of the conference : April 23-27 2012, Avignon France
ISBN
  • 9781937284190
Event EACL 2012: 13th Conference of the European Chapter of the Association for Computational Linguistics
Pages (from-to) 109-119
Publisher Stroudsburg, PA: Association for Computational Linguistics
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
This work proposes adapting an existing general SMT model to the task of translating queries that are subsequently used to retrieve information from a target-language collection. In the scenario we focus on, access to the document collection itself is not available and changes to the IR model are not possible. We propose two ways to achieve the adaptation effect, both aimed at tuning parameter weights on a set of parallel queries. The first uses a standard tuning procedure optimizing for BLEU score; the second uses a reranking approach optimizing for MAP score. We also extend the second approach with syntax-based features. Our experiments show improvements of 1-2.5 in terms of MAP score over retrieval with the non-adapted translation. We show that these improvements are due both to the adaptation and to the integration of syntax-based features for the query translation task.
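As an illustration of the quantities the abstract refers to, the following is a minimal sketch of how MAP can be computed over a set of queries and how a weighted feature score can pick one translation from an n-best list. It is not the authors' implementation: the function names, the feature names ("tm", "lm") and the data layout are hypothetical, and the retrieval step that would turn a translated query into a ranked list is left out.

```python
from typing import Dict, List, Sequence, Set


def average_precision(ranked_doc_ids: Sequence[str], relevant: Set[str]) -> float:
    """Average precision of one ranked result list against the relevant set."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_doc_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of the per-query average precision over all queries in `runs`."""
    if not runs:
        return 0.0
    total = sum(
        average_precision(ranking, qrels.get(query_id, set()))
        for query_id, ranking in runs.items()
    )
    return total / len(runs)


def rerank(candidates: List[dict], weights: Dict[str, float]) -> dict:
    """Return the n-best candidate with the highest weighted feature score.

    Each candidate is assumed (hypothetically) to look like
    {"text": "...", "features": {"tm": -1.2, "lm": -3.4}}.
    """
    def score(candidate: dict) -> float:
        return sum(
            weights.get(name, 0.0) * value
            for name, value in candidate["features"].items()
        )

    return max(candidates, key=score)
```

In a reranking setup of this kind, the weights would be tuned so that the selected translations yield a higher MAP on the parallel query set; the tuning procedure itself is not shown here.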
Document type Conference contribution
Language English
Published at
  • http://www.aclweb.org/anthology/E/E12/E12-1002.pdf
  • http://dl.acm.org/citation.cfm?id=2380832
Downloads
381122 (Final published version)