A discriminative syntactic model for source permutation via tree transduction

Authors
Publication date 2010
Host editors
  • D. Wu
Book title Proceedings of the Fourth Workshop on Syntax and Structure in Statistical Translation (SSST-4), Beijing, China
Event Fourth Workshop on Syntax and Structure in Statistical Translation (SSST-4), Beijing, China
Pages (from-to) 92-100
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
A major challenge in statistical machine translation is mitigating the word order differences between source and target strings. While reordering and lexical translation choices are often conducted in tandem, source string permutation prior to translation is attractive for studying reordering using hierarchical and syntactic structure. This work contributes an approach for learning source string permutation via transfer of the source syntax tree. We present a novel discriminative, probabilistic tree transduction model, and contribute a set of empirical upperbounds on translation performance for English-to-Dutch source string permutation under sequence and parse tree constraints. Finally, the translation performance of our learning model is shown to outperform the state-of-the-art phrase-based system significantly.
Document type Conference contribution
Language English
Published at http://www.mt-archive.info/SSST-2010-Khalilov.pdf
Permalink to this page
Back