Reordering Grammar induction

Authors
Publication date 2015
Host editors
  • L. Márquez
  • C. Callison-Burch
  • J. Su
Book title EMNLP 2015 Lisbon : conference proceedings
Book subtitle September 17-21 : Conference on Empirical Methods in Natural Language Processing
ISBN
  • 9781941643327
Event Conference on Empirical Methods in Natural Language Processing, EMNLP 2015
Pages (from-to) 44-54
Number of pages 11
Publisher Stroudsburg, PA: The Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract

We present a novel approach for unsupervised induction of a Reordering Grammar using a modified form of permutation trees (Zhang and Gildea, 2007), which we apply to preordering in phrase-based machine translation. Unlike previous approaches, we induce in one step both the hierarchical structure and the transduction function over it from word-aligned parallel corpora. Furthermore, our model (1) handles non-ITG reordering patterns (up to 5-ary branching), (2) is learned from all derivations by treating not only labeling but also bracketing as latent variables, (3) is entirely unlexicalized at the level of reordering rules, and (4) requires no linguistic annotation. Our model is evaluated both for accuracy in predicting target order and for its impact on translation quality. We report significant performance gains over phrase reordering, and over two known preordering baselines for English-Japanese.

Document type Conference contribution
Language English
Published at https://aclweb.org/anthology/D/D15/D15-1005.pdf
Other links https://www.scopus.com/pages/publications/84959896212