A Stochastic Decoder for Neural Machine Translation

Open Access
Authors
Publication date 2018
Host editors
  • I. Gurevych
  • Y. Miyao
Book title ACL 2018 : The 56th Annual Meeting of the Association for Computational Linguistics
Book subtitle proceedings of the conference : July 15-20, 2018, Melbourne, Australia
ISBN (electronic)
  • 9781948087322
Event 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018
Volume | Issue number 1
Pages (from-to) 1243-1252
Publisher Stroudsburg, PA: The Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
The process of translation is ambiguous, in that there are typically many valid translations for a given sentence. This gives rise to significant variation in parallel corpora, however, most current models of machine translation do not account for this variation, instead treating the problem as a deterministic process. To this end, we present a deep generative model of machine translation which incorporates a chain of latent variables, in order to account for local lexical and syntactic variation in parallel corpora. We provide an in-depth analysis of the pitfalls encountered in variational inference for training deep generative models. Experiments on several different language pairs demonstrate that the model consistently improves over strong baselines.
Document type Conference contribution
Note With supplementary notes.
Language English
Published at https://doi.org/10.18653/v1/P18-1115
Other links https://vimeo.com/288152820
Downloads
P18-1115 (Final published version)
Supplementary materials
Permalink to this page
Back