A syntactified direct translation model with linear-time decoding

H. Hassan; K. Sima'an; A. Way

A syntactified direct translation model with linear-time decoding

Authors	H. Hassan K. Sima'an A. Way
Publication date	2009
Book title	Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP 2009): Volume 3
ISBN	9781932432633
Event	2009 Conference on Empirical Methods in Natural Language Processing (EMNLP 2009), Singapore
Pages (from-to)	1182-1191
Publisher	Morristown, NJ: Association for Computational Linguistics (ACL)
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	Recent syntactic extensions of statistical translation models work with a synchronous context-free or tree-substitution grammar extracted from an automatically parsed parallel corpus. The decoders accompanying these extensions typically exceed quadratic time complexity. This paper extends the Direct Translation Model 2 (DTM2) with syntax while maintaining linear-time decoding. We employ a linear-time parsing algorithm based on an eager, incremental interpretation of Combinatory Categorial Grammar (CCG). As every input word is processed, the local parsing decisions resolve ambiguity eagerly, by selecting a single supertag-operator pair for extending the dependency parse incrementally. Alongside translation features extracted from the derived parse tree, we explore syntactic features extracted from the incremental derivation process. Our empirical experiments show that our model significantly outperforms the state-of-the art DTM2 system.
Document type	Conference contribution
Published at	http://portal.acm.org/citation.cfm?id=1699664
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

A syntactified direct translation model with linear-time decoding