Modeling morphosyntactic agreement in constituency-based parsing of modern Hebrew

Authors
Publication date 2010
Book title Proceedings of the first workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010) at NAACL HLT 2010, Los Angeles, CA
Event First workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010) at NAACL HLT 2010, Los Angeles, CA
Pages (from-to) 40-48
Publisher Stroudsburg, PA: Association for Computational Linguistics (ACL)
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
We show that naïve modeling of morphosyn-tactic agreement in a Constituency-Based (CB) statistical parsing model is worse than none, whereas a linguistically adequate way of modeling inflectional morphology in CB parsing leads to improved performance. In particular, we show that an extension of the Relational-Realizational (RR) model that incorporates agreement features is superior to CB models that treat morphosyntax as state-splits (SP), and that the RR model benefits more from inflectional features. We focus on parsing Hebrew and report the best result to date, F184.13 for parsing off of gold-tagged text, 5% error reduction from previous results.
Document type Conference contribution
Language English
Published at http://portal.acm.org/citation.cfm?id=1868771.1868776
Permalink to this page
Back