Bilingual learning of multi-sense embeddings with discrete autoencoders

Open Access
Authors
Publication date 2016
Host editors
  • K. Knight
  • A. Nenkova
  • O. Rambow
Book title NAACL HLT 2016 : The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Book subtitle Proceedings of the Conference : June 12-17, 2016, San Diego, California, USA
ISBN
  • 9781941643914
Event 15th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016
Pages (from-to) 1346-1356
Number of pages 11
Publisher Stroudsburg, PA: The Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract

We present an approach to learning multi-sense word embeddings relying both on monolingual and bilingual information. Our model consists of an encoder, which uses monolingual and bilingual context (i.e. a parallel sentence) to choose a sense for a given word, and a decoder which predicts context words based on the chosen sense. The two components are estimated jointly. We observe that the word representations induced from bilingual data outperform the monolingual counterparts across a range of evaluation tasks, even though crosslingual information is not available at test time.

Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/N16-1160
Other links https://www.scopus.com/pages/publications/84994078748
Downloads
N16-1160 (Final published version)
Permalink to this page
Back