Using Collostructional Analysis to evaluate BERT's representation of linguistic constructions

Open Access
Authors
Publication date 2023
Host editors
  • A. Rogers
  • J. Boyd-Graber
  • N. Okazaki
Book title Findings of the Association for Computational Linguistics: ACL 2023
Book subtitle July 9-14, 2023
ISBN (electronic)
  • 9781959429623
Event 61st Annual Meeting of the Association for Computational Linguistics
Pages (from-to) 12937–12951
Publisher Stroudsburg, PA: Association for Computational Linguistics
Organisations
  • Faculty of Humanities (FGw)
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
Collostructional analysis is a technique devised to find correlations between particular words and linguistic constructions in order to analyse meaning associations of these constructions. Contrasting collostructional analysis results with output from BERT might provide insights into the way BERT represents the meaning of linguistic constructions. This study tests to what extent English BERT’s meaning representations correspond to known constructions from the linguistics literature by means of two tasks that we propose. Firstly, by predicting the words that can be used in open slots of constructions, the meaning associations of more lexicalized constructions can be observed. Secondly, by finding similar sequences using BERT’s output embeddings and manually reviewing the resulting sentences, we can observe whether instances of less lexicalized constructions are clustered together in semantic space. These two methods show that BERT represents constructional meaning to a certain extent, but does not separate instances of a construction from a near-synonymous construction that has a different form.
Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/2023.findings-acl.819
Downloads
2023.findings-acl.819 (Final published version)
Permalink to this page
Back