Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models

S. Rajaee; C. Monz

doi:https://doi.org/10.18653/v1/2024.eacl-long.177

Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models

Authors	S. Rajaee C. Monz
Publication date	2024
Host editors	Y. Graham M. Purver
Book title	The 18th Conference of the European Chapter of the Association for Computational Linguistics : Proceedings of the Conference
Book subtitle	EACL 2024 : March 17-22, 2024
ISBN (electronic)	9798891760882
Event	18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024
Volume \| Issue number	1
Pages (from-to)	2895–2914
Publisher	Kerrville, TX: Association for Computational Linguistics
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Recent advances in training multilingual language models on large datasets seem to have shown promising results in knowledge transfer across languages and achieve high performance on downstream tasks. However, we question to what extent the current evaluation benchmarks and setups accurately measure zero-shot cross-lingual knowledge transfer. In this work, we challenge the assumption that high zero-shot performance on target tasks reflects high cross-lingual ability by introducing more challenging setups involving instances with multiple languages. Through extensive experiments and analysis, we show that the observed high performance of multilingual models can be largely attributed to factors not requiring the transfer of actual linguistic knowledge, such as task- and surface-level knowledge. More specifically, we observe what has been transferred across languages is mostly data artifacts and biases, especially for low-resource languages. Our findings highlight the overlooked drawbacks of existing cross-lingual test data and evaluation setups, calling for a more nuanced understanding of the cross-lingual capabilities of multilingual models.
Document type	Conference contribution
Note	With supplementary video
Language	English
Published at	https://doi.org/10.18653/v1/2024.eacl-long.177 (Final published version)
Other links	https://github.com/Sara-Rajaee/crosslingual-evaluation
Downloads	2024.eacl-long.177 (Final published version)
Supplementary materials	2024.eacl-long.177
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models