UvA-MT at WMT25 Evaluation Task LLM Uncertainty as a Proxy for Translation Quality

Open Access
Authors
Publication date 2025
Host editors
  • Barry Haddow
  • Tom Kocmi
  • Philipp Koehn
  • Christof Monz
Book title Tenth Conference on Machine Translation : Proceedings of the Conference
Book subtitle WMT 2025 : November 8-9, 2025
ISBN (electronic)
  • 9798891763418
Event 10th Conference on Machine Translation, WMT 2025
Pages (from-to) 974-983
Number of pages 10
Publisher Kerrville, TX: Association for Computational Linguistics
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

This year, we focus exclusively on using the uncertainty quantification as a proxy for translation quality. While this has traditionally been regarded as a form of unsupervised quality estimation, such signals have been overlooked in the design of the current metric models-we show their value in the context of LLMs. More specifically, in contrast to conventional unsupervised QE methods, we apply recent calibration technology (Wu et al., 2025b) to adjust translation likelihoods to better align with quality signals, and we use the single resulting model to participate in both the general translation and QE tracks at WMT25. Our offline experiments show some advantages: 1) uncertainty signals extracted from LLMs, like Tower or Gemma-3, provide accurate quality predictions; and 2) calibration technology further improves this QE performance, sometimes even surpassing certain metric models that were trained with human annotations, such as CometKiwi. We therefore argue that uncertainty quantification (confidence), especially from LLMs, can serve as a strong and complementary signal for the metric design, particularly when human-annotated data are lacking. However, we also identify limitations, i.e., its tendency to assign disproportionately higher scores to hypotheses generated by the model itself.

Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/2025.wmt-1.72
Other links https://www.scopus.com/pages/publications/105028923196
Downloads
2025.wmt-1.72 (Final published version)
Permalink to this page
Back