Overview of the CLEF 2022 SimpleText Lab: Automatic Simplification of Scientific Texts

Open Access
Authors
  • L. Ermakova
  • E. SanJuan
  • J. Kamps ORCID logo
  • S. Huet
  • I. Ovchinnikova
  • D. Nurbakova
  • S. Araújo
  • R. Hannachi
  • E. Mathurin
  • P. Bellot
Publication date 2022
Host editors
  • A. Barrón-Cedeño
  • G. Da San Martino
  • M. Degli Esposti
  • F. Sebastiani
  • C. Macdonald
  • G. Pasi
  • A. Hanbury
  • M. Potthast
  • G. Faggioli
  • N. Ferro
Book title Experimental IR Meets Multilinguality, Multimodality, and Interaction
Book subtitle 13th International Conference of the CLEF Association, CLEF 2022, Bologna, Italy, September 5–8, 2022 : proceedings
ISBN
  • 9783031136429
  • 9783031136443
ISBN (electronic)
  • 9783031136436
Series Lecture Notes in Computer Science
Event 13th International Conference of the Cross-Language Evaluation Forum for European Languages, CLEF 2022
Pages (from-to) 470-494
Number of pages 25
Publisher Cham: Springer
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract

Although citizens agree on the importance of objective scientific information, yet they tend to avoid scientific literature due to access restrictions, its complex language or their lack of prior background knowledge. Instead, they rely on shallow information on the web or social media often published for commercial or political incentives rather than the correctness and informational value. This paper presents an overview of the CLEF 2022 SimpleText track addressing the challenges of text simplification approaches in the context of promoting scientific information access, by providing appropriate data and benchmarks, and creating a community of IR and NLP researchers working together to resolve one of the greatest challenges of today. The track provides a corpus of scientific literature abstracts and popular science requests. It features three tasks. First, content selection (what is in, or out?) challenges systems to select passages to include in a simplified summary in response to a query. Second, complexity spotting (what is unclear?) given a passage and a query, aims to rank terms/concepts that are required to be explained for understanding this passage (definitions, context, applications). Third, text simplification (rewrite this!) given a query, asks to simplify passages from scientific abstracts while preserving the main content.

Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-031-13643-6_28
Other links https://www.scopus.com/pages/publications/85136957710
Downloads
978-3-031-13643-6_28 (Final published version)
Permalink to this page
Back