Good Applications for Crummy Entity Linkers? The Case of Corpus Selection in Digital Humanities

Open Access
Authors
Publication date 2017
Host editors
  • R. Hoekstra
  • C. Faron-Zucker
  • T. Pellegrini
  • V. de Boer
Book title Proceedings of the 13th International Conference on Semantic Systems
Book subtitle 12th-13th of September 2017, Amsterdam, the Netherlands
ISBN (electronic)
  • 9781450352963
Series ACM International Conference Proceedings Series
Event International Conference on Semantic Systems 2017
Pages (from-to) 81-88
Number of pages 8
Publisher New York: The Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Science (FNWI)
  • Faculty of Humanities (FGw)
Abstract
Over the last decade we have made great progress in entity linking (EL) systems, but performance may vary depending on the context and, arguably, there are even principled limitations preventing a "perfect" EL system. This also suggests that there may be applications for which current "imperfect" EL is already very useful, and makes finding the "right" application as important as building the "right" EL system. We investigate the Digital Humanities use case, where scholars spend a considerable amount of time selecting relevant source texts. We developed WideNet; a semantically-enhanced search tool which leverages the strengths of (imperfect) EL without getting in the way of its expert users. We evaluate this tool in two historical case-studies aiming to collect a set of references to historical periods in parliamentary debates from the last two decades; the first targeted the Dutch Golden Age, and the second World War II. The case-studies conclude with a critical reflection on the utility of WideNet for this kind of research, after which we outline how such a real-world application can help to improve EL technology in general.
Document type Conference contribution
Language English
Related publication Riches of the Poor: Using Crummy Entity Linkers for Interactive Search in Digital Humanities
Published at https://doi.org/10.1145/3132218.3132237
Published at https://arxiv.org/abs/1708.01162
Downloads
1708.01162 (Accepted author manuscript)
p81-olieman (Final published version)
Permalink to this page
Back