Who mentions whom? Recognizing political actors in proceedings

Open Access
Authors
Publication date 2020
Host editors
  • D. Fišer
  • M. Eskevich
  • F. de Jong
Book title Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse (ParlaCLARIN II)
Book subtitle proceedings : LREC 2020 workshop : Language Resources and Evaluation Conference : 11-16 May 2020
ISBN (electronic)
  • 9791095546474
Event Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse (ParlaCLARIN II)
Pages (from-to) 35-39
Number of pages 5
Publisher Paris: European Language Resources Association
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We show that it is straightforward to train a state-of-the-art named entity tagger (spaCy) to recognize political actors in Dutch parliamentary proceedings with high accuracy. The tagger was trained on 3.4K manually labeled examples, which were created in a modest 2.5 days of work. This resource is made available on GitHub. Besides proper nouns of persons and political parties, the tagger can recognize quite complex definite descriptions referring to cabinet ministers, ministries, and parliamentary committees. We also provide a demo search engine which employs the tagged entities in its search engine results page (SERP) and result summaries.
Document type Conference contribution
Language English
Published at
  • https://aclanthology.org/2020.parlaclarin-1.7
  • http://www.lrec-conf.org/proceedings/lrec2020/workshops/ParlaCLARIN2/pdf/2020.parlaclarin-1.7.pdf
Downloads
2020.parlaclarin-1.7-1 (Final published version)