Two-Way Parsimonious Classification Models for Evolving Hierarchies

Open Access
Authors
Publication date 2016
Host editors
  • N. Fuhr
  • P. Quaresma
  • T. Gonçalves
  • B. Larsen
  • K. Balog
  • C. Macdonald
  • L. Cappellato
  • N. Ferro
Book title Experimental IR Meets Multilinguality, Multimodality, and Interaction
Book subtitle 7th International Conference of the CLEF Association, CLEF 2016, Évora, Portugal, September 5-8, 2016: proceedings
ISBN
  • 9783319445632
ISBN (electronic)
  • 9783319445649
Series Lecture Notes in Computer Science
Event 7th International Conference of the CLEF Association
Pages (from-to) 69-82
Number of pages 14
Publisher Cham: Springer
Organisations
  • Faculty of Humanities (FGw)
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
There is an increasing volume of semantically annotated data available, in particular due to the emerging use of knowledge bases to annotate or classify dynamic data on the web. This is challenging, as these knowledge bases have a dynamic hierarchical or graph structure, demanding robustness against changes in the data structure over time. In general, this requires us to develop appropriate models for the hierarchical classes that capture all, and only, the essential solid features of the classes which remain valid even as the structure changes. We propose hierarchical significant words language models of textual objects in the intermediate levels of hierarchies as robust models for hierarchical classification by taking the hierarchical relations into consideration. We conduct extensive experiments on richly annotated parliamentary proceedings linking every speech to the respective speaker, their political party, and their role in the parliament. Our main findings are the following. First, we define hierarchical significant words language models as an iterative estimation process across the hierarchy, resulting in tiny models capturing only well-grounded text features at each level. Second, we apply the resulting models to party membership and party position classification across time periods, where the structure of the parliament changes, and see the models transfer dramatically better across time periods, relative to the baselines.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-319-44564-9_6
Downloads
Two-Way Parsimonious Classification Models (Final published version)