Empirical evidence for discourse markers at the lexical level

Open Access
Authors
Publication date 2017
Host editors
  • M. Wieling
  • M. Kroon
  • G. van Noord
  • G. Bouma
Book title From Semantics to Dialectometry
Book subtitle Festschrift in honor of John Nerbonne
ISBN
  • 9781848902305
Series Tributes
Pages (from-to) 45-55
Publisher College Publications
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract
I use a discourse-annotated corpus to demonstrate a new method for identifying potential discourse makers. Discourse markers are often identified manually, but particularly for natural language processing purposes, it is useful to have a more objective, data-driven method of identification. I link this task to the task of identifying co-occurrences of words and constructions, a task where statistical association measures are often used to compute association strengths. I then apply a statistical association measure to the task of discourse marker identification, and present results for several discourse relation types. While the results are noisy due to the limited availability of corpus data, they appear usable after manual correction or as a feature in a classifier. Furthermore, the results highlight a few types of lexical discourse relation cues that are not traditionally considered discourse makers, but still have a clear association with particular discourse relation types.
Document type Chapter
Language English
Other links http://www.let.rug.nl/30years/festschrift/
Downloads
Permalink to this page
Back