Extracting the Discussion Structure in Comments on News-Articles

Open Access
Authors
Publication date 2007
Book title WIDM '07: proceedings of the 9th annual ACM international workshop on Web information and data management
ISBN
  • 9781595938299
Event WIDM '07: 9th annual ACM international workshop on Web information and data management
Pages (from-to) 97-104
Publisher New York, NY: ACM
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Several on-line daily newspapers offer readers the opportunity to comment directly on articles. In the Netherlands this feature is used quite often, and the quality of the comments (both grammatically and content-wise) is surprisingly high. We develop techniques to collect, store, enrich, and analyze these comments. After giving a high-level overview of the Dutch 'commentosphere', we zoom in on extracting the discussion structure found in flat comment threads: people not only comment on the news article, they also comment heavily on other comments, so that the threads resemble discussion fora. We show how techniques from information retrieval, natural language processing, and machine learning can be used to extract the 'reacts-on' relation between comments with high precision and recall.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/1316902.1316919
Downloads
277158.pdf (Final published version)