Finding key bloggers, one post at a time

Open Access
Authors
Publication date 2008
Journal Frontiers in Artificial Intelligence and Applications
Event 18th European Conference on Artificial Intelligence (ECAI 2008), Patras, Greece
Volume | Issue number 178
Pages (from-to) 318-322
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
User generated content in general, and blogs in particular, form an interesting and relatively little explored domain for mining knowledge. We address the task of blog distillation: to find blogs that are principally devoted to a given topic, as opposed to blogs that merely happen to discuss the topic in passing. Working in the setting of statistical language modeling, we model the task by aggregating a blogger's blog posts to collect evidence of relevance to the topic and persistence of interest in the topic. This approach achieves state-of-the-art performance. On top of this baseline, we extend our model by incorporating a number of blog-specific features, concerning document structure, social structure, and temporal structure. These blog-specific features yield further improvements.
Document type Article
Note Proceedings title: ECAI 2008: 18th European Conference on Artificial Intelligence, July 21-25, 2008, Patras, Greece: Including Prestigious Applications of Intelligent Systems (PAIS 2008): Proceedings Publisher: IOS Press Place of publication: Amsterdam ISBN: 978-1-58603-891-5 Editors: M. Ghallab, C.D. Spyropoulos, N. Fakotakis, N. Avouris
Published at http://staff.science.uva.nl/~mdr/Publications/Files/ecai2008.pdf
Downloads
Permalink to this page
Back