Bag-of-Fragments: Selecting and encoding video fragments for event detection and recounting

Open Access
Authors
Publication date 2015
Book title ICMR'15: proceedings of the 2015 ACM International Conference on Multimedia Retrieval: June 23-26, 2015, Shanghai, China
ISBN
  • 9781450332743
Event 2015 ACM International Conference on Multimedia Retrieval
Pages (from-to) 427-434
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
The goal of this paper is event detection and recounting using a representation of concept detector scores. Different from existing work, which encodes videos by averaging concept scores over all frames, we propose to encode videos using fragments that are discriminatively learned per event. Our bag-of-fragments split a video into semantically coherent fragment proposals. From training video proposals we show how to select the most discriminative fragment for an event. An encoding of a video is in turn generated by matching and pooling these discriminative fragments to the fragment proposals of the video. The bag-of-fragments forms an effective encoding for event detection and is able to provide a precise temporally localized event recounting. Furthermore, we show how bag-of-fragments can be extended to deal with irrelevant concepts in the event recounting. Experiments on challenging web videos show that i) our modest number of fragment proposals give a high sub-event recall, ii) bag-of-fragments is complementary to global averaging and provides better event detection, iii) bag-of-fragments with concept filtering yields a desirable event recounting. We conclude that fragments matter for video event detection and recounting.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2671188.2749404
Downloads
2671188.2749404 (Final published version)
Permalink to this page
Back