Querying for Video Events by Semantic Signatures from Few Examples

Authors
Publication date 2013
Book title MM '13
Book subtitle proceedings of the 2013 ACM Multimedia Conference : October 21-25, 2013, Barcelona, Spain
ISBN
  • 9781450324045
Event 2013 ACM Multimedia Conference
Volume | Issue number 2
Pages (from-to) 609-612
Publisher New York: ACM
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We aim to query web video for complex events using only a handful of video query examples, where the standard approach learns a ranker from hundreds of examples. We consider a semantic signature representation, consisting of off-the-shelf concept detectors, to capture the variance in semantic appearance of events. Since it is unknown what similarity metric and query fusion to use in such an event retrieval setting, we perform three experiments on unconstrained web videos from the TRECVID event detection task. It reveals that: retrieval with semantic signatures using normalized correlation as similarity metric outperforms a low-level bag-of-words alternative, multiple queries are best combined using late fusion with an average operator, and event retrieval is preferred over event classification when less than eight positive video examples are available.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2502081.2502160
Other links http://www.science.uva.nl/research/publications/2013/MazloomICM2013
Permalink to this page
Back