Querying for Video Events by Semantic Signatures from Few Examples

Authors	M. Mazloom A. Habibian C.G.M. Snoek
Publication date	2013
Book title	MM '13
Book subtitle	proceedings of the 2013 ACM Multimedia Conference : October 21-25, 2013, Barcelona, Spain
ISBN	9781450324045
Event	2013 ACM Multimedia Conference
Volume \| Issue number	2
Pages (from-to)	609-612
Publisher	New York: ACM
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	We aim to query web video for complex events using only a handful of video query examples, where the standard approach learns a ranker from hundreds of examples. We consider a semantic signature representation, consisting of off-the-shelf concept detectors, to capture the variance in semantic appearance of events. Since it is unknown what similarity metric and query fusion to use in such an event retrieval setting, we perform three experiments on unconstrained web videos from the TRECVID event detection task. It reveals that: retrieval with semantic signatures using normalized correlation as similarity metric outperforms a low-level bag-of-words alternative, multiple queries are best combined using late fusion with an average operator, and event retrieval is preferred over event classification when less than eight positive video examples are available.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1145/2502081.2502160
Other links	http://www.science.uva.nl/research/publications/2013/MazloomICM2013
Permalink to this page

Back

UvA-DARE