Querying for Video Events by Semantic Signatures from Few Examples
| Authors |
|
|---|---|
| Publication date | 2013 |
| Book title | MM '13 |
| Book subtitle | proceedings of the 2013 ACM Multimedia Conference : October 21-25, 2013, Barcelona, Spain |
| ISBN |
|
| Event | 2013 ACM Multimedia Conference |
| Volume | Issue number | 2 |
| Pages (from-to) | 609-612 |
| Publisher | New York: ACM |
| Organisations |
|
| Abstract |
We aim to query web video for complex events using only a handful of video query examples, where the standard approach learns a ranker from hundreds of examples. We consider a semantic signature representation, consisting of off-the-shelf concept detectors, to capture the variance in semantic appearance of events. Since it is unknown what similarity metric and query fusion to use in such an event retrieval setting, we perform three experiments on unconstrained web videos from the TRECVID event detection task. It reveals that: retrieval with semantic signatures using normalized correlation as similarity metric outperforms a low-level bag-of-words alternative, multiple queries are best combined using late fusion with an average operator, and event retrieval is preferred over event classification when less than eight positive video examples are available.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1145/2502081.2502160 |
| Other links | http://www.science.uva.nl/research/publications/2013/MazloomICM2013 |
| Permalink to this page | |
