Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions
| Authors | |
|---|---|
| Publication date | 2017 |
| Book title | 2017 IEEE International Conference on Computer Vision : ICCV 2017 |
| Book subtitle | proceedings : 22-29 October 2017, Venice, Italy |
| ISBN |
|
| ISBN (electronic) |
|
| Event | 2017 IEEE International Conference on Computer Vision |
| Pages (from-to) | 4453-4462 |
| Publisher | Los Alamitos, California: IEEE Computer Society |
| Organisations |
|
| Abstract |
We aim for zero-shot localization and classification of human actions in video. Where traditional approaches rely on global attribute or object classification scores for their zero-shot knowledge transfer, our main contribution is a spatial-aware object embedding. To arrive at spatial awareness, we build our embedding on top of freely available actor and object detectors. Relevance of objects is determined in a word embedding space and further enforced with estimated spatial preferences. Besides local object awareness, we also embed global object awareness into our embedding to maximize actor and object interaction. Finally, we exploit the object positions and sizes in the spatial-aware embedding to demonstrate a new spatiotemporal action retrieval scenario with composite queries. Action localization and classification experiments on four contemporary action video datasets support our proposal. Apart from state-of-the-art results in the zero-shot localization and classification settings, our spatial-aware embedding is even competitive with recent supervised action localization alternatives.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1109/ICCV.2017.476 |
| Other links | https://ivi.fnwi.uva.nl/isis/publications/2017/MettesICCV2017 |
| Downloads |
Mettes_Spatial-Aware_Object_Embeddings_ICCV_2017_paper
(Accepted author manuscript)
Spatial-Aware Object Embeddings
(Final published version)
|
| Permalink to this page | |
