Dance with Flow: Two-in-One Stream Action Detection
| Authors | |
|---|---|
| Publication date | 2019 |
| Book title | 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Book subtitle | proceedings : 16-20 June 2019, Long Beach, California |
| ISBN |
|
| ISBN (electronic) |
|
| Series | CVPR |
| Event | IEEE Conference on Computer Vision and Pattern Recognition |
| Pages (from-to) | 9927-9936 |
| Publisher | Los Alamitos, CA: IEEE Computer Society |
| Organisations |
|
| Abstract |
The goal of this paper is to detect the spatio-temporal extent of an action. The two-stream detection network based on RGB and flow provides state-of-the-art accuracy at the expense of a large model-size and heavy computation. We propose to embed RGB and optical-flow into a single two-in-one stream network with new layers. A motion condition layer extracts motion information from flow images, which is leveraged by the motion modulation layer to generate transformation parameters for modulating the low-level RGB features. The method is easily embedded in existing appearance- or two-stream action detection networks, and trained end-to-end. Experiments demonstrate that leveraging the motion condition to modulate RGB features improves detection accuracy. With only half the computation and parameters of the state-of-the-art two-stream methods, our two-in-one stream still achieves impressive results on UCF101-24, UCFSports and J-HMDB.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.48550/arXiv.1904.00696 https://doi.org/10.1109/CVPR.2019.01017 |
| Other links | https://ivi.fnwi.uva.nl/isis/publications/2019/ZhaoCVPR2019 http://www.proceedings.com/52034.html |
| Downloads |
ZhaoCVPR2019
(Accepted author manuscript)
08953720
(Final published version)
|
| Permalink to this page | |
