On Semantic Similarity in Video Retrieval
| Authors |
|
|---|---|
| Publication date | 2021 |
| Book title | Proceedings, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Book subtitle | virtual, 9-25 June 2021 |
| ISBN |
|
| ISBN (electronic) |
|
| Series | CVPR |
| Event | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Pages (from-to) | 3649-3659 |
| Publisher | Los Alamitos, California: Conference Publishing Services, IEEE Computer Society |
| Organisations |
|
| Abstract |
Current video retrieval efforts all found their evaluation on an instance-based assumption, that only a single caption is relevant to a query video and vice versa. We demonstrate that this assumption results in performance comparisons often not indicative of models’ retrieval capabilities. We propose a move to semantic similarity video retrieval, where (i) multiple videos/captions can be deemed equally relevant, and their relative ranking does not affect a method’s reported performance and (ii) retrieved videos/captions are ranked by their similarity to a query. We propose several proxies to estimate semantic similarities in large-scale retrieval datasets, without additional annotations. Our analysis is performed on three commonly used video retrieval datasets (MSR-VTT, YouCook2 and EPIC-KITCHENS).
|
| Document type | Conference contribution |
| Note | With supplementary material |
| Language | English |
| Published at | https://doi.org/10.48550/arXiv.2103.10095 https://doi.org/10.1109/CVPR46437.2021.00365 |
| Published at | https://openaccess.thecvf.com/content/CVPR2021/html/Wray_On_Semantic_Similarity_in_Video_Retrieval_CVPR_2021_paper.html |
| Other links | https://www.proceedings.com/60773.html |
| Downloads |
2103.10095
(Accepted author manuscript)
|
| Supplementary materials | |
| Permalink to this page | |