Causal video summarizer for video exploration
| Authors |
|
|---|---|
| Publication date | 2022 |
| Book title | ICME 2022 : conference proceedings |
| Book subtitle | IEEE International Conference on Multimedia and Expo 2022 |
| ISBN |
|
| ISBN (electronic) |
|
| Event | 2022 IEEE International Conference on Multimedia and Expo |
| Pages (from-to) | 349-354 |
| Number of pages | 6 |
| Publisher | Piscataway, NJ: IEEE |
| Organisations |
|
| Abstract |
Recently, video summarization has been proposed as a method to help video exploration. However, traditional video summarization models only generate a fixed video summary which is usually independent of user-specific needs and hence limits the effectiveness of video exploration. Multi-modal video summarization is one of the approaches utilized to address this issue. Multi-modal video summarization has a video input and a text-based query input. Hence, effective modeling of the interaction between a video input and text-based query is essential to multi-modal video summarization. In this work, a new causality-based method named Causal Video Summarizer (CVS) is proposed to effectively capture the interactive information between the video and query to tackle the task of multi-modal video summarization. The proposed method consists of a probabilistic encoder and a probabilistic decoder. Based on the evaluation of the existing multi-modal video summarization dataset, experimental results show that the proposed approach is effective with the increase of +5.4% in accuracy and +4.92% increase of F1-score, compared with the state-of-the-art method.
|
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1109/ICME52920.2022.9859948 |
| Other links | https://www.proceedings.com/65366.html |
| Permalink to this page | |
