Paper
26 May 2023 Video summarization with local and global attention
Ziyan Wang
Author Affiliations +
Proceedings Volume 12700, International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023); 127002L (2023) https://doi.org/10.1117/12.2682389
Event: International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023), 2023, Nanchang, China
Abstract
In this work, we propose a novel framework for video summarization called the Frame and Shot Network (FSNet). Unlike existing supervised video summarization methods that focus on the relationship between frames in videos, we discovered that the correlation between shots is equally significant. Our FSNet comprises of two separate modules, one for calculating frame correlations and the other for shot correlations, both of which incorporate an attention mechanism. To demonstrate the effectiveness of our method, we evaluated it on the SumMe and TVSum datasets. The results indicate that our FSNet outperforms previous state-of-the-art algorithms.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ziyan Wang "Video summarization with local and global attention", Proc. SPIE 12700, International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023), 127002L (26 May 2023); https://doi.org/10.1117/12.2682389
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Education and training

Video processing

Feature extraction

Matrices

Data modeling

Neural networks

Back to Top