Paper
8 June 2023 3D human pose estimation in video with temporal and spatial transformer
Sha Peng, Jiwei Hu
Author Affiliations +
Proceedings Volume 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023); 127070E (2023) https://doi.org/10.1117/12.2681195
Event: International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 2023, Changsha, China
Abstract
Previous works on 3D human pose estimation have concentrated on predicting the 3D pose of the human body from a single image, ignoring correlation between adjacent frames in video. We design a transformer network structure that can extract video temporal information, and enhance the accuracy of human pose prediction by encoding relative position with temporal fusion transformer structure to enhance local feature learning capability. On Human3.6M, we quantitatively and qualitatively analyze our method. Research suggests that our TSFormer achieves state-of-the-art performance.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Sha Peng and Jiwei Hu "3D human pose estimation in video with temporal and spatial transformer", Proc. SPIE 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 127070E (8 June 2023); https://doi.org/10.1117/12.2681195
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Transformers

Video

Pose estimation

Back to Top