3D human pose estimation in video with temporal and spatial transformer

Sha Peng; Jiwei Hu

doi:10.1117/12.2681195

8 June 2023 3D human pose estimation in video with temporal and spatial transformer

Sha Peng, Jiwei Hu

Proceedings Volume 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023); 127070E (2023) https://doi.org/10.1117/12.2681195
Event: International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 2023, Changsha, China

Abstract

Previous works on 3D human pose estimation have concentrated on predicting the 3D pose of the human body from a single image, ignoring correlation between adjacent frames in video. We design a transformer network structure that can extract video temporal information, and enhance the accuracy of human pose prediction by encoding relative position with temporal fusion transformer structure to enhance local feature learning capability. On Human3.6M, we quantitatively and qualitatively analyze our method. Research suggests that our TSFormer achieves state-of-the-art performance.

Citation Download Citation

Sha Peng and Jiwei Hu "3D human pose estimation in video with temporal and spatial transformer", Proc. SPIE 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 127070E (8 June 2023); https://doi.org/10.1117/12.2681195

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
6 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Transformers

Video

Pose estimation

RELATED CONTENT

Joint misalignment aware bilateral detection network for human pose estimation...
Proceedings of SPIE (May 13 2024)

Fall detection of personnel working in deep pits of power...
Proceedings of SPIE (June 05 2024)

Adaptive spatial and temporal aggregation for table tennis shot recognition
Proceedings of SPIE (June 07 2023)

Visual tracking with confidence correction based on log-polar mapping
Proceedings of SPIE (June 08 2023)

Analysis of cell behavior in videos of fluorescence imagery using...
Proceedings of SPIE (January 01 1900)

A review of pedestrian pose recognition in cross passages in...
Proceedings of SPIE (November 08 2023)

Self supervised monocular depth and ego motion estimation for CT...
Proceedings of SPIE (March 29 2024)

Subscribe to Digital Library

Receive Erratum Email Alert

Keywords/Phrases

Search In:

Publication Years