With the rapid progress of artificial intelligence, gesture recognition based on deep learning plays an important role in the development of human-computer interaction. Although most object detection networks now recognise gesture actions with satisfactory accuracy, the training speed and results of many deep learning models are limited by the computational power of the platforms on which they run, such as low-end CPUs and GPUs. In addition, model size and complexity have a non-negligible impact on the subsequent deployment phase of an application. To address these problems, this paper adopts a gesture recognition and detection algorithm based on an improved YOLOv5. Part of the convolution modules are replaced with the Ghost module to reduce the number of parameters: C3 is transformed into C3Ghost, and Conv is given a parallel structure and transformed into GhostConv, which further reduces the amount of computation during training and accelerates the inference speed of the model, thereby improving training efficiency under limited hardware computing power. To counter the accuracy degradation that may accompany lightweighting, the CBAM attention mechanism is added to strengthen the network's ability to extract target features and improve detection accuracy. Finally, the α-IoU loss function is used in place of the CIoU loss function so that the model converges more quickly during training. Experimental comparisons show that, compared with the original algorithm, the improved YOLOv5s PRO has 46.2% fewer parameters, a 42.6% smaller model size, and 48.1% fewer GFLOPs, effectively balancing training speed and accuracy.
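The parameter savings from the Ghost substitution can be illustrated with a rough back-of-the-envelope count, comparing a standard convolution with a Ghost module that produces half of its output channels via cheap depthwise operations. The layer sizes below are hypothetical examples chosen for illustration, not the configuration used in this paper:

```python
# Rough parameter-count comparison: standard convolution vs. a Ghost
# module (GhostNet-style). Layer sizes are hypothetical, not the paper's.

def conv_params(c_in, c_out, k):
    """Parameters of a standard k x k convolution (bias terms ignored)."""
    return c_in * c_out * k * k

def ghost_params(c_in, c_out, k, dw_k=3, ratio=2):
    """Ghost module: a primary conv produces c_out/ratio intrinsic feature
    maps; cheap dw_k x dw_k depthwise convs generate the remaining maps."""
    intrinsic = c_out // ratio
    primary = c_in * intrinsic * k * k   # ordinary convolution
    cheap = intrinsic * dw_k * dw_k      # depthwise "ghost" feature maps
    return primary + cheap

standard = conv_params(128, 256, 3)   # 294912 parameters
ghost = ghost_params(128, 256, 3)     # 147456 + 1152 = 148608 parameters
print(standard, ghost, round(1 - ghost / standard, 3))
```

With a ratio of 2, roughly half the parameters of each replaced convolution are saved, which is of the same order as the ~46% overall reduction reported for the full model.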
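The α-IoU family generalises IoU-based losses through a power parameter α, which up-weights the gradients of high-overlap boxes and can speed up convergence. A minimal sketch of the core idea for axis-aligned boxes follows; α = 3 is the value commonly used in the α-IoU literature, and the box coordinates are hypothetical (the abstract does not state whether a plain α-IoU or an α-CIoU variant is used):

```python
# Minimal sketch of the alpha-IoU idea for axis-aligned boxes given as
# (x1, y1, x2, y2). alpha = 3 follows common usage; boxes are hypothetical.

def iou(a, b):
    """Intersection-over-union of two boxes (x1, y1, x2, y2)."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def alpha_iou_loss(a, b, alpha=3.0):
    """alpha-IoU loss: 1 - IoU**alpha. For alpha > 1, well-overlapping
    boxes receive relatively larger gradients, aiding convergence."""
    return 1.0 - iou(a, b) ** alpha

pred = (0.0, 0.0, 2.0, 2.0)   # hypothetical predicted box
gt = (1.0, 1.0, 3.0, 3.0)     # hypothetical ground-truth box
print(round(iou(pred, gt), 4), round(alpha_iou_loss(pred, gt), 4))
```

Setting α = 1 recovers the ordinary IoU loss, so the change is a drop-in replacement in the training objective.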