Paper
23 August 2024 YOLOv8 garbage occlusion detection algorithm combining dynamic convolution and attention mechanism
Junshen Zhang, Li Kang, Xuan Xie
Author Affiliations +
Proceedings Volume 13250, Fourth International Conference on Image Processing and Intelligent Control (IPIC 2024); 132502W (2024) https://doi.org/10.1117/12.3038524
Event: 4th International Conference on Image Processing and Intelligent Control (IPIC 2024), 2024, Kuala Lumpur, Malaysia
Abstract
Due to its inadequate capacity to extract visual features, the YOLOv8 algorithm exhibits low accuracy and is not robust enough to handle the issue of several types of garbage obscuring one another. In order to improve feature extraction performance, this article modifies the YOLOv8 network structure. Initially, the traditional convolution in the backbone network is replaced by full-dimensional dynamic convolution. Second, the C2f module in the neck network is replaced with the deformable convolution structure C2F-DCNV3 module. In order to more precisely identify the trash objects that obscure one another, the CBAM attention mechanism module is finally implemented. The experimental results show that the improved algorithm has higher precision and better robustness in dealing with different occlusion types. By introducing the full-dimensional dynamic convolutional module, deformable convolutional module and CBAM attention module, the accuracy is improved by 1.6 percentage points, 2.1 percentage points and 4.3 percentage points respectively, and the feature extraction ability of the model is improved.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Junshen Zhang, Li Kang, and Xuan Xie "YOLOv8 garbage occlusion detection algorithm combining dynamic convolution and attention mechanism", Proc. SPIE 13250, Fourth International Conference on Image Processing and Intelligent Control (IPIC 2024), 132502W (23 August 2024); https://doi.org/10.1117/12.3038524
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Feature extraction

Data modeling

Detection and tracking algorithms

Object detection

Deformation

Machine learning

Back to Top