28 November 2024 LMIINet: long-range and multi-scale information interaction network for 3D object detection
Chengfeng Mai, Haosen Wang, Cui Wang, Bo Zhang, Sarath Kodagoda, Shifeng Wang
Author Affiliations +
Abstract

LiDAR-based 3D object detection is crucial for perception systems in autonomous driving. However, current methods perform poorly in detecting occluded and small objects due to the sparse and uneven distribution of point clouds in 3D scenes. To address this problem, we propose a long-range and multi-scale information interaction network (LMIINet) for 3D object detection. First, a feature extraction backbone with a spatial feature pyramid block is designed to effectively capture long-range dependencies between features. Then, multi-scale spatial features from the 3D backbone are adaptively fused at the neck to aggregate local and global contextual information. Finally, by fully utilizing the correlations between bounding box parameters, the proposed rotation-decoupled corrected intersection-over-union (RCIoU) loss is employed for the classification and regression supervision of bounding boxes to improve the one-stage point cloud detectors. LMIINet achieves competitive performance on small objects (pedestrians and cyclists) detection in the KITTI dataset. Compared with the benchmark network SECOND, LMIINet increases mAP3D/mAPBEV by 1.98%/2.09%, 12.25%/11.14%, and 6.44%/6.45% for the car, pedestrian, and cyclist classes, respectively.

© 2024 SPIE and IS&T
Chengfeng Mai, Haosen Wang, Cui Wang, Bo Zhang, Sarath Kodagoda, and Shifeng Wang "LMIINet: long-range and multi-scale information interaction network for 3D object detection," Journal of Electronic Imaging 33(6), 063034 (28 November 2024). https://doi.org/10.1117/1.JEI.33.6.063034
Received: 26 June 2024; Accepted: 6 November 2024; Published: 28 November 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
Back to Top