LMIINet: long-range and multi-scale information interaction network for 3D object detection

Chengfeng Mai; Haosen Wang; Cui Wang; Bo Zhang; Sarath Kodagoda; Shifeng Wang

doi:10.1117/1.JEI.33.6.063034

28 November 2024 LMIINet: long-range and multi-scale information interaction network for 3D object detection

Chengfeng Mai, Haosen Wang, Cui Wang, Bo Zhang, Sarath Kodagoda, Shifeng Wang

Author Affiliations +

Journal of Electronic Imaging, Vol. 33, Issue 6, 063034 (November 2024). https://doi.org/10.1117/1.JEI.33.6.063034

Abstract

LiDAR-based 3D object detection is crucial for perception systems in autonomous driving. However, current methods perform poorly in detecting occluded and small objects due to the sparse and uneven distribution of point clouds in 3D scenes. To address this problem, we propose a long-range and multi-scale information interaction network (LMIINet) for 3D object detection. First, a feature extraction backbone with a spatial feature pyramid block is designed to effectively capture long-range dependencies between features. Then, multi-scale spatial features from the 3D backbone are adaptively fused at the neck to aggregate local and global contextual information. Finally, by fully utilizing the correlations between bounding box parameters, the proposed rotation-decoupled corrected intersection-over-union (RCIoU) loss is employed for the classification and regression supervision of bounding boxes to improve the one-stage point cloud detectors. LMIINet achieves competitive performance on small objects (pedestrians and cyclists) detection in the KITTI dataset. Compared with the benchmark network SECOND, LMIINet increases mAP3D/mAPBEV by 1.98%/2.09%, 12.25%/11.14%, and 6.44%/6.45% for the car, pedestrian, and cyclist classes, respectively.

Citation Download Citation

Chengfeng Mai, Haosen Wang, Cui Wang, Bo Zhang, Sarath Kodagoda, and Shifeng Wang "LMIINet: long-range and multi-scale information interaction network for 3D object detection," Journal of Electronic Imaging 33(6), 063034 (28 November 2024). https://doi.org/10.1117/1.JEI.33.6.063034

Received: 26 June 2024; Accepted: 6 November 2024; Published: 28 November 2024

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available