11 October 2023 Efficient stereo matching using attention mechanism and edge optimization
Daliang Zhao, Kejian Liu, Zhen Zhang, Yinliang Song, Tao Peng, Yichun Tai, Zhijiang Zhang
Author Affiliations +
Abstract

In practical applications, to improve the real-time performance of end-to-end stereo matching networks, the existing methods build cost volume at low resolution. However, with detailed information missing in low-resolution features, it is difficult to get accurate disparity estimation results in weak texture regions. Besides, smooth L1 loss supervision also results in a loss of accuracy in disparity discontinuity areas. To solve these problems, we propose an efficient stereo-matching network based on multiple attention mechanisms and edge optimization, which can achieve high accuracy in a short time. The multi-scale attention module is applied to enhance the feature expression in detail regions. For weak texture areas, we construct a concatenation cost volume and a multi-level patch matching volume, which can be combined to improve the network’s attention to weak texture regions. In terms of edge optimization, we perform bimodal Laplace modeling of the sampled edge points’ disparity distribution and optimize the edge region of the initial disparity map using likelihood loss to obtain sharp edges. The experimental results show that, on the SceneFlow and KITTI datasets, the proposed network improves by 32% and 27% in accuracy compared with BGNet+.

© 2023 SPIE and IS&T
Daliang Zhao, Kejian Liu, Zhen Zhang, Yinliang Song, Tao Peng, Yichun Tai, and Zhijiang Zhang "Efficient stereo matching using attention mechanism and edge optimization," Journal of Electronic Imaging 32(5), 053030 (11 October 2023). https://doi.org/10.1117/1.JEI.32.5.053030
Received: 21 February 2023; Accepted: 10 August 2023; Published: 11 October 2023
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Education and training

Feature extraction

Convolution

Point clouds

Discontinuities

Visualization

Semantics

Back to Top