Scene disparity estimation with convolutional neural networks
21 June 2019
Abstract
Estimation of stereovision disparity maps is important for many applications that require information about objects’ position and geometry. For example, as a depth surrogate, disparity maps are essential for 3D shape reconstruction of objects and, indeed, for other applications that require a three-dimensional representation of a scene. Recently, deep learning (DL) methodology has enabled novel approaches to disparity estimation, with some focus on the real-time processing requirement that is critical for applications in robotics and autonomous navigation; previously, that constraint was not always addressed. Furthermore, for robust disparity estimation the occlusion effects should be explicitly modelled. In the described method, effective detection of occlusion regions is achieved through disparity estimation in both the forward and backward correspondence models, using two matching deep subnetworks. These two subnetworks are trained jointly in a single training process. Initially, the subnetworks are trained using simulated data with the known ground truth; then, to improve generalisation properties, the whole model is fine-tuned in an unsupervised fashion on real data. During the unsupervised training, the model is equipped with a bilinear interpolation warping function to directly measure the quality of the correspondence given the disparity maps estimated for both the left and right images. During this phase, a forward-backward consistency constraint loss function is also applied to regularise the disparity estimators for non-occluded pixels. The described network model computes, at the same time, the forward and backward disparity maps as well as the corresponding occlusion masks. It showed improved results on simulated and real images with occluded objects when compared with the results obtained without using the forward-backward consistency constraint loss function.
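
The unsupervised phase described above rests on two ingredients: a bilinear interpolation warping function that reconstructs one view from the other using the estimated disparity, and a forward-backward (left-right) consistency constraint applied to non-occluded pixels. The following minimal sketch illustrates how such terms are commonly implemented; it is not the authors' code, and the PyTorch framework, function names, disparity sign convention and masking details are all assumptions.

import torch
import torch.nn.functional as F


def warp_with_disparity(src: torch.Tensor, disp: torch.Tensor) -> torch.Tensor:
    """Bilinearly warp src (N, C, H, W) into the other view using a
    horizontal disparity map disp (N, 1, H, W) given in pixels."""
    n, _, h, w = src.shape
    # Base sampling grid in normalised [-1, 1] coordinates.
    ys, xs = torch.meshgrid(
        torch.linspace(-1.0, 1.0, h, device=src.device),
        torch.linspace(-1.0, 1.0, w, device=src.device),
        indexing="ij",
    )
    grid = torch.stack((xs, ys), dim=-1).unsqueeze(0).expand(n, -1, -1, -1).clone()
    # Shift the x-coordinate by the disparity, converted to normalised units
    # (assumed sign convention: left pixel x corresponds to right pixel x - d).
    grid[..., 0] = grid[..., 0] - 2.0 * disp.squeeze(1) / max(w - 1, 1)
    return F.grid_sample(src, grid, mode="bilinear", padding_mode="border",
                         align_corners=True)


def photometric_loss(left, right, disp_left):
    # Reconstruct the left image by warping the right image with the left
    # disparity, then penalise the per-pixel reconstruction error.
    left_rec = warp_with_disparity(right, disp_left)
    return (left - left_rec).abs().mean()


def lr_consistency_loss(disp_left, disp_right, visibility_mask_left):
    # Forward-backward consistency: the left disparity should agree with the
    # right disparity warped into the left view, enforced only where the
    # (predicted) mask marks pixels as visible in both views.
    disp_right_in_left = warp_with_disparity(disp_right, disp_left)
    diff = (disp_left - disp_right_in_left).abs()
    return (diff * visibility_mask_left).sum() / visibility_mask_left.sum().clamp(min=1.0)

In training, such terms (together with their mirrored right-view counterparts and, in the supervised phase, a ground-truth disparity loss on simulated data) would be weighted and summed into the overall objective; the exact weighting and the way the occlusion masks gate the losses are not specified in the abstract.
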
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Essa R. Anas, Li Guo, Ahmed Onsy, and Bogdan J. Matuszewski "Scene disparity estimation with convolutional neural networks", Proc. SPIE 11059, Multimodal Sensing: Technologies and Applications, 110590T (21 June 2019); https://doi.org/10.1117/12.2527628
KEYWORDS
Data modeling
RGB color model
Network architectures
Machine learning
Convolutional neural networks
3D modeling
Cameras