Paper
16 October 2019 Depth information calculation method for unstructured objects based on deep neural network
Wei Hu, Jinjun Rao, Zhe Xu, Jinbo Chen, Tao Wang, Mei Liu, Jingtao Lei
Author Affiliations +
Proceedings Volume 11205, Seventh International Conference on Optical and Photonic Engineering (icOPEN 2019); 1120525 (2019) https://doi.org/10.1117/12.2542220
Event: Seventh International Conference on Optical and Photonic Engineering (icOPEN 2019), 2019, Phuket, Thailand
Abstract
Depth information perception of unstructured scene images is an important problem for applications using computer vision. This paper proposes a method based on deep learning combined with self-attention mechanism to reason the depth information of unstructured indoor targets, which effectively solves the problem of blurred image detail and insufficient layering in depth information reasoning in unstructured scenes. First, the deep learning-based encoder-decoder model is trained to learn the depth information of indoor scenes on large 3D datasets. The trained model has good results for general structured indoor scenes. Secondly, the soft self-attention mechanism is used to obtain the disparity information between the upper and lower sequences of the input image, by which the depth map obtained in the first step is corrected to enhance the accuracy of depth. Finally, in order to get clear objects with obvious boundaries in the depth response map, the nearest neighbor regression is used to correct the contour of the objects. The experimental results show that the proposed method has very good depth information reasoning ability for indoor unstructured scenes. Through depth information reasoning, the obtained objects have obvious texture structure, strong geometric features, clear contour edges and delicate layers, and also the misleading of deep information reasoning in reflective and highlight areas is eliminated.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Wei Hu, Jinjun Rao, Zhe Xu, Jinbo Chen, Tao Wang, Mei Liu, and Jingtao Lei "Depth information calculation method for unstructured objects based on deep neural network", Proc. SPIE 11205, Seventh International Conference on Optical and Photonic Engineering (icOPEN 2019), 1120525 (16 October 2019); https://doi.org/10.1117/12.2542220
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neural networks

Convolutional neural networks

Image analysis

3D image processing

Image processing

Video

RGB color model

Back to Top