Efficiently Fusing Sparse LiDAR for Enhanced Self-Supervised Monocular Depth Estimation

Abstract

Self-supervised monocular depth estimation with a low-cost sensor is the mainstream approach to obtaining dense depth maps for robotics and autonomous driving. In this paper, following the philosophy “less is more” (i.e., focusing only on the valid pixels in sparse LiDAR), we propose a novel framework, Efficient Sparse Depth (EffisDepth), for predicting dense depth. The Sparse Feature Extractor (SFE) embedded in the framework handles sparse LiDAR input by forming sparse tensors. The Slender Group Block (SGB), the main building block of SFE, extracts features from these sparse tensors through a two-branch structure. Extensive experiments show that our method achieves state-of-the-art performance on the KITTI benchmark and demonstrate the effectiveness of each proposed component and of the self-supervised learning framework.
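
To illustrate the idea of focusing only on valid pixels, here is a minimal sketch of turning a sparse LiDAR depth map into a coordinate list plus a value list, the usual ingredients of a sparse tensor. It is not the authors' implementation: the function name `to_sparse`, the toy `depth_map`, and the convention that a depth of 0 marks a pixel with no LiDAR return are all assumptions made for this example.

```python
from typing import List, Tuple


def to_sparse(depth: List[List[float]]) -> Tuple[List[Tuple[int, int]], List[float]]:
    """Keep only pixels that carry a LiDAR measurement.

    Assumption for this sketch: a depth of 0 means "no return" at that pixel.
    """
    coords: List[Tuple[int, int]] = []  # (row, col) of each valid pixel
    feats: List[float] = []             # depth value at that pixel
    for row, line in enumerate(depth):
        for col, value in enumerate(line):
            if value > 0:
                coords.append((row, col))
                feats.append(value)
    return coords, feats


# Toy 3x4 depth map (hypothetical values, in metres) with three valid pixels.
depth_map = [
    [0.0, 0.0, 12.5, 0.0],
    [0.0, 7.2, 0.0, 0.0],
    [0.0, 0.0, 0.0, 30.1],
]
coords, feats = to_sparse(depth_map)
```

Only the three valid pixels are kept out of twelve, so any later processing that works on `coords` and `feats` scales with the number of LiDAR returns rather than with the image size.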

Publication
ICASSP22

Mingrong Gong
Graduate student at SIAT, University of Chinese Academy of Sciences.

My research interests mainly include trustworthy AI, machine learning (especially domain generalization), and reinforcement learning.