通过一致的结构估计，多相机协作深度预测

论文标题

通过一致的结构估计，多相机协作深度预测

Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation

论文作者

Xu, Jialei, Liu, Xianming, Bai, Yuanchao, Jiang, Junjun, Wang, Kaixuan, Chen, Xiaozhi, Ji, Xiangyang

论文摘要

来自图像的深度图估计是机器人系统中的重要任务。现有方法可以分为两组，包括多视图立体声和单眼深度估计。前者要求摄像机在相机之间具有较大的重叠区域和足够的基线，而后者则独立处理每个图像的后者几乎无法保证摄像机之间的结构一致性。在本文中，我们提出了一种新型的多相机协作深度预测方法，该方法不需要大的重叠领域，同时保持相机之间的结构一致性。具体而言，我们将深度估计作为深度基础的加权组合，其中权重通过由提议的一致性损失驱动的改进网络迭代更新。在迭代更新期间，比较了跨摄像头的深度估计结果，并且在基本配方的帮助下将重叠区域的信息传播到整个深度图。 DDAD和NUSCENES数据集的实验结果证明了我们方法的出色性能。

Depth map estimation from images is an important task in robotic systems. Existing methods can be categorized into two groups including multi-view stereo and monocular depth estimation. The former requires cameras to have large overlapping areas and sufficient baseline between cameras, while the latter that processes each image independently can hardly guarantee the structure consistency between cameras. In this paper, we propose a novel multi-camera collaborative depth prediction method that does not require large overlapping areas while maintaining structure consistency between cameras. Specifically, we formulate the depth estimation as a weighted combination of depth basis, in which the weights are updated iteratively by a refinement network driven by the proposed consistency loss. During the iterative update, the results of depth estimation are compared across cameras and the information of overlapping areas is propagated to the whole depth maps with the help of basis formulation. Experimental results on DDAD and NuScenes datasets demonstrate the superior performance of our method.

下载PDF全文

下载文献需遵守相关版权规定

论文标题