论文标题
DDM-NET:关键点功能检测,描述和3D本地化匹配的端到端学习
DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization
论文作者
论文摘要
在本文中,我们提出了一个端到端框架,该框架共同学习基于图像的3D本地化任务的关键点检测,描述符表示和跨框架匹配。先前的艺术已经分别解决了这些组件中的每一个,据称是为了减轻有效培训整体网络的困难。我们为特征检测和匹配设计了一个自我监督的图像翘曲对应损失,相对摄像机姿势学习的弱监督的外两极约束损失以及一个定向匹配方案,该方案检测源图像中的关键点特征并在目标图像上执行粗到细的对应关系搜索。我们利用此框架在我们的匹配模块中执行周期一致性。此外,我们提出了新的损失,以稳健地处理确定的内部/离群匹配和不太确定的匹配。这些学习机制的集成使单个网络的端到端培训执行所有三个本地化组件。板凳标记我们在公共数据集上的方法,说明了这种端到端框架如何产生更准确的本地化,从而超过传统方法以及最先进的弱监督方法。
In this paper, we propose an end-to-end framework that jointly learns keypoint detection, descriptor representation and cross-frame matching for the task of image-based 3D localization. Prior art has tackled each of these components individually, purportedly aiming to alleviate difficulties in effectively train a holistic network. We design a self-supervised image warping correspondence loss for both feature detection and matching, a weakly-supervised epipolar constraints loss on relative camera pose learning, and a directional matching scheme that detects key-point features in a source image and performs coarse-to-fine correspondence search on the target image. We leverage this framework to enforce cycle consistency in our matching module. In addition, we propose a new loss to robustly handle both definite inlier/outlier matches and less-certain matches. The integration of these learning mechanisms enables end-to-end training of a single network performing all three localization components. Bench-marking our approach on public data-sets, exemplifies how such an end-to-end framework is able to yield more accurate localization that out-performs both traditional methods as well as state-of-the-art weakly supervised methods.