论文标题
从点到多对象3D重建
From Points to Multi-Object 3D Reconstruction
论文作者
论文摘要
我们提出了一种从单个RGB图像中检测和重建多个3D对象的方法。关键思想是在RGB图像中的所有对象上进行优化,以优化检测,对齐和形状,同时着重于现实且物理上合理的重建。为此,我们提出了一个将对象定位为中心点并直接预测所有对象属性的关键点检测器,包括9-DOF边界框和3D形状 - 全部在单个正向通道中。提出的方法将3D形状重建作为形状选择问题,即,它从给定数据库中选择了示例形状。这使得塑造表示形式的不可知论,从而使基于CAD模型的现实和视觉上的形状进行了轻巧的重建,而训练目标则围绕点云和体素表示。碰撞损失促进了非相互作用的对象,进一步增加了重建现实主义。鉴于RGB图像,呈现的方法在单个阶段执行轻量重建,它具有实时能力,完全可区分的,端到端可训练。我们的实验比较了用于9-DOF边界框估计的多种方法,评估新型形状选择机制,并根据3D边界盒估计和3D形状重建质量进行比较与最近的方法。
We propose a method to detect and reconstruct multiple 3D objects from a single RGB image. The key idea is to optimize for detection, alignment and shape jointly over all objects in the RGB image, while focusing on realistic and physically plausible reconstructions. To this end, we propose a keypoint detector that localizes objects as center points and directly predicts all object properties, including 9-DoF bounding boxes and 3D shapes -- all in a single forward pass. The proposed method formulates 3D shape reconstruction as a shape selection problem, i.e. it selects among exemplar shapes from a given database. This makes it agnostic to shape representations, which enables a lightweight reconstruction of realistic and visually-pleasing shapes based on CAD-models, while the training objective is formulated around point clouds and voxel representations. A collision-loss promotes non-intersecting objects, further increasing the reconstruction realism. Given the RGB image, the presented approach performs lightweight reconstruction in a single-stage, it is real-time capable, fully differentiable and end-to-end trainable. Our experiments compare multiple approaches for 9-DoF bounding box estimation, evaluate the novel shape-selection mechanism and compare to recent methods in terms of 3D bounding box estimation and 3D shape reconstruction quality.