通过自举辐射场反演的形状，姿势和外观

论文标题

通过自举辐射场反演的形状，姿势和外观

Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion

论文作者

Pavllo, Dario, Tan, David Joseph, Rakotosaona, Marie-Julie, Tombari, Federico

论文摘要

从单个角度，神经辐射场（NERF）和gan的结合在3D重建区域中代表了一个有希望的方向，这是因为它们有效地模拟了任意拓扑的能力。然而，该领域的最新工作主要集中在众所周知的确切地面真相姿势的合成数据集上，并且忽略了姿势估计，这对于某些下游应用（例如增强现实（AR）和机器人技术）很重要。我们为自然图像介绍了一个原则上的端到端重建框架，在该框架上没有准确的地面真相姿势。我们的方法从对象的单个图像中恢复了SDF参数化的3D形状，姿势和外观，而无需在训练过程中利用多个视图。更具体地说，我们利用了无条件的3D感知发电机，我们应用了混合反演方案，其中模型会产生解决方案的第一个猜测，然后通过优化对此进行完善。我们的框架可以在只有10个步骤中删除图像，从而在实际情况下使用它。我们在各种真实和合成的基准下展示了最先进的结果。

Neural Radiance Fields (NeRF) coupled with GANs represent a promising direction in the area of 3D reconstruction from a single view, owing to their ability to efficiently model arbitrary topologies. Recent work in this area, however, has mostly focused on synthetic datasets where exact ground-truth poses are known, and has overlooked pose estimation, which is important for certain downstream applications such as augmented reality (AR) and robotics. We introduce a principled end-to-end reconstruction framework for natural images, where accurate ground-truth poses are not available. Our approach recovers an SDF-parameterized 3D shape, pose, and appearance from a single image of an object, without exploiting multiple views during training. More specifically, we leverage an unconditional 3D-aware generator, to which we apply a hybrid inversion scheme where a model produces a first guess of the solution which is then refined via optimization. Our framework can de-render an image in as few as 10 steps, enabling its use in practical scenarios. We demonstrate state-of-the-art results on a variety of real and synthetic benchmarks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题