Paper title
One Shot 3D Photography
Paper authors
Abstract
3D photography is a new medium that allows viewers to more fully experience a captured moment. In this work, we refer to a 3D photo as one that displays parallax induced by moving the viewpoint (as opposed to a stereo pair with a fixed viewpoint). 3D photos are static in time, like traditional photos, but are displayed with interactive parallax on mobile or desktop screens, as well as on Virtual Reality devices, where viewing also includes stereo. We present an end-to-end system for creating and viewing 3D photos, and the algorithmic and design choices therein. Our 3D photos are captured in a single shot and processed directly on a mobile device. The method starts by estimating depth from the 2D input image using a new monocular depth estimation network that is optimized for mobile devices. It is competitive with the state of the art, but has lower latency and peak memory consumption and uses an order of magnitude fewer parameters. The resulting depth is lifted to a layered depth image (LDI), and new geometry is synthesized in parallax regions. We also synthesize color texture and structures in the parallax regions, using an inpainting network, likewise optimized for mobile devices, that operates directly on the LDI. Finally, we convert the result into a mesh-based representation that can be efficiently transmitted and rendered even on low-end devices and over poor network connections. Altogether, the processing takes just a few seconds on a mobile device, and the result can be instantly viewed and shared. We perform extensive quantitative evaluation to validate our system and compare its new components against the current state of the art.
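To make the step from an estimated depth map to renderable geometry more concrete, the following is a minimal sketch, not the authors' implementation. It back-projects each pixel with a pinhole camera model and joins neighboring pixels into triangles, leaving a gap wherever adjacent depths differ strongly. Such gaps roughly correspond to the parallax regions for which the abstract says new geometry and texture are synthesized. The function name `depth_to_mesh`, the parameters `focal` and `max_ratio`, and the centered principal point are all assumptions made for illustration.

```python
from typing import List, Tuple

Vertex = Tuple[float, float, float]
Triangle = Tuple[int, ...]


def depth_to_mesh(
    depth: List[List[float]],
    focal: float,
    max_ratio: float = 1.5,
) -> Tuple[List[Vertex], List[Triangle]]:
    """Unproject a depth map to a triangle mesh, cutting at depth jumps.

    Hypothetical helper: `focal` (in pixels) and `max_ratio` are assumed
    parameters, and the principal point is assumed to be the image center.
    All depth values must be positive.
    """
    h, w = len(depth), len(depth[0])
    cx, cy = (w - 1) / 2.0, (h - 1) / 2.0

    # One vertex per pixel: pinhole back-projection X = (u - cx) * z / f.
    vertices = [
        ((u - cx) * depth[v][u] / focal, (v - cy) * depth[v][u] / focal, depth[v][u])
        for v in range(h)
        for u in range(w)
    ]

    def connected(pixels) -> bool:
        # Treat pixels as one surface if their depths stay within max_ratio.
        zs = [depth[v][u] for v, u in pixels]
        return max(zs) / min(zs) <= max_ratio

    triangles: List[Triangle] = []
    for v in range(h - 1):
        for u in range(w - 1):
            # Split each 2x2 pixel block into two triangles.
            for tri in (
                ((v, u), (v + 1, u), (v, u + 1)),
                ((v, u + 1), (v + 1, u), (v + 1, u + 1)),
            ):
                if connected(tri):
                    # Vertex index of pixel (row, col) is row * w + col.
                    triangles.append(tuple(r * w + c for r, c in tri))
    return vertices, triangles
```

For example, calling `depth_to_mesh([[1.0, 1.0, 4.0], [1.0, 1.0, 4.0]], focal=1.0)` keeps the two triangles on the near surface and drops the two that would bridge the jump from depth 1 to depth 4. A full system would fill that gap with synthesized content rather than leave a hole.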