机器人3D感知系统，用于手术室环境意识

论文标题

机器人3D感知系统，用于手术室环境意识

A Robotic 3D Perception System for Operating Room Environment Awareness

论文作者

Li, Zhaoshuo, Shaban, Amirreza, Simard, Jean-Gabriel, Rabindran, Dinesh, DiMaio, Simon, Mohareri, Omid

论文摘要

目的：我们描述了DA Vinci手术系统的3D多视觉感知系统，以使手术室（或）场景理解和上下文意识。方法：我们提出的系统由僵硬地附着在Davinci XI患者侧购物车（PSC）上的战略位置的四个飞行时间（TOF）摄像机组成。通过执行一次性校准例程，将相机注册到机器人的运动链中，因此，所有相机的信息都可以融合并在一个共同的坐标框架中融合并表示。基于此体系结构，创建了多视图3D场景语义分割算法，以允许在DA Vinci或Da Vinci中识别常见和显着的对象/设备和外科活动。我们提出的3D语义分割方法已在临床场景中捕获的新型注释数据集上进行了培训和验证。结果：结果表明，我们提出的架构具有可接受的注册误差（$ 3.3 \％\ pm1.4 \％\％$ $的对象相机距离），并且与单个观看方法相比，对于较少出现的类别（$ \ ge ge 0.013 $），较少经常出现的类别的场景细分性能（平均交叉点）（MIOU的平均交叉点）。结论：我们介绍了具有新颖的分割体系结构的第一个动态多视觉感知系统，该系统可用作外科工作流程分析，外科手术子任务和高级指导系统等应用程序的基础技术。

Purpose: We describe a 3D multi-view perception system for the da Vinci surgical system to enable Operating room (OR) scene understanding and context awareness. Methods: Our proposed system is comprised of four Time-of-Flight (ToF) cameras rigidly attached to strategic locations on the daVinci Xi patient side cart (PSC). The cameras are registered to the robot's kinematic chain by performing a one-time calibration routine and therefore, information from all cameras can be fused and represented in one common coordinate frame. Based on this architecture, a multi-view 3D scene semantic segmentation algorithm is created to enable recognition of common and salient objects/equipment and surgical activities in a da Vinci OR. Our proposed 3D semantic segmentation method has been trained and validated on a novel densely annotated dataset that has been captured from clinical scenarios. Results: The results show that our proposed architecture has acceptable registration error ($3.3\%\pm1.4\%$ of object-camera distance) and can robustly improve scene segmentation performance (mean Intersection Over Union - mIOU) for less frequently appearing classes ($\ge 0.013$) compared to a single-view method. Conclusion: We present the first dynamic multi-view perception system with a novel segmentation architecture, which can be used as a building block technology for applications such as surgical workflow analysis, automation of surgical sub-tasks and advanced guidance systems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题