论文标题
强大的医疗工具分割挑战2019
Robust Medical Instrument Segmentation Challenge 2019
论文作者
论文摘要
腹腔镜仪器的术中跟踪通常是计算机和机器人辅助干预措施的先决条件。尽管在文献中提出了许多基于内窥镜视频图像的医疗仪器的检测,细分和跟踪的方法,但文献中仍提出了关键限制:首先,鲁棒性,即,在具有挑战性图像的情况下,最先进的方法的可靠性能(例如,在血液,烟雾,烟雾或运动型中)。其次,概括;接受特定医院的特定干预培训的算法应推广到其他干预措施或机构。 为了促进解决这些限制的解决方案,我们组织了强大的医疗工具细分(鲁棒)挑战,作为国际基准测试竞赛,特别关注算法的鲁棒性和泛化能力。在内窥镜图像处理领域,我们的挑战首次包括二进制分割的任务,还解决了多实体检测和分割。挑战是基于一个手术数据集,其中包括从三种不同类型的手术的总共30次手术程序中获得的10,040个注释图像。在三个不同的阶段进行了三个任务的竞争方法(二进制分割,多实体检测和多实体分段)的验证,并且在培训数据和测试数据之间存在域间隙的增加。结果证实了最初的假设,即算法性能会随着域间隙的增加而降低。虽然表现最佳算法的平均检测和分割质量很高,但未来的研究应集中于对小型,交叉,移动和透明仪器(S)(零件)的检测和分割。
Intraoperative tracking of laparoscopic instruments is often a prerequisite for computer and robotic-assisted interventions. While numerous methods for detecting, segmenting and tracking of medical instruments based on endoscopic video images have been proposed in the literature, key limitations remain to be addressed: Firstly, robustness, that is, the reliable performance of state-of-the-art methods when run on challenging images (e.g. in the presence of blood, smoke or motion artifacts). Secondly, generalization; algorithms trained for a specific intervention in a specific hospital should generalize to other interventions or institutions. In an effort to promote solutions for these limitations, we organized the Robust Medical Instrument Segmentation (ROBUST-MIS) challenge as an international benchmarking competition with a specific focus on the robustness and generalization capabilities of algorithms. For the first time in the field of endoscopic image processing, our challenge included a task on binary segmentation and also addressed multi-instance detection and segmentation. The challenge was based on a surgical data set comprising 10,040 annotated images acquired from a total of 30 surgical procedures from three different types of surgery. The validation of the competing methods for the three tasks (binary segmentation, multi-instance detection and multi-instance segmentation) was performed in three different stages with an increasing domain gap between the training and the test data. The results confirm the initial hypothesis, namely that algorithm performance degrades with an increasing domain gap. While the average detection and segmentation quality of the best-performing algorithms is high, future research should concentrate on detection and segmentation of small, crossing, moving and transparent instrument(s) (parts).