论文标题

通过诊断校准校准观察者间分段不确定性。

Calibrate the inter-observer segmentation uncertainty via diagnosis-first principle

论文作者

Wu, Junde, Fang, Huihui, Xiong, Hoayi, Duan, Lixin, Tan, Mingkui, Yang, Weihua, Liu, Huiying, Xu, Yanwu

论文摘要

在医学图像上,许多组织/病变可能模棱两可。这就是为什么一群临床专家会注释医疗细分以减轻个人偏见的原因。但是,这种临床常规也为机器学习算法的应用带来了新的挑战。没有确定的基础真相,很难训练和评估深度学习模型。当从不同的级别收集注释时,一个共同的选择是多数票。然而,这种策略忽略了分级专家之间的差异。在本文中,我们考虑使用校准的观察者间的不确定性来预测分割的任务。我们注意到,在临床实践中,医疗图像分割通常用于帮助疾病诊断。受这一观察的启发,我们提出了诊断优先原则,该原则是将疾病诊断作为校准观察者间分段不确定性的标准。在这个想法之后,提出了一个名为诊断的诊断框架(DIFF),以估算从原始图像中进行诊断优先于诊断。特别是,DIFF将首先学会融合多邀请者分段标签,以使疾病诊断性能最大化单一的基础真相。我们将融合的地面真相称为诊断第一基地真实(DF-GT)。我们验证了DIFF对三个不同的医学分割任务的有效性:对眼底图像的OD/OC分割,超声图像上的甲状腺结节分割以及皮肤镜图像上的皮肤病变分割。实验结果表明,拟议的差异能够显着促进相应的疾病诊断,这表现优于先前的最先进的多评价者学习方法。

On the medical images, many of the tissues/lesions may be ambiguous. That is why the medical segmentation is typically annotated by a group of clinical experts to mitigate the personal bias. However, this clinical routine also brings new challenges to the application of machine learning algorithms. Without a definite ground-truth, it will be difficult to train and evaluate the deep learning models. When the annotations are collected from different graders, a common choice is majority vote. However such a strategy ignores the difference between the grader expertness. In this paper, we consider the task of predicting the segmentation with the calibrated inter-observer uncertainty. We note that in clinical practice, the medical image segmentation is usually used to assist the disease diagnosis. Inspired by this observation, we propose diagnosis-first principle, which is to take disease diagnosis as the criterion to calibrate the inter-observer segmentation uncertainty. Following this idea, a framework named Diagnosis First segmentation Framework (DiFF) is proposed to estimate diagnosis-first segmentation from the raw images.Specifically, DiFF will first learn to fuse the multi-rater segmentation labels to a single ground-truth which could maximize the disease diagnosis performance. We dubbed the fused ground-truth as Diagnosis First Ground-truth (DF-GT).Then, we further propose Take and Give Modelto segment DF-GT from the raw image. We verify the effectiveness of DiFF on three different medical segmentation tasks: OD/OC segmentation on fundus images, thyroid nodule segmentation on ultrasound images, and skin lesion segmentation on dermoscopic images. Experimental results show that the proposed DiFF is able to significantly facilitate the corresponding disease diagnosis, which outperforms previous state-of-the-art multi-rater learning methods.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源