论文标题

AI可以不用标签而发展:通过知识蒸馏,用于胸部X射线诊断的自我发展的视觉变压器

AI can evolve without labels: self-evolving vision transformer for chest X-ray diagnosis through knowledge distillation

论文作者

Park, Sangjoon, Kim, Gwanghyun, Oh, Yujin, Seo, Joon Beom, Lee, Sang Min, Kim, Jin Hwan, Moon, Sungjun, Lim, Jae-Kwang, Park, Chang Min, Ye, Jong Chul

论文摘要

尽管基于深度学习的计算机辅助诊断系统最近已经实现了专家级的性能,但是开发出强大的深度学习模型需要具有手动注释的大型高质量数据,这很昂贵。这种情况提出了这样一个问题,即由于专家缺乏手动标记,尤其是在被剥夺的地区,无法使用每年在医院中收集的胸部X射线。为了解决这个问题,我们在这里提出了一个新颖的深度学习框架,该框架通过自我监督的学习和自我培训使用知识蒸馏,这表明,通过少数标签训练的原始模型的性能可以通过更无标记的数据逐渐改善。实验结果表明,该提出的框架对现实环境保持了令人印象深刻的鲁棒性,并且对多种诊断任务(例如结核病,气胸和Covid-19)具有一般适用性。值得注意的是,我们证明了我们的模型的性能比接受相同数量标记数据的训练的模型还要好。拟议的框架具有医学成像的巨大潜力,每年都会积累大量数据,但是地面真相注释很昂贵。

Although deep learning-based computer-aided diagnosis systems have recently achieved expert-level performance, developing a robust deep learning model requires large, high-quality data with manual annotation, which is expensive to obtain. This situation poses the problem that the chest x-rays collected annually in hospitals cannot be used due to the lack of manual labeling by experts, especially in deprived areas. To address this, here we present a novel deep learning framework that uses knowledge distillation through self-supervised learning and self-training, which shows that the performance of the original model trained with a small number of labels can be gradually improved with more unlabeled data. Experimental results show that the proposed framework maintains impressive robustness against a real-world environment and has general applicability to several diagnostic tasks such as tuberculosis, pneumothorax, and COVID-19. Notably, we demonstrated that our model performs even better than those trained with the same amount of labeled data. The proposed framework has a great potential for medical imaging, where plenty of data is accumulated every year, but ground truth annotations are expensive to obtain.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源