论文标题
迈向数据效率学习:COVID-19 CT肺和感染分段的基准
Towards Data-Efficient Learning: A Benchmark for COVID-19 CT Lung and Infection Segmentation
论文作者
论文摘要
目的:COVID-19 CT扫描中肺和感染的准确分割在患者的定量管理中起着重要作用。大多数现有研究都是基于从单个机构那里获得的大型和私人注释的数据集,尤其是当放射科医生忙于与冠状病毒疾病作斗争时。此外,很难比较当前的COVID-19 CT分割方法,因为它们是在不同数据集上开发,在不同设置中训练并通过不同指标进行评估的。方法:为了促进数据有效的深度学习方法的开发,在本文中,我们基于70个注释的COVID-19案例,为肺和感染细分建立了三个基准,其中包含当前的活跃研究领域,例如,很少射击学习,领域的概括和知识转移。为了在不同的分割方法之间进行公平的比较,我们还提供标准培训,验证和测试拆分,评估指标以及相应的代码。结果:基于最先进的网络,我们提供了40多种预训练的基线模型,不仅可以充当开箱即用的细分工具,而且还为对Covid-19肺和感染细分感兴趣的研究人员节省了计算时间。我们达到97.3 \%,97.7 \%和67.3 \%的平均骰子相似性系数(DSC)评分,分别为90.6 \%,91.4 \%\%和70.0 \%的平均归一化表面骰子(NSD)得分分别为70.0 \%\%。结论:据我们所知,这项工作为医疗图像细分和迄今为止训练的预培训最多的模型提供了第一个数据效率的学习基准。所有这些资源都是公开可用的,我们的工作奠定了促进深度学习方法的开发,以使用有限的数据进行有效的COVID-19 CT细分。
Purpose: Accurate segmentation of lung and infection in COVID-19 CT scans plays an important role in the quantitative management of patients. Most of the existing studies are based on large and private annotated datasets that are impractical to obtain from a single institution, especially when radiologists are busy fighting the coronavirus disease. Furthermore, it is hard to compare current COVID-19 CT segmentation methods as they are developed on different datasets, trained in different settings, and evaluated with different metrics. Methods: To promote the development of data-efficient deep learning methods, in this paper, we built three benchmarks for lung and infection segmentation based on 70 annotated COVID-19 cases, which contain current active research areas, e.g., few-shot learning, domain generalization, and knowledge transfer. For a fair comparison among different segmentation methods, we also provide standard training, validation and testing splits, evaluation metrics and, the corresponding code. Results: Based on the state-of-the-art network, we provide more than 40 pre-trained baseline models, which not only serve as out-of-the-box segmentation tools but also save computational time for researchers who are interested in COVID-19 lung and infection segmentation. We achieve average Dice Similarity Coefficient (DSC) scores of 97.3\%, 97.7\%, and 67.3\% and average Normalized Surface Dice (NSD) scores of 90.6\%, 91.4\%, and 70.0\% for left lung, right lung, and infection, respectively. Conclusions: To the best of our knowledge, this work presents the first data-efficient learning benchmark for medical image segmentation and the largest number of pre-trained models up to now. All these resources are publicly available, and our work lays the foundation for promoting the development of deep learning methods for efficient COVID-19 CT segmentation with limited data.