论文标题
BS-NET:在大胸部X射线数据集上学习COVID-19 COVID-19
BS-Net: learning COVID-19 pneumonia severity on a large Chest X-Ray dataset
论文作者
论文摘要
在这项工作中,我们设计了一种端到端的深度学习体系结构,用于预测胸部X射线图像(CXR),一种多区域评分,可传达COVID-19患者的肺部妥协程度。这种半定量评分系统,即Brixia〜评分,用于对此类患者的连续监测,显示出明显的预后价值,这是一家经历了意大利大流行峰值之一的医院。为了解决这样一个充满挑战的视觉任务,我们采用了一种弱势监督的学习策略,该策略构建了,以处理不同的任务(分段,空间对齐和得分估计),该任务培训了涉及不同数据集的“从始终到整体”的过程。特别是,我们利用了在同一医院收集的近5,000个CXR注释图像的临床数据集。我们的BS-NET在所有处理阶段都表现出自我煽动行为和高度准确性。通过评估者一致性测试和黄金标准比较,我们表明我们的解决方案在评级准确性和一致性方面优于单个人体注释,从而支持在计算机辅助监视的情况下使用此工具的可能性。还通过原始技术生成了高度分辨(超级像素级)的解释性图,以视觉上帮助理解肺部地区的网络活动。我们还考虑了文献中提出的其他分数,并与最近提出的非特异性方法进行了比较。我们最终在各种公共COVID-19数据集上测试了模型的性能鲁棒性,为此我们还提供了Brixia〜分数注释,观察良好的直接概括和微调功能,以突出其他临床环境中BS-NET的便携性。 CXR数据集以及源代码和训练有素的模型是出于研究目的公开发布的。
In this work we design an end-to-end deep learning architecture for predicting, on Chest X-rays images (CXR), a multi-regional score conveying the degree of lung compromise in COVID-19 patients. Such semi-quantitative scoring system, namely Brixia~score, is applied in serial monitoring of such patients, showing significant prognostic value, in one of the hospitals that experienced one of the highest pandemic peaks in Italy. To solve such a challenging visual task, we adopt a weakly supervised learning strategy structured to handle different tasks (segmentation, spatial alignment, and score estimation) trained with a "from-the-part-to-the-whole" procedure involving different datasets. In particular, we exploit a clinical dataset of almost 5,000 CXR annotated images collected in the same hospital. Our BS-Net demonstrates self-attentive behavior and a high degree of accuracy in all processing stages. Through inter-rater agreement tests and a gold standard comparison, we show that our solution outperforms single human annotators in rating accuracy and consistency, thus supporting the possibility of using this tool in contexts of computer-assisted monitoring. Highly resolved (super-pixel level) explainability maps are also generated, with an original technique, to visually help the understanding of the network activity on the lung areas. We also consider other scores proposed in literature and provide a comparison with a recently proposed non-specific approach. We eventually test the performance robustness of our model on an assorted public COVID-19 dataset, for which we also provide Brixia~score annotations, observing good direct generalization and fine-tuning capabilities that highlight the portability of BS-Net in other clinical settings. The CXR dataset along with the source code and the trained model are publicly released for research purposes.