共同发生的疾病严重影响了弱监督的学习模型的胸部CT分类的表现

论文标题

共同发生的疾病严重影响了弱监督的学习模型的胸部CT分类的表现

Co-occurring Diseases Heavily Influence the Performance of Weakly Supervised Learning Models for Classification of Chest CT

论文作者

Tushar, Fakrul Islam, D'Anniballe, Vincent M., Rubin, Geoffrey D., Samei, Ehsan, Lo, Joseph Y.

论文摘要

尽管有弱监督学习自动注释大量数据的潜力，但对其在计算机辅助诊断（CAD）中使用的局限性知之甚少。对于CT而言，鉴于大量共同发生的疾病，解释CAD算法的性能可能具有挑战性。本文通过弱监督的学习训练分类模型，特别是通过使用相同的培训数据比较多标签和多个二元分类器时，研究了同时发生疾病的效果。我们的结果表明，二进制模型在AUC方面优于每个疾病类别的多标签分类。但是，这种表现受二进制模型中同时发生疾病的严重影响，这表明它并不总是学会特定疾病的正确外观。例如，当没有其他同时发生疾病时，肺结节的二元分类导致AUC <0.65，但是当肺结节与肺气肿共发生时，性能达到了AUC> 0.80。我们希望本文能够揭示在弱监督模型中解释疾病分类表现的复杂性，并鼓励研究人员研究未来共同发生疾病对分类绩效的影响。

Despite the potential of weakly supervised learning to automatically annotate massive amounts of data, little is known about its limitations for use in computer-aided diagnosis (CAD). For CT specifically, interpreting the performance of CAD algorithms can be challenging given the large number of co-occurring diseases. This paper examines the effect of co-occurring diseases when training classification models by weakly supervised learning, specifically by comparing multi-label and multiple binary classifiers using the same training data. Our results demonstrated that the binary model outperformed the multi-label classification in every disease category in terms of AUC. However, this performance was heavily influenced by co-occurring diseases in the binary model, suggesting it did not always learn the correct appearance of the specific disease. For example, binary classification of lung nodules resulted in an AUC of < 0.65 when there were no other co-occurring diseases, but when lung nodules co-occurred with emphysema, the performance reached AUC > 0.80. We hope this paper revealed the complexity of interpreting disease classification performance in weakly supervised models and will encourage researchers to examine the effect of co-occurring diseases on classification performance in the future.

下载PDF全文

下载文献需遵守相关版权规定

论文标题