论文标题
将基于图像的乳腺癌风险模型中的固有风险和早期癌症征兆解耦
Decoupling Inherent Risk and Early Cancer Signs in Image-based Breast Cancer Risk Models
论文作者
论文摘要
准确估计患乳腺癌风险的能力对于临床决策将是无价的。一种有希望的新方法是将基于图像的风险模型集成基于深层神经网络。但是,在使用此类模型时必须小心,因为选择培训数据会影响网络将学会识别的模式。考虑到这一点,我们使用三个不同的标准训练了网络,以选择阳性训练数据(即患者会发展癌症的图像):一种固有的风险模型,该模型在没有可见的癌症迹象的图像上训练,癌症迹象训练了培训的癌症或早期症状症状的癌症模型,以及对癌症诊断患者的所有图像进行了培训的模型。我们发现,这三个模型学习着独特的特征,这些特征的重点是不同的模式,这将表现形成鲜明对比。短期风险最好通过癌症标志模型估算,而长期风险是固有的风险模型估算的。不小心的所有图像训练将固有的风险与早期的癌症征兆混为一谈,并且在这两个制度中都产生了次优估计。结果,当已经看到早期癌症症状时,混合模型可能会导致医生建议预防性行动。
The ability to accurately estimate risk of developing breast cancer would be invaluable for clinical decision-making. One promising new approach is to integrate image-based risk models based on deep neural networks. However, one must take care when using such models, as selection of training data influences the patterns the network will learn to identify. With this in mind, we trained networks using three different criteria to select the positive training data (i.e. images from patients that will develop cancer): an inherent risk model trained on images with no visible signs of cancer, a cancer signs model trained on images containing cancer or early signs of cancer, and a conflated model trained on all images from patients with a cancer diagnosis. We find that these three models learn distinctive features that focus on different patterns, which translates to contrasts in performance. Short-term risk is best estimated by the cancer signs model, whilst long-term risk is best estimated by the inherent risk model. Carelessly training with all images conflates inherent risk with early cancer signs, and yields sub-optimal estimates in both regimes. As a consequence, conflated models may lead physicians to recommend preventative action when early cancer signs are already visible.