评估肺CT中肺组织建模的3D甘甘

论文标题

评估肺CT中肺组织建模的3D甘甘

Evaluation of 3D GANs for Lung Tissue Modelling in Pulmonary CT

论文作者

Ellis, Sam, Manzanera, Octavio E. Martinez, Baltatzis, Vasileios, Nawaz, Ibrahim, Nair, Arjun, Folgoc, Loïc Le, Desai, Sujal, Glocker, Ben, Schnabel, Julia A.

论文摘要

甘斯能够准确地对复杂的高维数据集的分布进行建模，例如图像。这使得高质量的gan可用于医学成像中无监督的异常检测。但是，训练数据集（例如输出图像维度和语义上有意义的特征的外观）的差异意味着自然图像域中的GAN模型可能无法“开箱即用”进行医学成像，因此需要重新实现和重新评估。在这项工作中，我们适应三个GAN模型，以建模肺CT的3D健康图像贴片。据我们所知，这是第一次进行这样的评估。 DCGAN，Stylegan和Biggan体系结构因其在自然图像处理中的无处不在和高性能而进行了研究。我们训练这些方法的不同变体，并使用FID分数评估其性能。此外，通过人类观察者的研究评估了生成的图像的质量，研究了网络模拟3D域特异性特征的能力，并分析了GAN潜在空间的结构。结果表明，3D Stylegan产生具有有意义的3D结构的现实图像，但遭受模式崩溃的折磨，必须在训练过程中解决以获得样本多样性。相反，3D DCGAN模型显示出更大的图像可变性能力，但以质量质量不佳的成本为代价。 3D Biggan型号提供了中间图像质量的水平，但最准确地模拟了所选语义意义的特征的分布。结果表明，未来的开发需要实现具有足够能力的3D GAN进行基于补丁的肺CT异常检测，我们为将来的研究领域提供建议，例如尝试其他体系结构并纳入位置编码。

GANs are able to model accurately the distribution of complex, high-dimensional datasets, e.g. images. This makes high-quality GANs useful for unsupervised anomaly detection in medical imaging. However, differences in training datasets such as output image dimensionality and appearance of semantically meaningful features mean that GAN models from the natural image domain may not work `out-of-the-box' for medical imaging, necessitating re-implementation and re-evaluation. In this work we adapt and evaluate three GAN models to the task of modelling 3D healthy image patches for pulmonary CT. To the best of our knowledge, this is the first time that such an evaluation has been performed. The DCGAN, styleGAN and the bigGAN architectures were investigated due to their ubiquity and high performance in natural image processing. We train different variants of these methods and assess their performance using the FID score. In addition, the quality of the generated images was evaluated by a human observer study, the ability of the networks to model 3D domain-specific features was investigated, and the structure of the GAN latent spaces was analysed. Results show that the 3D styleGAN produces realistic-looking images with meaningful 3D structure, but suffer from mode collapse which must be addressed during training to obtain samples diversity. Conversely, the 3D DCGAN models show a greater capacity for image variability, but at the cost of poor-quality images. The 3D bigGAN models provide an intermediate level of image quality, but most accurately model the distribution of selected semantically meaningful features. The results suggest that future development is required to realise a 3D GAN with sufficient capacity for patch-based lung CT anomaly detection and we offer recommendations for future areas of research, such as experimenting with other architectures and incorporation of position-encoding.

下载PDF全文

下载文献需遵守相关版权规定

论文标题