论文标题
部分可观测时空混沌系统的无模型预测
Mixed-UNet: Refined Class Activation Mapping for Weakly-Supervised Semantic Segmentation with Multi-scale Inference
论文作者
论文摘要
深度学习技术在医学图像处理中表现出了巨大的潜力,尤其是通过对磁共振成像(MRI)扫描或计算机断层扫描(CT)扫描的准确可靠的图像分割,从而允许病变的定位和诊断。但是,训练这些分割模型需要大量的手动注释像素级标签,这些标签是耗时且劳动密集型的,与易于获得的图像级标签相比。必须使用图像级标签作为监督来通过弱监督的语义分割模型来解决此问题,因为它可以大大减少人类注释的工作。大多数高级解决方案利用了类激活映射(CAM)。但是,原始凸轮很少捕获病变的精确边界。在这项研究中,我们提出了多尺度推断的策略,以减少单尺度推理的细节损失来完善CAM。为了进行分割,我们开发了一个名为“混合unet”的新型模型,该模型在解码阶段具有两个平行分支。可以在将两个分支提取的特征融合后获得结果。我们对从当地医院和公共数据集收集的数据集中的几种普遍基于深度学习的细分方法进行了评估。验证结果表明,我们的模型在相同的监督水平下超过了可用的方法,从而分割了大脑成像的各种病变。
Deep learning techniques have shown great potential in medical image processing, particularly through accurate and reliable image segmentation on magnetic resonance imaging (MRI) scans or computed tomography (CT) scans, which allow the localization and diagnosis of lesions. However, training these segmentation models requires a large number of manually annotated pixel-level labels, which are time-consuming and labor-intensive, in contrast to image-level labels that are easier to obtain. It is imperative to resolve this problem through weakly-supervised semantic segmentation models using image-level labels as supervision since it can significantly reduce human annotation efforts. Most of the advanced solutions exploit class activation mapping (CAM). However, the original CAMs rarely capture the precise boundaries of lesions. In this study, we propose the strategy of multi-scale inference to refine CAMs by reducing the detail loss in single-scale reasoning. For segmentation, we develop a novel model named Mixed-UNet, which has two parallel branches in the decoding phase. The results can be obtained after fusing the extracted features from two branches. We evaluate the designed Mixed-UNet against several prevalent deep learning-based segmentation approaches on our dataset collected from the local hospital and public datasets. The validation results demonstrate that our model surpasses available methods under the same supervision level in the segmentation of various lesions from brain imaging.