论文标题
WSSS4LUAD:肺腺癌弱监督组织语义分割的巨大挑战
WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma
论文作者
论文摘要
肺癌是全球癌症死亡的主要原因,腺癌(LUAD)是最常见的亚型。利用组织病理学图像的潜在价值可以促进肿瘤学中的精确医学。组织分割是组织病理学图像分析的基本上游任务。现有的深度学习模型已经达到了卓越的细分性能,但需要足够的像素级注释,这既耗时又昂贵。为了丰富LUAD的标签资源并减轻注释努力,我们组织了这一挑战WSSSS4LUAD,呼吁为LUAD的组织病理学图像提供出色的弱弱监督语义细分(WSSS)技术。参与者必须设计算法以分割肿瘤上皮,与肿瘤相关的基质和正常组织,仅具有斑块级标签。这项挑战包括10,091个补丁级注释(训练集)和超过1.3亿个标记的像素(验证和测试集),从87个WSI(来自GDPH的67个,来自TCGA的20个)。所有标签都是由AI型号的帮助下的病理学家在循环管道中生成的,并由标签审查委员会检查。在532个注册中,有28个团队在测试阶段提交了结果,其中有1,000多个提交。最后,第一名的团队达到了0.8413的MIOU(肿瘤:0.8389,Stroma:0.7931,正常:0.8919)。根据顶级团队的技术报告,CAM仍然是WSSS中最受欢迎的方法。 cutmix数据增强已被广泛采用以生成更可靠的样本。随着这项挑战的成功,我们认为WSSS与补丁级注释的方法可以补充传统的像素注释,同时减少注释努力。整个数据集已发布,以鼓励更多关于LUAD和更多新颖WSSS技术计算病理学的研究。
Lung cancer is the leading cause of cancer death worldwide, and adenocarcinoma (LUAD) is the most common subtype. Exploiting the potential value of the histopathology images can promote precision medicine in oncology. Tissue segmentation is the basic upstream task of histopathology image analysis. Existing deep learning models have achieved superior segmentation performance but require sufficient pixel-level annotations, which is time-consuming and expensive. To enrich the label resources of LUAD and to alleviate the annotation efforts, we organize this challenge WSSS4LUAD to call for the outstanding weakly-supervised semantic segmentation (WSSS) techniques for histopathology images of LUAD. Participants have to design the algorithm to segment tumor epithelial, tumor-associated stroma and normal tissue with only patch-level labels. This challenge includes 10,091 patch-level annotations (the training set) and over 130 million labeled pixels (the validation and test sets), from 87 WSIs (67 from GDPH, 20 from TCGA). All the labels were generated by a pathologist-in-the-loop pipeline with the help of AI models and checked by the label review board. Among 532 registrations, 28 teams submitted the results in the test phase with over 1,000 submissions. Finally, the first place team achieved mIoU of 0.8413 (tumor: 0.8389, stroma: 0.7931, normal: 0.8919). According to the technical reports of the top-tier teams, CAM is still the most popular approach in WSSS. Cutmix data augmentation has been widely adopted to generate more reliable samples. With the success of this challenge, we believe that WSSS approaches with patch-level annotations can be a complement to the traditional pixel annotations while reducing the annotation efforts. The entire dataset has been released to encourage more researches on computational pathology in LUAD and more novel WSSS techniques.