用U-NET进行滑坡分割：评估不同的采样方法和贴剂大小

论文标题

用U-NET进行滑坡分割：评估不同的采样方法和贴剂大小

Landslide Segmentation with U-Net: Evaluating Different Sampling Methods and Patch Sizes

论文作者

Soares, Lucas P., Dias, Helen C., Grohmann, Carlos H.

论文摘要

滑坡库存图对于验证预测性滑坡模型至关重要。但是，由于大多数映射方法都依赖于视觉解释或专家知识，因此仍然缺乏详细的库存图。这项研究使用了一个名为U-NET的完全卷积深度学习模型，以自动在巴西东南部里约热内卢山脉的Nova Friburgo市自动划分山体滑坡。目的是评估斑块大小，采样方法和数据集对模型整体准确性的影响。培训数据使用了Rapideye卫星的光学信息，以及来自ALOS卫星的L波段传感器的数字高程模型（DEM）。使用随机和常规网格方法对数据进行采样，并以三种尺寸（32x32、64x64和128x128像素）进行修补。在两个区域评估了模型，以召回，F1得分和平均值相交（MIOU）指标。结果表明，经过32x32瓷砖训练的模型由于较高的真实正速率而往往具有更高的召回值。但是，他们将更多背景区域误分类为滑坡（误报）。经过128x128瓷砖训练的模型通常会达到更高的精度值，因为它们会造成较少的假正错误。在这两个测试区域中，DEM和增强提高了模型的准确性。随机抽样有助于模型概括。从使用Rapideye图像，DEM信息和增强的数据中训练了128x128随机瓷砖的模型，达到了最高的F1分数，在测试区域中获得了0.55，在测试区域中获得了0.58。这项研究中获得的结果与文献中发现的其他完全卷积模型相媲美，从而增加了该地区的知识。

Landslide inventory maps are crucial to validate predictive landslide models; however, since most mapping methods rely on visual interpretation or expert knowledge, detailed inventory maps are still lacking. This study used a fully convolutional deep learning model named U-net to automatically segment landslides in the city of Nova Friburgo, located in the mountainous range of Rio de Janeiro, southeastern Brazil. The objective was to evaluate the impact of patch sizes, sampling methods, and datasets on the overall accuracy of the models. The training data used the optical information from RapidEye satellite, and a digital elevation model (DEM) derived from the L-band sensor of the ALOS satellite. The data was sampled using random and regular grid methods and patched in three sizes (32x32, 64x64, and 128x128 pixels). The models were evaluated on two areas with precision, recall, f1-score, and mean intersect over union (mIoU) metrics. The results show that the models trained with 32x32 tiles tend to have higher recall values due to higher true positive rates; however, they misclassify more background areas as landslides (false positives). Models trained with 128x128 tiles usually achieve higher precision values because they make less false positive errors. In both test areas, DEM and augmentation increased the accuracy of the models. Random sampling helped in model generalization. Models trained with 128x128 random tiles from the data that used the RapidEye image, DEM information, and augmentation achieved the highest f1-score, 0.55 in test area one, and 0.58 in test area two. The results achieved in this study are comparable to other fully convolutional models found in the literature, increasing the knowledge in the area.

下载PDF全文

下载文献需遵守相关版权规定

论文标题