边缘保留vipriors Challenge的指导性语义细分

论文标题

边缘保留vipriors Challenge的指导性语义细分

Edge-Preserving Guided Semantic Segmentation for VIPriors Challenge

论文作者

Hsu, Chih-Chung, Ma, Hsin-Ti

论文摘要

语义细分是计算机视觉中最有吸引力的研究领域之一。在Vipriors挑战赛中，只允许训练样本数量非常有限，从而使当前的最新和深度学习的语义细分技术很难训练。因此，为了克服这一缺点，我们提出了边缘保护指导以获取额外的先验信息，以避免在小规模培训数据集中过度拟合。首先，将两道融合的卷积层与常规语义分割网络的最后一层串联。然后，通过SOBEL操作从地面真相计算边缘图，然后将硬阈值操作串联以指示像素是否为边缘。然后，采用二维跨透明拷贝损失来计算预测边缘图与其地面真相之间的损失，称为边缘保留损失。这样，不同实例之间边界的连续性可以被提议的边缘保留损失强迫。实验表明，与最新的语义分割技术相比，在小规模训练集中提出的方法可以在小规模训练集中实现出色的性能。

Semantic segmentation is one of the most attractive research fields in computer vision. In the VIPriors challenge, only very limited numbers of training samples are allowed, leading to that the current state-of-the-art and deep learning-based semantic segmentation techniques are hard to train well. To overcome this shortcoming, therefore, we propose edge-preserving guidance to obtain the extra prior information, to avoid the overfitting under small-scale training dataset. First, a two-channeled convolutional layer is concatenated to the last layer of the conventional semantic segmentation network. Then, an edge map is calculated from the ground truth by Sobel operation and followed by concatenating a hard-thresholding operation to indicate whether the pixel is the edge or not. Then, the two-dimensional cross-entropy loss is adopted to calculate the loss between the predicted edge map and its ground truth, termed as an edge-preserving loss. In this way, the continuity of boundaries between different instances can be forced by the proposed edge-preserving loss. Experiments demonstrate that the proposed method can achieve excellent performance under small-scale training set, compared to state-of-the-art semantic segmentation techniques.

下载PDF全文

下载文献需遵守相关版权规定

论文标题