论文标题
利用上下文以鲁棒性在主动学习中标记噪声
Exploiting Context for Robustness to Label Noise in Active Learning
论文作者
论文摘要
计算机视觉中的几项作品证明了当新的未标记数据可用时,积极学习以适应识别模型的有效性。这些作品中的大多数都认为从注释者获得的标签是正确的。但是,在实际情况下,由于标签的质量取决于注释者,因此某些标签可能是错误的,这会导致识别性能降低。在本文中,我们解决了i)i)系统如何确定哪些查询标签是错误的,ii)如何适应多级活动的活性学习系统以最大程度地减少标签噪声的负面影响。为了解决问题,我们提出了一种基于嘈杂的标签过滤方法,其中使用自然数据中很常见的相互关系(上下文)来检测错误的标签。我们构建了未标记数据的图形表示,以编码这些关系,并在嘈杂的标签可用时在图表上获得新的信念。将新信念与先前的关系信息进行比较,我们生成一个差异分数,以检测不正确的标签并使用正确的标签更新识别模型,从而获得更好的识别性能。在三种不同的应用程序中证明了这一点:场景分类,活动分类和文档分类。
Several works in computer vision have demonstrated the effectiveness of active learning for adapting the recognition model when new unlabeled data becomes available. Most of these works consider that labels obtained from the annotator are correct. However, in a practical scenario, as the quality of the labels depends on the annotator, some of the labels might be wrong, which results in degraded recognition performance. In this paper, we address the problems of i) how a system can identify which of the queried labels are wrong and ii) how a multi-class active learning system can be adapted to minimize the negative impact of label noise. Towards solving the problems, we propose a noisy label filtering based learning approach where the inter-relationship (context) that is quite common in natural data is utilized to detect the wrong labels. We construct a graphical representation of the unlabeled data to encode these relationships and obtain new beliefs on the graph when noisy labels are available. Comparing the new beliefs with the prior relational information, we generate a dissimilarity score to detect the incorrect labels and update the recognition model with correct labels which result in better recognition performance. This is demonstrated in three different applications: scene classification, activity classification, and document classification.