论文标题

从具有深神网络的嘈杂标签中学习:一项调查

Learning from Noisy Labels with Deep Neural Networks: A Survey

论文作者

Song, Hwanjun, Kim, Minseok, Park, Dongmin, Shin, Yooju, Lee, Jae-Gil

论文摘要

在大量大数据的帮助下,深度学习在众多领域取得了巨大的成功。但是,数据标签的质量是一个问题,因为在许多实际情况下缺乏高质量的标签。由于嘈杂的标签严重降低了深神经网络的概括性能,因此从嘈杂的标签中学习(强大的培训)正在成为现代深度学习应用中的重要任务。在这项调查中,我们首先从监督的学习角度描述了用标签噪音学习的问题。接下来,我们对62种最先进的鲁棒训练方法进行了全面的综述,所有这些方法都根据其方法论差异归类为五组,然后对用于评估其优越性的六种属性进行系统比较。随后,我们对噪声率估计的深入分析,并总结了典型的评估方法,包括公共嘈杂的数据集和评估指标。最后,我们提出了一些有前途的研究方向,可以作为未来研究的指导。所有内容将在https://github.com/songhwanjun/awesome-noisy-labels上找到。

Deep learning has achieved remarkable success in numerous domains with help from large amounts of big data. However, the quality of data labels is a concern because of the lack of high-quality labels in many real-world scenarios. As noisy labels severely degrade the generalization performance of deep neural networks, learning from noisy labels (robust training) is becoming an important task in modern deep learning applications. In this survey, we first describe the problem of learning with label noise from a supervised learning perspective. Next, we provide a comprehensive review of 62 state-of-the-art robust training methods, all of which are categorized into five groups according to their methodological difference, followed by a systematic comparison of six properties used to evaluate their superiority. Subsequently, we perform an in-depth analysis of noise rate estimation and summarize the typically used evaluation methodology, including public noisy datasets and evaluation metrics. Finally, we present several promising research directions that can serve as a guideline for future studies. All the contents will be available at https://github.com/songhwanjun/Awesome-Noisy-Labels.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源