论文标题

通过从潜在类别中学习的长尾识别

Long-tailed Recognition by Learning from Latent Categories

论文作者

Liu, Weide, Wu, Zhonghua, Wang, Yiming, Ding, Henghui, Liu, Fayao, Lin, Jie, Lin, Guosheng

论文摘要

在这项工作中,我们解决了长尾图像识别的具有挑战性的任务。以前的长尾识别方法通常专注于尾部类别的数据增强或重新平衡策略,以在模型训练期间更加关注尾巴类。但是,由于尾巴类别的训练图像有限,尾部类图像的多样性仍受到限制,从而导致特征表现不佳。在这项工作中,我们假设头部和尾部类别中的常见潜在特征可用于提供更好的功能表示。在此激励的情况下,我们引入了基于潜在类别的长尾识别(LCREG)方法。具体而言,我们建议学习一组在头部和尾部类中共享的类不足的潜在特征。然后,我们通过将语义数据扩展应用于潜在特征,隐式地丰富了训练样本的多样性。对五个长尾图识别数据集进行的广泛实验表明,我们提出的LCREG能够显着超过以前的方法并实现最新结果。

In this work, we address the challenging task of long-tailed image recognition. Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training. However, due to the limited training images for tail classes, the diversity of tail class images is still restricted, which results in poor feature representations. In this work, we hypothesize that common latent features among the head and tail classes can be used to give better feature representation. Motivated by this, we introduce a Latent Categories based long-tail Recognition (LCReg) method. Specifically, we propose to learn a set of class-agnostic latent features shared among the head and tail classes. Then, we implicitly enrich the training sample diversity via applying semantic data augmentation to the latent features. Extensive experiments on five long-tailed image recognition datasets demonstrate that our proposed LCReg is able to significantly outperform previous methods and achieve state-of-the-art results.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源