论文标题
用于任意为导向对象检测的动态锚学习
Dynamic Anchor Learning for Arbitrary-Oriented Object Detection
论文作者
论文摘要
任意面向的物体广泛出现在自然场景,航空照片,遥感图像等中,因此,任意面向的对象检测受到了很大的关注。许多当前的旋转探测器使用大量具有不同方向的锚点来实现与地面真相盒的空间对齐,然后将联合会的交叉点(IOU)用于采样训练的正面和负面候选者。但是,我们观察到所选的阳性不能始终确保回归后的准确检测,而某些负样本可以实现准确的定位。这表明通过IOU对锚的质量评估是不合适的,这进一步导致分类信心和本地化准确性之间的不一致。在本文中,我们提出了一种动态锚学习(DAL)方法,该方法利用新定义的匹配度来全面评估锚的本地化潜力并进行更有效的标签分配过程。通过这种方式,检测器可以动态选择高质量的锚以实现准确的对象检测,并且将减轻分类和回归之间的差异。通过新引入的DAL,我们获得了只有几个水平预设锚的任意对象的出色检测性能。三个遥感数据集HRSC2016,DOTA,UCAS-AOD以及场景文本数据集ICDAR 2015上的实验结果表明,与基线模型相比,我们的方法可以实现实质性改进。此外,我们的方法也是通用的,用于使用水平绑定框进行对象检测。代码和型号可在https://github.com/ming71/dal上找到。
Arbitrary-oriented objects widely appear in natural scenes, aerial photographs, remote sensing images, etc., thus arbitrary-oriented object detection has received considerable attention. Many current rotation detectors use plenty of anchors with different orientations to achieve spatial alignment with ground truth boxes, then Intersection-over-Union (IoU) is applied to sample the positive and negative candidates for training. However, we observe that the selected positive anchors cannot always ensure accurate detections after regression, while some negative samples can achieve accurate localization. It indicates that the quality assessment of anchors through IoU is not appropriate, and this further lead to inconsistency between classification confidence and localization accuracy. In this paper, we propose a dynamic anchor learning (DAL) method, which utilizes the newly defined matching degree to comprehensively evaluate the localization potential of the anchors and carry out a more efficient label assignment process. In this way, the detector can dynamically select high-quality anchors to achieve accurate object detection, and the divergence between classification and regression will be alleviated. With the newly introduced DAL, we achieve superior detection performance for arbitrary-oriented objects with only a few horizontal preset anchors. Experimental results on three remote sensing datasets HRSC2016, DOTA, UCAS-AOD as well as a scene text dataset ICDAR 2015 show that our method achieves substantial improvement compared with the baseline model. Besides, our approach is also universal for object detection using horizontal bound box. The code and models are available at https://github.com/ming71/DAL.