大天使：一种基于混合无人机的人类检测基准，具有位置和姿势元数据

论文标题

大天使：一种基于混合无人机的人类检测基准，具有位置和姿势元数据

Archangel: A Hybrid UAV-based Human Detection Benchmark with Position and Pose Metadata

论文作者

Shen, Yi-Ting, Lee, Yaesop, Kwon, Heesung, Conover, Damon M., Bhattacharyya, Shuvra S., Vale, Nikolas, Gray, Joshua D., Leong, G. Jeremy, Evensen, Kenneth, Skirlo, Frank

论文摘要

学习在无人驾驶飞机（UAV）捕获的图像中检测物体（例如人类）通常会遭受无人机对物体的位置造成的巨大变化。此外，现有的基于无人机的基准数据集并未提供足够的数据集元数据，这对于这些变体的精确模型诊断和学习功能至关重要。在本文中，我们介绍了大天使，这是第一个基于无人机的对象检测数据集，该数据集由具有相似想象条件以及无人机位置以及对象姿势元数据捕获的真实和合成子集组成。一系列实验经过精心设计，使用最新的对象检测器设计，以证明在模型评估过程中利用元数据的好处。此外，还提供了涉及模型优化过程中实际和合成数据的几种关键见解。最后，我们讨论了有关大天使的优势，局限性和未来方向，以突出其对更广泛的机器学习社区的独特价值。

Learning to detect objects, such as humans, in imagery captured by an unmanned aerial vehicle (UAV) usually suffers from tremendous variations caused by the UAV's position towards the objects. In addition, existing UAV-based benchmark datasets do not provide adequate dataset metadata, which is essential for precise model diagnosis and learning features invariant to those variations. In this paper, we introduce Archangel, the first UAV-based object detection dataset composed of real and synthetic subsets captured with similar imagining conditions and UAV position and object pose metadata. A series of experiments are carefully designed with a state-of-the-art object detector to demonstrate the benefits of leveraging the metadata during model evaluation. Moreover, several crucial insights involving both real and synthetic data during model optimization are presented. In the end, we discuss the advantages, limitations, and future directions regarding Archangel to highlight its distinct value for the broader machine learning community.

下载PDF全文

下载文献需遵守相关版权规定

论文标题