论文标题
朝在线域自适应对象检测
Towards Online Domain Adaptive Object Detection
论文作者
论文摘要
现有的对象检测模型假定训练数据和测试数据是从同一源域采样的。当这些检测器部署在遇到新的视觉域的现实应用程序中时,此假设并不成立。通常采用无监督的域适应性(UDA)方法来减轻域移位引起的不利影响。现有的UDA方法以离线方式运行,该方法首先适应目标域,然后部署在现实世界应用程序中。但是,由于模型经常遇到新的域移动,因此这种离线适应策略不适合现实世界应用。因此,开发一种可行的UDA方法至关重要,该方法将以连续的在线方式推广到部署时间期间遇到的这些域转移。为此,我们提出了一个新颖的统一适应框架,该框架适应并改善了在线设置中目标域的概括。特别是,我们介绍了MEMXFORMER-基于跨意义变压器的内存模块,其中存储器中的项目利用域移动并记录目标分布的原型模式。此外,MEMXFORMER会产生强大的正面和负面对,以指导新型的对比损失,从而增强了目标特定表示的学习。关于各种检测基准的实验表明,拟议的策略可以在在线和离线设置中产生最先进的性能。据我们所知,这是第一项解决在线和离线适应设置以进行对象检测的工作。 https://github.com/vibashan/memxformer-online-da上的代码
Existing object detection models assume both the training and test data are sampled from the same source domain. This assumption does not hold true when these detectors are deployed in real-world applications, where they encounter new visual domain. Unsupervised Domain Adaptation (UDA) methods are generally employed to mitigate the adverse effects caused by domain shift. Existing UDA methods operate in an offline manner where the model is first adapted towards the target domain and then deployed in real-world applications. However, this offline adaptation strategy is not suitable for real-world applications as the model frequently encounters new domain shifts. Hence, it becomes critical to develop a feasible UDA method that generalizes to these domain shifts encountered during deployment time in a continuous online manner. To this end, we propose a novel unified adaptation framework that adapts and improves generalization on the target domain in online settings. In particular, we introduce MemXformer - a cross-attention transformer-based memory module where items in the memory take advantage of domain shifts and record prototypical patterns of the target distribution. Further, MemXformer produces strong positive and negative pairs to guide a novel contrastive loss, which enhances target specific representation learning. Experiments on diverse detection benchmarks show that the proposed strategy can produce state-of-the-art performance in both online and offline settings. To the best of our knowledge, this is the first work to address online and offline adaptation settings for object detection. Code at https://github.com/Vibashan/memXformer-online-da