分布式在线学习以在认知雷达网络中共存

论文标题

分布式在线学习以在认知雷达网络中共存

Distributed Online Learning for Coexistence in Cognitive Radar Networks

论文作者

Howard, William, Martone, Anthony, Buehrer, R. Michael

论文摘要

这项工作解决了雷达网络的共存问题。具体而言，我们建模了一个合作，独立和非共同雷达节点的网络，该网络必须在网络中共享资源以及附近的不合作性发射器。我们使用在线机器学习（ML）技术解决此问题。由于每个雷达节点对环境或其他雷达节点的位置都不了解，因此特别优选在线学习方法，并且由于问题的顺序性质。对于此任务，我们专门选择了多武器多臂强盗（MMAB）模型，该模型将问题作为顺序游戏提出，在该游戏中，网络中的每个雷达节点都可以独立选择中心频率和波形，并具有相同的目标，即改进网络的跟踪性能。为了进行准确的跟踪，每个雷达节点都以设定的间隔将观测值传达给融合中心。融合中心具有雷达节点放置的知识，但无法与单个节点通信足够快以进行波形控制。网络中的每个雷达节点都必须学习环境的行为，其中包括奖励，干涉行为和目标行为。我们的贡献包括适合雷达网络方案的MMAB框架的数学描述。我们以对几种不同网络配置的模拟研究进行结论。实验结果表明，使用MMAB的迭代，在线学习优于更传统的感觉和避免感（SAA）和固定分配方法。

This work addresses the coexistence problem for radar networks. Specifically, we model a network of cooperative, independent, and non-communicating radar nodes which must share resources within the network as well as with non-cooperative nearby emitters. We approach this problem using online Machine Learning (ML) techniques. Online learning approaches are specifically preferred due to the fact that each radar node has no prior knowledge of the environment nor of the positions of the other radar nodes, and due to the sequential nature of the problem. For this task we specifically select the multi-player multi-armed bandit (MMAB) model, which poses the problem as a sequential game, where each radar node in a network makes independent selections of center frequency and waveform with the same goal of improving tracking performance for the network as a whole. For accurate tracking, each radar node communicates observations to a fusion center on set intervals. The fusion center has knowledge of the radar node placement, but cannot communicate to the individual nodes fast enough for waveform control. Every radar node in the network must learn the behavior of the environment, which includes rewards, interferer behavior, and target behavior. Our contributions include a mathematical description of the MMAB framework adapted to the radar network scenario. We conclude with a simulation study of several different network configurations. Experimental results show that iterative, online learning using MMAB outperforms the more traditional sense-and-avoid (SAA) and fixed-allocation approaches.

下载PDF全文

下载文献需遵守相关版权规定

论文标题