无线电力传输的无人机的在线路线进行深入加强学习

论文标题

无线电力传输的无人机的在线路线进行深入加强学习

Deep Reinforcement Learning for Online Routing of Unmanned Aerial Vehicles with Wireless Power Transfer

论文作者

Li, Kaiwen, Zhang, Tao, Wang, Rui, Wang, Ling

论文摘要

由于其灵活性和多功能性，无人驾驶飞机（UAV）在各种应用中起着至关重要的作用。本文提出了一种深入的加固学习方法，以解决无线电源传输的无人机在线路由问题，该方法可以远程充电无线电，从而扩展了电池有限的无人机的能力。我们的研究认为无人机的功耗和无线充电过程。与以前的作品不同，我们通过设计的深神经网络解决了问题。该模型是使用深入的强化学习方法离线训练的，并用于在线优化无人机路由问题。在小规模和大规模实例上，提议的模型的运行速度比Google Or-Tools（最先进的组合优化求解器）的速度从四次到500倍，具有相同的解决方案质量。就运行时和最佳性而言，它还优于不同类型的启发式和局部搜索方法。此外，一旦对模型进行了训练，它可以扩展到具有训练过程中未见任意拓扑的新生成的问题实例。当问题量表较大并且响应时间至关重要时，所提出的方法实际上适用。

The unmanned aerial vehicle (UAV) plays an vital role in various applications such as delivery, military mission, disaster rescue, communication, etc., due to its flexibility and versatility. This paper proposes a deep reinforcement learning method to solve the UAV online routing problem with wireless power transfer, which can charge the UAV remotely without wires, thus extending the capability of the battery-limited UAV. Our study considers the power consumption of the UAV and the wireless charging process. Unlike the previous works, we solve the problem by a designed deep neural network. The model is trained using a deep reinforcement learning method offline, and is used to optimize the UAV routing problem online. On small and large scale instances, the proposed model runs from four times to 500 times faster than Google OR-tools, the state-of-the-art combinatorial optimization solver, with identical solution quality. It also outperforms different types of heuristic and local search methods in terms of both run-time and optimality. In addition, once the model is trained, it can scale to new generated problem instances with arbitrary topology that are not seen during training. The proposed method is practically applicable when the problem scale is large and the response time is crucial.

下载PDF全文

下载文献需遵守相关版权规定

论文标题