HARFANG3D狗斗盒：用于战斗机定制控制任务的加固学习研究平台

论文标题

HARFANG3D狗斗盒：用于战斗机定制控制任务的加固学习研究平台

Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts

论文作者

Özbek, Muhammed Murat, Yıldırım, Süleyman, Aksoy, Muhammet, Kernin, Eric, Koyuncu, Emre

论文摘要

深度学习（DL）的出现引起了增强学习（RL）研究的重大突破。深入加强学习（DRL）算法在ATARI 2600游戏中应用于基于视觉的控制问题时已达到超人水平的技能，从Pixel信息中提取了环境状态。不幸的是，这些环境远非适用于高度动态和复杂的现实世界任务，就像对战斗机的自主控制一样，因为这些环境仅涉及视觉世界的2D表示。在这里，我们提出了一个半现实的飞行模拟环境Harfang3D狗斗盒，用于战斗机。它的目的是一种灵活的工具箱，用于调查使用强化学习中航空研究中的主要挑战。该程序可轻松访问飞行动力学模型，环境状态和飞机空气动力学，使用户能够自定义任何特定任务，以便通过RL构建智能决策（控制）系统。该软件还允许部署机器人飞机并开发多代理任务。这样，可以将多组飞机配置为具有竞争力或合作代理，以执行复杂的任务，包括狗打架。在实验中，我们对两种不同的情况进行了培训：导航到指定位置，在视觉范围内（WVR）战斗，不久之后。在两种情况下，使用深厚的增强学习技术，我们能够培训表现出类似人类行为的能力的代理。基于此结果，可以证实Harfang3D狗斗盒可以用作3D现实的RL研究平台。

The advent of deep learning (DL) gave rise to significant breakthroughs in Reinforcement Learning (RL) research. Deep Reinforcement Learning (DRL) algorithms have reached super-human level skills when applied to vision-based control problems as such in Atari 2600 games where environment states were extracted from pixel information. Unfortunately, these environments are far from being applicable to highly dynamic and complex real-world tasks as in autonomous control of a fighter aircraft since these environments only involve 2D representation of a visual world. Here, we present a semi-realistic flight simulation environment Harfang3D Dog-Fight Sandbox for fighter aircrafts. It is aimed to be a flexible toolbox for the investigation of main challenges in aviation studies using Reinforcement Learning. The program provides easy access to flight dynamics model, environment states, and aerodynamics of the plane enabling user to customize any specific task in order to build intelligent decision making (control) systems via RL. The software also allows deployment of bot aircrafts and development of multi-agent tasks. This way, multiple groups of aircrafts can be configured to be competitive or cooperative agents to perform complicated tasks including Dog Fight. During the experiments, we carried out training for two different scenarios: navigating to a designated location and within visual range (WVR) combat, shortly Dog Fight. Using Deep Reinforcement Learning techniques for both scenarios, we were able to train competent agents that exhibit human-like behaviours. Based on this results, it is confirmed that Harfang3D Dog-Fight Sandbox can be utilized as a 3D realistic RL research platform.

下载PDF全文

下载文献需遵守相关版权规定

论文标题