AVDDPG：适用于自动排控制的联合加固学习

论文标题

AVDDPG：适用于自动排控制的联合加固学习

AVDDPG: Federated reinforcement learning applied to autonomous platoon control

论文作者

Boin, Christian, Lei, Lei, Yang, Simon X.

论文摘要

自2016年以来，联邦学习（FL）一直是人工智能（AI）研究社区中不断发展的讨论话题。 FL的应用导致了联邦加固学习（FRL）的发展和研究。关于适用于自动驾驶汽车（AV）排的FRL主题的作品很少。此外，大多数FRL作品都选择一种单一聚合方法（通常是重量或梯度聚集）。我们通过在自定义AVONOON环境上设计和实施FRL框架来探索FRL的有效性，以作为改善AV排的一种手段。在两种情况下，研究了FRL在AV排中的应用：（1）平面间FRL（Inter-FRL），其中FRL在不同的排量上应用于AVS；（2）platoon frl（frl），其中FRL应用于单个排内的AV。使用梯度和重量聚集，将FRL和FRL间的FRL应用于自定义的AV平台环境，以观察FRL相对于未经FRL训练的AV排周围环境，可以观察FRL对AV排的效果。得出的结论是，使用重量聚集（FRLWA）的FRL为控制AVENOON提供了最佳性能。此外，我们发现FRL中的重量聚集用于AV，相对于梯度聚集，性能可提高性能。最后，针对FRLWA内部的性能分析，而没有FRL的排量环境，对于长度为3、4和5车的排。可以得出结论，弗拉瓦内部的表现在很大程度上超过了没有FRL的训练的培训环境。

Since 2016 federated learning (FL) has been an evolving topic of discussion in the artificial intelligence (AI) research community. Applications of FL led to the development and study of federated reinforcement learning (FRL). Few works exist on the topic of FRL applied to autonomous vehicle (AV) platoons. In addition, most FRL works choose a single aggregation method (usually weight or gradient aggregation). We explore FRL's effectiveness as a means to improve AV platooning by designing and implementing an FRL framework atop a custom AV platoon environment. The application of FRL in AV platooning is studied under two scenarios: (1) Inter-platoon FRL (Inter-FRL) where FRL is applied to AVs across different platoons; (2) Intra-platoon FRL (Intra-FRL) where FRL is applied to AVs within a single platoon. Both Inter-FRL and Intra-FRL are applied to a custom AV platooning environment using both gradient and weight aggregation to observe the performance effects FRL can have on AV platoons relative to an AV platooning environment trained without FRL. It is concluded that Intra-FRL using weight aggregation (Intra-FRLWA) provides the best performance for controlling an AV platoon. In addition, we found that weight aggregation in FRL for AV platooning provides increases in performance relative to gradient aggregation. Finally, a performance analysis is conducted for Intra-FRLWA versus a platooning environment without FRL for platoons of length 3, 4 and 5 vehicles. It is concluded that Intra-FRLWA largely out-performs the platooning environment that is trained without FRL.

下载PDF全文

下载文献需遵守相关版权规定

论文标题