论文标题
使用分散的深钢筋学习多机器人合作对象运输
Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning
论文作者
论文摘要
由于超大和/或超重问题,对象运输对于一个机器人来说可能是一个具有挑战性的问题。多机器人系统可以利用增加驾驶功率和更灵活的配置来解决此类问题。但是,增加的个体也改变了系统的动力学,这使得控制多机器人系统更加复杂。更糟糕的是,如果整个系统都坐在集中决策单元上,则由于系统的升级,数据流很容易使数据流过载。在这项研究中,我们在多机器人系统上提出了一个分散的控制方案,每个人都配备了深Q-Network(DQN)控制器来执行超大对象运输任务。因此,DQN是一种深厚的增强学习算法,因此不需要系统动力学知识,因此,它使机器人能够通过在任务环境中的反复试验风格相互作用来学习适当的控制策略。由于类似的控制器分布在个体上,因此系统地避免了计算瓶颈。我们在一个两轮球队将超大杆载在门口的情况下,证明了这样的系统。提出的多机器人系统学习任务的抽象特征,并观察到合作行为。分散的DQN风格控制器对不确定性表现出强大的鲁棒性。此外,我们提出了一个通用度量标准,以定量评估合作。
Object transportation could be a challenging problem for a single robot due to the oversize and/or overweight issues. A multi-robot system can take the advantage of increased driving power and more flexible configuration to solve such a problem. However, increased number of individuals also changed the dynamics of the system which makes control of a multi-robot system more complicated. Even worse, if the whole system is sitting on a centralized decision making unit, the data flow could be easily overloaded due to the upscaling of the system. In this research, we propose a decentralized control scheme on a multi-robot system with each individual equipped with a deep Q-network (DQN) controller to perform an oversized object transportation task. DQN is a deep reinforcement learning algorithm thus does not require the knowledge of system dynamics, instead, it enables the robots to learn appropriate control strategies through trial-and-error style interactions within the task environment. Since analogous controllers are distributed on the individuals, the computational bottleneck is avoided systematically. We demonstrate such a system in a scenario of carrying an oversized rod through a doorway by a two-robot team. The presented multi-robot system learns abstract features of the task and cooperative behaviors are observed. The decentralized DQN-style controller is showing strong robustness against uncertainties. In addition, We propose a universal metric to assess the cooperation quantitatively.