论文标题
学习跨团队规模转移角色分配
Learning to Transfer Role Assignment Across Team Sizes
论文作者
论文摘要
多代理增强学习是解决需要学习代理协调的复杂任务的关键。但是,强大的协调通常会导致昂贵的大型国家行动空间进行昂贵的探索。一种有力的方法是将团队工作分解为角色,理想情况下将其分配给具有相关技能的代理商。因此,培训代理人可以适应地选择和扮演团队中新兴角色,从而使团队可以扩展到复杂的任务并迅速适应不断变化的环境。但是,这些承诺尚未通过当前基于角色的多代理增强学习方法完全实现,因为它们假设了预定义的角色结构或固定的团队规模。我们提出了一个框架,以学习跨团队规模的角色分配和转移。特别是,我们通过演示并将网络转移到较大的团队来培训小型团队的角色分配网络,这些团队继续通过与环境的互动来学习。我们证明,重新使用基于角色的信用分配结构可以促进更大的强化学习团队的学习过程,以实现需要不同角色的任务。我们的提案在丰富角色强制捕食者游戏和Starcraft II微型管理基准中的新方案中的竞争技术优于竞争技术。
Multi-agent reinforcement learning holds the key for solving complex tasks that demand the coordination of learning agents. However, strong coordination often leads to expensive exploration over the exponentially large state-action space. A powerful approach is to decompose team works into roles, which are ideally assigned to agents with the relevant skills. Training agents to adaptively choose and play emerging roles in a team thus allows the team to scale to complex tasks and quickly adapt to changing environments. These promises, however, have not been fully realised by current role-based multi-agent reinforcement learning methods as they assume either a pre-defined role structure or a fixed team size. We propose a framework to learn role assignment and transfer across team sizes. In particular, we train a role assignment network for small teams by demonstration and transfer the network to larger teams, which continue to learn through interaction with the environment. We demonstrate that re-using the role-based credit assignment structure can foster the learning process of larger reinforcement learning teams to achieve tasks requiring different roles. Our proposal outperforms competing techniques in enriched role-enforcing Prey-Predator games and in new scenarios in the StarCraft II Micro-Management benchmark.