论文标题
Tanksworld:AI安全研究的多机构环境
TanksWorld: A Multi-Agent Environment for AI Safety Research
论文作者
论文摘要
创建能够执行复杂任务的人工智能(AI)的能力迅速超过了我们确保AI-ai-ai-Systems安全操作的能力。幸运的是,AI安全研究的景观正在响应这种不对称性,但还有很长的路要走。特别是,为说明AI安全风险而创建的最新模拟环境相对简单或专注于特定问题。因此,我们看到对AI安全研究环境的迫切需求,该环境使复杂的现实世界应用的基本方面提出了重要的意义。在这项工作中,我们介绍了AI Safety Tanksworld,作为AI安全研究的环境,具有三个基本方面:竞争性能目标,人机组合和多代理竞争。 AI安全坦克世界旨在通过提供一个软件框架来支持具有系统性能和安全目标的竞争,以加快安全多代理决策算法的进步。在进行的工作中,本文介绍了我们的研究目标和学习环境,并使用参考代码和基线绩效指标遵循未来的工作。
The ability to create artificial intelligence (AI) capable of performing complex tasks is rapidly outpacing our ability to ensure the safe and assured operation of AI-enabled systems. Fortunately, a landscape of AI safety research is emerging in response to this asymmetry and yet there is a long way to go. In particular, recent simulation environments created to illustrate AI safety risks are relatively simple or narrowly-focused on a particular issue. Hence, we see a critical need for AI safety research environments that abstract essential aspects of complex real-world applications. In this work, we introduce the AI safety TanksWorld as an environment for AI safety research with three essential aspects: competing performance objectives, human-machine teaming, and multi-agent competition. The AI safety TanksWorld aims to accelerate the advancement of safe multi-agent decision-making algorithms by providing a software framework to support competitions with both system performance and safety objectives. As a work in progress, this paper introduces our research objectives and learning environment with reference code and baseline performance metrics to follow in a future work.