积极的任务随机化：通过无监督的多样化和可行的任务来学习强大的技能

论文标题

积极的任务随机化：通过无监督的多样化和可行的任务来学习强大的技能

Active Task Randomization: Learning Robust Skills via Unsupervised Generation of Diverse and Feasible Tasks

论文作者

Fang, Kuan, Migimatsu, Toki, Mandlekar, Ajay, Fei-Fei, Li, Bohg, Jeannette

论文摘要

解决现实世界的操纵任务要求机器人具有适用于各种情况的技能的曲目。当使用基于学习的方法获得此类技能时，主要的挑战是获得涵盖任务多种多样且可行的变化的培训数据，这通常需要非平凡的体力劳动和领域知识。在这项工作中，我们引入了主动任务随机化（ATR），这种方法通过无监督的培训任务来学习强大的技能。 ATR选择合适的任务，包括最初的环境状态和操纵目标，用于通过平衡任务的多样性和可行性来学习强大的技能。我们建议通过共同学习紧凑的任务表示来预测任务多样性和可行性。然后，使用基于图的参数化在模拟中生成所选任务。这些培训任务的积极选择使我们可以在测试时稳健地处理各种物体和安排的技能政策。我们证明，任务计划者可以根据视觉输入来解决看不见的顺序操纵问题，可以通过视觉输入来构成学习的技能。与基线方法相比，ATR可以在单步和顺序操纵任务中获得较高的成功率。

Solving real-world manipulation tasks requires robots to have a repertoire of skills applicable to a wide range of circumstances. When using learning-based methods to acquire such skills, the key challenge is to obtain training data that covers diverse and feasible variations of the task, which often requires non-trivial manual labor and domain knowledge. In this work, we introduce Active Task Randomization (ATR), an approach that learns robust skills through the unsupervised generation of training tasks. ATR selects suitable tasks, which consist of an initial environment state and manipulation goal, for learning robust skills by balancing the diversity and feasibility of the tasks. We propose to predict task diversity and feasibility by jointly learning a compact task representation. The selected tasks are then procedurally generated in simulation using graph-based parameterization. The active selection of these training tasks enables skill policies trained with our framework to robustly handle a diverse range of objects and arrangements at test time. We demonstrate that the learned skills can be composed by a task planner to solve unseen sequential manipulation problems based on visual inputs. Compared to baseline methods, ATR can achieve superior success rates in single-step and sequential manipulation tasks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题