基于梯度的轨迹优化具有学识渊博的动力学

论文标题

基于梯度的轨迹优化具有学识渊博的动力学

Gradient-Based Trajectory Optimization With Learned Dynamics

论文作者

Sukhija, Bhavya, Köhler, Nathanael, Zamora, Miguel, Zimmermann, Simon, Curi, Sebastian, Krause, Andreas, Coros, Stelian

论文摘要

近年来，轨迹优化方法已在现实世界机器人上达到了出色的性能水平。这些方法在很大程度上依赖于动态的准确分析模型，但是物理世界的某些方面只能在有限的程度上捕获。另一种方法是利用机器学习技术从数据中学习系统的可区分动力学模型。在这项工作中，我们使用轨迹优化和模型学习在没有精确的动力学分析模型的情况下，使用机器人系统执行高度动态和复杂的任务。我们表明，从仅在两个不同的机器人上的25分钟相互作用的数据中收集的数据，神经网络可以准确地对高度非线性行为进行建模：（i）波士顿动力学点和（ii）RC Car。此外，我们使用神经网络的梯度来执行基于梯度的轨迹优化。在我们的硬件实验中，我们证明了我们所学的模型可以代表现场和无线电控制（RC）汽车的复杂动力学，并与轨迹优化方法结合使用良好的性能。

Trajectory optimization methods have achieved an exceptional level of performance on real-world robots in recent years. These methods heavily rely on accurate analytical models of the dynamics, yet some aspects of the physical world can only be captured to a limited extent. An alternative approach is to leverage machine learning techniques to learn a differentiable dynamics model of the system from data. In this work, we use trajectory optimization and model learning for performing highly dynamic and complex tasks with robotic systems in absence of accurate analytical models of the dynamics. We show that a neural network can model highly nonlinear behaviors accurately for large time horizons, from data collected in only 25 minutes of interactions on two distinct robots: (i) the Boston Dynamics Spot and an (ii) RC car. Furthermore, we use the gradients of the neural network to perform gradient-based trajectory optimization. In our hardware experiments, we demonstrate that our learned model can represent complex dynamics for both the Spot and Radio-controlled (RC) car, and gives good performance in combination with trajectory optimization methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题