论文标题
Glocalnet:班级意识的长期人类运动合成
GlocalNet: Class-aware Long-term Human Motion Synthesis
论文作者
论文摘要
Synthesis of long-term human motion skeleton sequences is essential to aid human-centric video generation with potential applications in Augmented Reality, 3D character animations, pedestrian trajectory prediction, etc. Long-term human motion synthesis is a challenging task due to multiple factors like, long-term temporal dependencies among poses, cyclic repetition across poses, bi-directional and multi-scale dependencies among poses, variable speed of actions, and a large as well as跨多种类别/人类活动类型的时间姿势变化的部分重叠空间。本文旨在解决这些挑战,以综合各种人类活动类别(> 50)的长期(> 6000毫秒)人类运动轨迹。我们提出了一种两阶段的活动生成方法来实现这一目标,其中第一阶段通过学习综合稀疏运动轨迹来学习活动序列中的长期全球姿势依赖性,而第二阶段则解决了以第一阶段的输出为生的密集运动轨迹的产生。我们使用在公开可用数据集上使用各种定量评估指标来证明所提出的方法比SOTA方法的优越性。
Synthesis of long-term human motion skeleton sequences is essential to aid human-centric video generation with potential applications in Augmented Reality, 3D character animations, pedestrian trajectory prediction, etc. Long-term human motion synthesis is a challenging task due to multiple factors like, long-term temporal dependencies among poses, cyclic repetition across poses, bi-directional and multi-scale dependencies among poses, variable speed of actions, and a large as well as partially overlapping space of temporal pose variations across multiple class/types of human activities. This paper aims to address these challenges to synthesize a long-term (> 6000 ms) human motion trajectory across a large variety of human activity classes (>50). We propose a two-stage activity generation method to achieve this goal, where the first stage deals with learning the long-term global pose dependencies in activity sequences by learning to synthesize a sparse motion trajectory while the second stage addresses the generation of dense motion trajectories taking the output of the first stage. We demonstrate the superiority of the proposed method over SOTA methods using various quantitative evaluation metrics on publicly available datasets.