论文标题
边界内容图神经网络用于时间动作提案生成
Boundary Content Graph Neural Network for Temporal Action Proposal Generation
论文作者
论文摘要
时间行动提案生成在视频动作理解中起着重要作用,这需要精确定位高质量的动作内容。但是,生成具有精确界限和高质量动作内容的时间提案非常具有挑战性。为了解决这个问题,我们提出了一个新颖的边界内容图神经网络(BC-GNN),以建模图神经网络的时间提案的边界和动作内容之间的有见地的关系。在BC-GNN中,分别将时间提案的边界和内容视为图形神经网络的节点和边缘,它们是自发链接的。然后提出了一种新颖的图计算操作来更新边缘和节点的特征。之后,使用一个更新的边缘和两个节点来预测边界概率和内容置信度得分,将合并以生成最终的高质量建议。实验是在两个主流数据集上进行的:ActivityNet-1.3和Thumos14。在没有铃铛和哨子的情况下,BC-GNN在时间动作提案和时间动作检测任务中都优于先前的最先进方法。
Temporal action proposal generation plays an important role in video action understanding, which requires localizing high-quality action content precisely. However, generating temporal proposals with both precise boundaries and high-quality action content is extremely challenging. To address this issue, we propose a novel Boundary Content Graph Neural Network (BC-GNN) to model the insightful relations between the boundary and action content of temporal proposals by the graph neural networks. In BC-GNN, the boundaries and content of temporal proposals are taken as the nodes and edges of the graph neural network, respectively, where they are spontaneously linked. Then a novel graph computation operation is proposed to update features of edges and nodes. After that, one updated edge and two nodes it connects are used to predict boundary probabilities and content confidence score, which will be combined to generate a final high-quality proposal. Experiments are conducted on two mainstream datasets: ActivityNet-1.3 and THUMOS14. Without the bells and whistles, BC-GNN outperforms previous state-of-the-art methods in both temporal action proposal and temporal action detection tasks.