论文标题
使用经常性注意网络对可穿戴传感器上的多功能定位和识别的顺序弱标记
Sequential Weakly Labeled Multi-Activity Localization and Recognition on Wearable Sensors using Recurrent Attention Networks
论文作者
论文摘要
随着可穿戴设备(例如智能手机)的受欢迎程度和开发,基于传感器的人类活动识别(HAR)已成为人类计算机交互和无处不在的计算的关键研究领域。深度学习的出现导致HAR研究的最新转变,这需要大量严格标记的数据。与视频数据相比,从加速度计或陀螺仪记录的活动数据通常更难解释和细分。最近,提出了几种注意机制来处理弱标记的人类活动数据,这些数据不需要准确的数据注释。但是,这些基于注意力的模型只能处理弱标记的数据集,其样本包括一个目标活动,从而限制了效率和实用性。在本文中,我们提出了一个经常性的注意网络(RAN),以处理顺序弱标记的多活动性识别和位置任务。该模型可以在一个样本的多个活动上反复执行注意步骤,每个步骤都对应于当前的集中活动。 RAN模型的有效性在收集的顺序弱标记的多活动数据集和其他两个公共数据集上进行了验证。实验结果表明,我们的RAN模型可以从粗粒顺序弱标记中同时推断多活性类型,并确定每个目标活动的特定位置,仅知道长序列中包含哪些类型的活动。它将大大减轻手动标签的负担。我们的工作代码可在https://github.com/kenncoder7/ran上找到。
With the popularity and development of the wearable devices such as smartphones, human activity recognition (HAR) based on sensors has become as a key research area in human computer interaction and ubiquitous computing. The emergence of deep learning leads to a recent shift in the research of HAR, which requires massive strictly labeled data. In comparison with video data, activity data recorded from accelerometer or gyroscope is often more difficult to interpret and segment. Recently, several attention mechanisms are proposed to handle the weakly labeled human activity data, which do not require accurate data annotation. However, these attention-based models can only handle the weakly labeled dataset whose sample includes one target activity, as a result it limits efficiency and practicality. In the paper, we propose a recurrent attention networks (RAN) to handle sequential weakly labeled multi-activity recognition and location tasks. The model can repeatedly perform steps of attention on multiple activities of one sample and each step is corresponding to the current focused activity. The effectiveness of the RAN model is validated on a collected sequential weakly labeled multi-activity dataset and the other two public datasets. The experiment results show that our RAN model can simultaneously infer multi-activity types from the coarse-grained sequential weak labels and determine specific locations of every target activity with only knowledge of which types of activities contained in the long sequence. It will greatly reduce the burden of manual labeling. The code of our work is available at https://github.com/KennCoder7/RAN.