数据增强策略在可穿戴传感器数据的知识蒸馏中的作用

论文标题

数据增强策略在可穿戴传感器数据的知识蒸馏中的作用

Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor Data

论文作者

Jeon, Eun Som, Som, Anirudh, Shukla, Ankita, Hasanaj, Kristina, Buman, Matthew P., Turaga, Pavan

论文摘要

深度神经网络被数千或数百万参数参数化，并在许多分类问题中显示出巨大的成功。但是，大量参数使将这些型号集成到智能手机和可穿戴设备等边缘设备中变得困难。为了解决这个问题，知识蒸馏（KD）已被广泛使用，它使用预先培训的高容量网络来训练一个适合边缘设备的更小的网络。在本文中，我们首次研究了将KD用于时间序列数据的适用性和挑战。 KD的成功应用需要在培训期间进行数据增强方法的具体选择。但是，尚不清楚是否存在在KD期间选择增强方法的连贯策略。在本文中，我们报告了一项详细研究的结果，该研究比较和对比基于KD的人类活动分析中的各种共同选择和一些混合数据增强策略。该领域的研究通常受到限制，因为可穿戴设备的公共领域中没有很多综合数据库。我们的研究考虑了从公共可用的小规模的数据库到来自大规模介入的人类活动和久坐行为的数据库。我们发现，KD期间的数据增强技术的选择对最终性能的影响有可变的水平，并发现最佳网络选择以及数据增强策略是特定于手头数据集的。但是，我们还以一系列建议总结，可以在数据库中提供强大的基线性能。

Deep neural networks are parametrized by several thousands or millions of parameters, and have shown tremendous success in many classification problems. However, the large number of parameters makes it difficult to integrate these models into edge devices such as smartphones and wearable devices. To address this problem, knowledge distillation (KD) has been widely employed, that uses a pre-trained high capacity network to train a much smaller network, suitable for edge devices. In this paper, for the first time, we study the applicability and challenges of using KD for time-series data for wearable devices. Successful application of KD requires specific choices of data augmentation methods during training. However, it is not yet known if there exists a coherent strategy for choosing an augmentation approach during KD. In this paper, we report the results of a detailed study that compares and contrasts various common choices and some hybrid data augmentation strategies in KD based human activity analysis. Research in this area is often limited as there are not many comprehensive databases available in the public domain from wearable devices. Our study considers databases from small scale publicly available to one derived from a large scale interventional study into human activity and sedentary behavior. We find that the choice of data augmentation techniques during KD have a variable level of impact on end performance, and find that the optimal network choice as well as data augmentation strategies are specific to a dataset at hand. However, we also conclude with a general set of recommendations that can provide a strong baseline performance across databases.

下载PDF全文

下载文献需遵守相关版权规定

论文标题