Data Augmentation for Dementia Detection in Spoken Language

Paper title

Data Augmentation for Dementia Detection in Spoken Language

Authors

Anna Hlédiková, Dominika Woszczyk, Alican Akman, Soteris Demetriou, Björn Schuller

Abstract

Dementia is a growing problem as our society ages, and detection methods are often invasive and expensive. Recent deep-learning techniques can offer a faster diagnosis and have shown promising results. However, they require large amounts of labelled data, which are not easily available for the task of dementia detection. One effective solution to sparse-data problems is data augmentation, though the exact methods need to be selected carefully. To date, there has been no empirical study of data augmentation on Alzheimer's disease (AD) datasets for NLP and speech processing. In this work, we investigate data augmentation techniques for the task of AD detection and perform an empirical evaluation of the different approaches on two kinds of models for both the text and audio domains. We use a transformer-based model for both domains, and SVM and Random Forest models for the text and audio domains, respectively. We generate additional samples using traditional as well as deep-learning-based methods and show that data augmentation improves performance for both the text- and audio-based models. These results are comparable to state-of-the-art results on the popular ADReSS set that were obtained with carefully crafted architectures and features.
