使用预训练的阿尔茨海默氏病检测来利用及时学习

论文标题

使用预训练的阿尔茨海默氏病检测来利用及时学习

Exploiting prompt learning with pre-trained language models for Alzheimer's Disease detection

论文作者

Wang, Yi, Deng, Jiajun, Wang, Tianzi, Zheng, Bo, Hu, Shoukang, Liu, Xunying, Meng, Helen

论文摘要

阿尔茨海默氏病（AD）的早期诊断对于促进预防性护理和延迟进一步发展至关重要。基于语音的自动广告筛选系统为其他临床筛查技术提供了一种非侵入性，更可扩展的替代方案。预先训练的语言模型（PLM）（例如BERT）产生的文本嵌入功能被广泛用于此类系统中。但是，PLM域微调通常是基于与后端广告检测任务不一致的蒙版单词或句子预测成本。为此，本文研究了对PLM的及时基于迅速的微调，这些PLM始终使用AD分类错误作为训练目标函数。在PLM微调过程中，基于犹豫或暂停填充令牌频率基于犹豫或暂停填充令牌的频率特征将进一步纳入及时短语中。使用不同的PLM（BERT和ROBERTA）或具有不同微调范式的系统（常规掩盖语言建模微调和基于及时的基于及时的微调）的系统之间的基于决策投票的组合。在15个实验运行范围内的平均偏差和准确分数之间的最高分数作为AD检测系统的性能测量。平均检测准确性为84.20％（STD 2.09％，最佳87.5％）和82.64％（具有STD 4.0％，最佳89.58％）的平均检测精度分别使用ADRESS20测试集中的ASR语音转录分别获得了48位老年扬声器。

Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care and to delay further progression. Speech based automatic AD screening systems provide a non-intrusive and more scalable alternative to other clinical screening techniques. Textual embedding features produced by pre-trained language models (PLMs) such as BERT are widely used in such systems. However, PLM domain fine-tuning is commonly based on the masked word or sentence prediction costs that are inconsistent with the back-end AD detection task. To this end, this paper investigates the use of prompt-based fine-tuning of PLMs that consistently uses AD classification errors as the training objective function. Disfluency features based on hesitation or pause filler token frequencies are further incorporated into prompt phrases during PLM fine-tuning. The decision voting based combination among systems using different PLMs (BERT and RoBERTa) or systems with different fine-tuning paradigms (conventional masked-language modelling fine-tuning and prompt-based fine-tuning) is further applied. Mean, standard deviation and the maximum among accuracy scores over 15 experiment runs are adopted as performance measurements for the AD detection system. Mean detection accuracy of 84.20% (with std 2.09%, best 87.5%) and 82.64% (with std 4.0%, best 89.58%) were obtained using manual and ASR speech transcripts respectively on the ADReSS20 test set consisting of 48 elderly speakers.

下载PDF全文

下载文献需遵守相关版权规定

论文标题