分析EEG频率带的构想语音识别

论文标题

分析EEG频率带的构想语音识别

Analysis of EEG frequency bands for Envisioned Speech Recognition

论文作者

Tripathi, Ayush

论文摘要

自动语音识别（ASR）界面的使用已在日常生活中越来越流行，以用于电子设备的交互和控制。当前正在使用的接口对于各种用户，例如患有言语障碍，锁定综合征，瘫痪或具有最大隐私要求的人的用户不可行。在这种情况下，可以使用脑电图（EEG）信号来识别所设想的语音的接口可能会带来很大的好处。过去针对此问题的各种工作都是在过去完成的。然而，在识别脑电图信号的频带（$δ，θ，α，β，γ$）方面的工作有限，这有助于构想的语音识别。因此，在这项工作中，我们旨在分析从大脑的不同裂片获得的不同脑电频段和信号的重要性，以及它们对认识到设想的语音的贡献。从不同叶和对不同频带过滤的带通滤波的信号被送入具有卷积神经网络（CNN）和长短期记忆（LSTM）的时空深度学习体系结构。在包含三个分类任务的公开数据集中评估性能 - 数字，角色和图像。我们获得了分别为$ 85.93 \％$，$ 87.27 \％$和$ 87.51 \％$的分类准确性。该实现的代码已在https://github.com/ayushayt/imaginedspeechrecognition上提供。

The use of Automatic speech recognition (ASR) interfaces have become increasingly popular in daily life for use in interaction and control of electronic devices. The interfaces currently being used are not feasible for a variety of users such as those suffering from a speech disorder, locked-in syndrome, paralysis or people with utmost privacy requirements. In such cases, an interface that can identify envisioned speech using electroencephalogram (EEG) signals can be of great benefit. Various works targeting this problem have been done in the past. However, there has been limited work in identifying the frequency bands ($δ, θ, α, β, γ$) of the EEG signal that contribute towards envisioned speech recognition. Therefore, in this work, we aim to analyze the significance of different EEG frequency bands and signals obtained from different lobes of the brain and their contribution towards recognizing envisioned speech. Signals obtained from different lobes and bandpass filtered for different frequency bands are fed to a spatio-temporal deep learning architecture with Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM). The performance is evaluated on a publicly available dataset comprising of three classification tasks - digit, character and images. We obtain a classification accuracy of $85.93\%$, $87.27\%$ and $87.51\%$ for the three tasks respectively. The code for the implementation has been made available at https://github.com/ayushayt/ImaginedSpeechRecognition.

下载PDF全文

下载文献需遵守相关版权规定

论文标题