Paper title

Blind estimation of room acoustic parameters from speech signals based on extended model of room impulse response

Authors

Wang, Lijun; Duangpummet, Suradej; Unoki, Masashi

Abstract

The speech transmission index (STI) and room acoustic parameters (RAPs) such as reverberation time and early decay time, all of which are derived from a room impulse response (RIR), are essential for assessing speech transmission and predicting listening difficulty in a sound field. Because an RIR is difficult to measure in spaces that are occupied in daily use, simultaneous blind estimation of the STI and RAPs is an important and challenging problem. This paper proposes a deterministic method for blindly estimating the STI and five RAPs on the basis of a stochastic RIR model that approximates the unknown RIR. The proposed method formulates the temporal power envelope of a reverberant speech signal to obtain the optimal parameters of the RIR model. Simulations were conducted to evaluate the STI and RAPs estimated from observed reverberant speech signals. The root-mean-square errors between the estimated and ground-truth values were used to compare the proposed method with the previous method. The results showed that the proposed method can estimate the STI and RAPs effectively without any training.
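
To make the relationship between a stochastic RIR model and two of the RAPs concrete, the following is a minimal sketch, not the paper's method. It assumes the classical Schroeder model (exponentially decaying Gaussian noise) rather than the extended model named in the title, and it reads reverberation time and early decay time directly from a known synthetic RIR instead of estimating them blindly from speech. All function names and parameter values are illustrative assumptions.

```python
import math
import random

def schroeder_rir(t60, fs, duration, seed=0):
    """Assumed classical Schroeder model: Gaussian noise with an exponential
    envelope whose power falls by 60 dB after t60 seconds."""
    rng = random.Random(seed)
    decay = 3.0 * math.log(10.0) / t60  # about 6.9 / t60
    n = int(duration * fs)
    return [math.exp(-decay * i / fs) * rng.gauss(0.0, 1.0) for i in range(n)]

def energy_decay_curve_db(rir):
    """Schroeder backward integration of the squared RIR, in dB re. total energy."""
    total = 0.0
    edc = [0.0] * len(rir)
    for i in range(len(rir) - 1, -1, -1):
        total += rir[i] ** 2
        edc[i] = total
    ref = edc[0]
    return [10.0 * math.log10(max(e / ref, 1e-30)) for e in edc]

def decay_time(edc_db, fs, hi_db, lo_db):
    """Least-squares line fit to the decay curve between hi_db and lo_db,
    extrapolated to a 60 dB drop."""
    pts = [(i / fs, d) for i, d in enumerate(edc_db) if lo_db <= d <= hi_db]
    n = len(pts)
    mean_t = sum(t for t, _ in pts) / n
    mean_d = sum(d for _, d in pts) / n
    num = sum((t - mean_t) * (d - mean_d) for t, d in pts)
    den = sum((t - mean_t) ** 2 for t, _ in pts)
    slope = num / den  # dB per second, negative for a decaying curve
    return -60.0 / slope

fs = 16000          # sampling rate in Hz (illustrative)
true_t60 = 0.5      # reverberation time in seconds (illustrative)

rir = schroeder_rir(true_t60, fs, duration=1.0)
edc_db = energy_decay_curve_db(rir)

# Reverberation time from the -5 dB to -25 dB range (often called T20).
t60_est = decay_time(edc_db, fs, hi_db=-5.0, lo_db=-25.0)
# Early decay time from the first 10 dB of decay.
edt_est = decay_time(edc_db, fs, hi_db=0.0, lo_db=-10.0)

print(f"T60 estimate: {t60_est:.3f} s, EDT estimate: {edt_est:.3f} s")
```

For a purely exponential decay such as this one, both values should come out close to the reverberation time used to generate the RIR. In measured rooms the two generally differ, which is one reason they are treated as separate parameters.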
