论文标题

迈向听觉刺激的感知距离度量

Towards a perceptual distance metric for auditory stimuli

论文作者

Oh, Sarah, Bowen, Elijah FW, Rodriguez, Antonio, Sowinski, Damian, Childers, Eva, Brown, Annemarie, Ray, Laura, Granger, Richard

论文摘要

尽管感觉刺激之间的感知(dis)相似性似乎类似于距离,但测量听觉刺激的矢量表示之间的欧几里得距离是主观差异的估计量很差。在听力中,非线性响应模式,刺激成分之间的相互作用,时间效应和自上而下的调制改变了传入的频域刺激中所包含的信息,似乎可以保留某些距离的概念,但不能保留熟悉的欧几里得空间的距离。这项工作提出,在听力过程中应用于听觉刺激的转换可以建模为函数映射刺激指向其在感知空间中的表示,从而诱导Riemannian距离度量。在主观的听力实验中收集了一个数据集,其结果用于探索方法(受生物启发,数据驱动和其组合),以近似感知图。与最先进的音频质量度量相比,每项提出的措施与主观评分(r〜0.8)的相当或更强的相关性。

Although perceptual (dis)similarity between sensory stimuli seems akin to distance, measuring the Euclidean distance between vector representations of auditory stimuli is a poor estimator of subjective dissimilarity. In hearing, nonlinear response patterns, interactions between stimulus components, temporal effects, and top-down modulation transform the information contained in incoming frequency-domain stimuli in a way that seems to preserve some notion of distance, but not that of familiar Euclidean space. This work proposes that transformations applied to auditory stimuli during hearing can be modeled as a function mapping stimulus points to their representations in a perceptual space, inducing a Riemannian distance metric. A dataset was collected in a subjective listening experiment, the results of which were used to explore approaches (biologically inspired, data-driven, and combinations thereof) to approximating the perceptual map. Each of the proposed measures achieved comparable or stronger correlations with subjective ratings (r ~ 0.8) compared to state-of-the-art audio quality measures.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源