Paper title
INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge
Paper authors
Paper abstract
Audio Packet Loss Concealment (PLC) is the hiding of gaps in audio streams caused by data transmission failures in packet-switched networks. This is a common problem, and one of increasing importance as end-to-end VoIP telephony and teleconference systems become the default and ever more widely used form of communication in business as well as personal use. This paper presents the INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge. We first give an overview of the PLC problem and introduce some classical approaches to PLC as well as recent work. We then present the open-source dataset released as part of this challenge, along with the evaluation methods and metrics used to determine the winner. We also briefly introduce PLCMOS, a novel data-driven metric that can be used to quickly evaluate the performance of PLC systems. Finally, we present the results of the INTERSPEECH 2022 Audio Deep PLC Challenge and provide a summary of important takeaways.