论文标题
Foley声音综合挑战的建议
A Proposal for Foley Sound Synthesis Challenge
论文作者
论文摘要
“ Foley”是指在后期制作过程中添加到多媒体中的声音效果,以增强其感知的声学特性,例如,通过模拟屏幕上的脚步声,环境环境声音或可见物体的声音。尽管Foley是Foley Artists制作的,但人们对自动或机器辅助技术的兴趣越来越多,基于声音综合和生成模型的最新进展。为了促进对这个不断增长的研究领域的更多参与,我们提出了对自动Foley合成的挑战。通过对音频和机器学习成功挑战的案例研究,我们设定了拟议挑战的目标:对不同的Foley合成系统的严格,统一和有效评估,其总体目标是从研究社区中积极参与。我们概述了Foley声音综合挑战的详细信息和设计注意事项,包括任务定义,数据集要求和评估标准。
"Foley" refers to sound effects that are added to multimedia during post-production to enhance its perceived acoustic properties, e.g., by simulating the sounds of footsteps, ambient environmental sounds, or visible objects on the screen. While foley is traditionally produced by foley artists, there is increasing interest in automatic or machine-assisted techniques building upon recent advances in sound synthesis and generative models. To foster more participation in this growing research area, we propose a challenge for automatic foley synthesis. Through case studies on successful previous challenges in audio and machine learning, we set the goals of the proposed challenge: rigorous, unified, and efficient evaluation of different foley synthesis systems, with an overarching goal of drawing active participation from the research community. We outline the details and design considerations of a foley sound synthesis challenge, including task definition, dataset requirements, and evaluation criteria.