Paper Title

Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers

Authors

Ziyue Xiang, Paolo Bestagini, Stefano Tubaro, Edward J. Delp

Abstract

Audio signals are often stored and transmitted in compressed formats. Among the many available audio compression schemes, MPEG-1 Audio Layer III (MP3) is very popular and widely used. Because MP3 is lossy, it leaves characteristic traces in the compressed audio that can be used forensically to expose the past history of an audio file. In this paper, we consider the scenario in which an audio signal is manipulated by temporally splicing compressed and uncompressed audio signals. We propose a method based on transformer networks to find the temporal location of the splices. Our method identifies which temporal portions of an audio signal have undergone single or multiple compression at the temporal frame level, which is the smallest temporal unit of MP3 compression. We tested our method on a dataset of 486,743 MP3 audio clips. Compared with existing methods, our method achieved higher performance and demonstrated robustness with respect to different MP3 data.
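To make the frame-level output concrete, the following is a minimal sketch of how per-frame labels could be turned into splice locations. It is not the authors' method: the label sequence, the function name `splice_points`, and the default sample rate are hypothetical, and the sketch only assumes the standard MPEG-1 Layer III frame length of 1152 PCM samples. It ignores encoder and decoder delay, so the reported times are approximate.

```python
from typing import List, Tuple

# MPEG-1 Layer III encodes 1152 PCM samples per frame (two granules of 576).
SAMPLES_PER_FRAME = 1152


def splice_points(frame_labels: List[int], sample_rate: int = 44100) -> List[Tuple[int, float]]:
    """Return (frame_index, time_in_seconds) wherever the per-frame label changes.

    frame_labels is a hypothetical per-frame prediction sequence:
    0 = single compression, 1 = multiple compression.
    """
    points = []
    for i in range(1, len(frame_labels)):
        if frame_labels[i] != frame_labels[i - 1]:
            # Frame i starts at sample i * SAMPLES_PER_FRAME.
            points.append((i, i * SAMPLES_PER_FRAME / sample_rate))
    return points


if __name__ == "__main__":
    # Illustrative labels only; a real detector would produce these.
    labels = [0, 0, 0, 1, 1, 0]
    for frame, seconds in splice_points(labels):
        print(f"label change at frame {frame} (about {seconds:.4f} s)")
```

At a 44.1 kHz sample rate one frame spans roughly 26 ms, which is the finest temporal resolution such a frame-level localization can offer.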
