论文标题
发现敌人:用于枪声分类和本地化的游戏内枪声数据集
Enemy Spotted: in-game gun sound dataset for gunshot classification and localization
论文作者
论文摘要
最近,由于没有声音分类和本地化任务的域知识,基于深度学习的方法引起了人们的关注。但是,现有数据集中缺乏枪支声音是实施支持系统来通过利用深度学习模型从枪击中发现罪犯的主要障碍。由于枪声的发生是罕见且无法预测的,因此在现实世界中收集枪声是不切实际的。另外,可以从旨在模仿现实世界战争的FPS游戏中获得枪声。最近的FPS游戏提供了一个现实的环境,我们可以在模拟危险情况的同时安全地收集枪声数据。通过利用游戏环境的优势,我们构建了一个枪声数据集,即BGG,用于枪支分类和枪击定位任务。 BGG数据集由37种不同类型的枪支,距离以及声音源和接收器之间的方向组成。我们仔细验证游戏中的枪声数据具有足够的信息来通过训练BGG数据集上的几个声音分类和本地化基线来识别枪声的位置和类型。之后,我们证明可以通过使用BGG数据集来增强现实世界枪支分类和本地化任务的准确性。
Recently, deep learning-based methods have drawn huge attention due to their simple yet high performance without domain knowledge in sound classification and localization tasks. However, a lack of gun sounds in existing datasets has been a major obstacle to implementing a support system to spot criminals from their gunshots by leveraging deep learning models. Since the occurrence of gunshot is rare and unpredictable, it is impractical to collect gun sounds in the real world. As an alternative, gun sounds can be obtained from an FPS game that is designed to mimic real-world warfare. The recent FPS game offers a realistic environment where we can safely collect gunshot data while simulating even dangerous situations. By exploiting the advantage of the game environment, we construct a gunshot dataset, namely BGG, for the firearm classification and gunshot localization tasks. The BGG dataset consists of 37 different types of firearms, distances, and directions between the sound source and a receiver. We carefully verify that the in-game gunshot data has sufficient information to identify the location and type of gunshots by training several sound classification and localization baselines on the BGG dataset. Afterward, we demonstrate that the accuracy of real-world firearm classification and localization tasks can be enhanced by utilizing the BGG dataset.