论文标题
使用人类姿势和武器外观的手枪检测
Handgun detection using combined human pose and weapon appearance
论文作者
论文摘要
如今,闭路电视(CCTV)系统对于防止安全威胁或危险情况至关重要,在这种情况下,早期发现至关重要。新颖的基于深度学习的方法已允许开发自动武器探测器,并具有令人鼓舞的结果。但是,这些方法主要基于视觉武器外观。对于手枪,身体姿势可能是一个有用的提示,尤其是在枪支几乎看不见的情况下。在这项工作中,提出了一种新颖的方法,可以将武器外观和人类姿势信息结合在一起。首先,估计姿势关键点可以提取手部区域并生成二进制姿势图像,即模型输入。然后,每个输入都在不同的子网络中处理,并合并以产生手枪边界框。结果表明,组合模型改善了最新技术的手枪检测状态,从4.23点到18.9个AP点,比最佳先前方法多。
Closed-circuit television (CCTV) systems are essential nowadays to prevent security threats or dangerous situations, in which early detection is crucial. Novel deep learning-based methods have allowed to develop automatic weapon detectors with promising results. However, these approaches are mainly based on visual weapon appearance only. For handguns, body pose may be a useful cue, especially in cases where the gun is barely visible. In this work, a novel method is proposed to combine, in a single architecture, both weapon appearance and human pose information. First, pose keypoints are estimated to extract hand regions and generate binary pose images, which are the model inputs. Then, each input is processed in different subnetworks and combined to produce the handgun bounding box. Results obtained show that the combined model improves the handgun detection state of the art, achieving from 4.23 to 18.9 AP points more than the best previous approach.