同时优化黑盒对抗补丁攻击的扰动和位置

论文标题

同时优化黑盒对抗补丁攻击的扰动和位置

Simultaneously Optimizing Perturbations and Positions for Black-box Adversarial Patch Attacks

论文作者

Wei, Xingxing, Guo, Ying, Yu, Jie, Zhang, Bo

论文摘要

对抗斑块是现实世界中对抗性攻击的一种重要形式，它为深层神经网络的稳健性带来了严重的风险。以前的方法通过在固定粘贴位置时优化其扰动值或在修复补丁的内容时对位置进行操作，从而生成对抗补丁。这表明这些位置和扰动对对抗性攻击都很重要。为此，在本文中，我们提出了一种新颖的方法，可以同时优化对抗贴片的位置和扰动，从而在黑盒子设置中获得很高的攻击成功率。从技术上讲，我们将贴剂的位置，预设的超参数确定为变量，以确定贴片的扰动，并利用加固学习框架同时解决基于从目标模型获得的最佳解决方案的最佳解决方案，并使用少量查询。对面部识别（FR）任务进行了广泛的实验，并且四个代表性FR模型的结果表明，我们的方法可以显着提高攻击成功率和查询效率。此外，有关商业FR服务和物理环境的实验证实了其实际的应用价值。我们还将方法扩展到流量标志识别任务，以验证其概括能力。

Adversarial patch is an important form of real-world adversarial attack that brings serious risks to the robustness of deep neural networks. Previous methods generate adversarial patches by either optimizing their perturbation values while fixing the pasting position or manipulating the position while fixing the patch's content. This reveals that the positions and perturbations are both important to the adversarial attack. For that, in this paper, we propose a novel method to simultaneously optimize the position and perturbation for an adversarial patch, and thus obtain a high attack success rate in the black-box setting. Technically, we regard the patch's position, the pre-designed hyper-parameters to determine the patch's perturbations as the variables, and utilize the reinforcement learning framework to simultaneously solve for the optimal solution based on the rewards obtained from the target model with a small number of queries. Extensive experiments are conducted on the Face Recognition (FR) task, and results on four representative FR models show that our method can significantly improve the attack success rate and query efficiency. Besides, experiments on the commercial FR service and physical environments confirm its practical application value. We also extend our method to the traffic sign recognition task to verify its generalization ability.

下载PDF全文

下载文献需遵守相关版权规定

论文标题