基于共轭梯度法的多元化对抗攻击

论文标题

基于共轭梯度法的多元化对抗攻击

Diversified Adversarial Attacks based on Conjugate Gradient Method

论文作者

Yamamura, Keiichiro, Sato, Haruki, Tateiwa, Nariaki, Hata, Nozomi, Mitsutake, Toru, Oe, Issa, Ishikura, Hiroki, Fujisawa, Katsuki

论文摘要

深度学习模型容易受到对抗性例子的影响，用于产生此类例子的对抗性攻击引起了相当大的研究兴趣。尽管基于最陡峭下降的现有方法取得了很高的攻击成功率，但条件不足的问题偶尔会降低其性能。为了解决此限制，我们使用了共轭梯度（CG）方法，该方法对这种类型的问题有效，并提出了一种受CG方法启发的新型攻击算法，称为自动结合梯度（ACG）攻击。在最新的健壮模型上进行的大规模评估实验的结果表明，对于大多数模型而言，ACG能够找到比现有SOTA算法自动PGD（APGD）更少迭代的对抗性示例。我们研究了ACG和APGD之间的搜索性能差异在多元化和强化方面，并定义了一种称为多样性指数（DI）的度量，以量化多样性的程度。从使用此指数对多样性的分析中，我们表明对所提出方法的更多样化的搜索显着提高了其攻击成功率。

Deep learning models are vulnerable to adversarial examples, and adversarial attacks used to generate such examples have attracted considerable research interest. Although existing methods based on the steepest descent have achieved high attack success rates, ill-conditioned problems occasionally reduce their performance. To address this limitation, we utilize the conjugate gradient (CG) method, which is effective for this type of problem, and propose a novel attack algorithm inspired by the CG method, named the Auto Conjugate Gradient (ACG) attack. The results of large-scale evaluation experiments conducted on the latest robust models show that, for most models, ACG was able to find more adversarial examples with fewer iterations than the existing SOTA algorithm Auto-PGD (APGD). We investigated the difference in search performance between ACG and APGD in terms of diversification and intensification, and define a measure called Diversity Index (DI) to quantify the degree of diversity. From the analysis of the diversity using this index, we show that the more diverse search of the proposed method remarkably improves its attack success rate.

下载PDF全文

下载文献需遵守相关版权规定

论文标题