论文标题
对抗训练的稳健性,隐私和概括
Robustness, Privacy, and Generalization of Adversarial Training
论文作者
论文摘要
对抗性训练可以大大鲁棒性的深层神经网络抵抗对抗性攻击。但是,一些作品表明,对抗训练可能包括保护隐私和概括能力。本文建立并量化了从理论和经验方面的对抗性培训中的对抗性培训中的隐私性权衡和概括性权衡。我们首先定义一个概念,{\ it robustified强度}测量对抗训练算法的鲁棒性。通过渐近一致的经验估计量,{\ IT经验强度强度}可以在经验上近似。基于强大的强度,我们证明(1)对抗训练是$(\ varepsilon,δ)$ - 差异私有,其中差异隐私的幅度与稳健强度具有正相关; (2)对抗训练的概括可能会受到$ \ Mathcal o(\ sqrt {\ log n}/n)$ on-plavery Bound的限制,并且$ \ Mathcal O(1/\ sqrt {n})$高验证能力限制,它们都具有良好的量强度。此外,我们的概括界限并不明确依赖于参数大小,这些参数大小在深度学习中会非常大。在标准数据集(CIFAR-10和CIFAR-100)上进行的系统实验与我们的理论完全一致。源代码软件包可在\ url {https://github.com/fshp971/rpg}上找到。
Adversarial training can considerably robustify deep neural networks to resist adversarial attacks. However, some works suggested that adversarial training might comprise the privacy-preserving and generalization abilities. This paper establishes and quantifies the privacy-robustness trade-off and generalization-robustness trade-off in adversarial training from both theoretical and empirical aspects. We first define a notion, {\it robustified intensity} to measure the robustness of an adversarial training algorithm. This measure can be approximate empirically by an asymptotically consistent empirical estimator, {\it empirical robustified intensity}. Based on the robustified intensity, we prove that (1) adversarial training is $(\varepsilon, δ)$-differentially private, where the magnitude of the differential privacy has a positive correlation with the robustified intensity; and (2) the generalization error of adversarial training can be upper bounded by an $\mathcal O(\sqrt{\log N}/N)$ on-average bound and an $\mathcal O(1/\sqrt{N})$ high-probability bound, both of which have positive correlations with the robustified intensity. Additionally, our generalization bounds do not explicitly rely on the parameter size which would be prohibitively large in deep learning. Systematic experiments on standard datasets, CIFAR-10 and CIFAR-100, are in full agreement with our theories. The source code package is available at \url{https://github.com/fshp971/RPG}.