通过闭环控制自我修复强大的神经网络

论文标题

通过闭环控制自我修复强大的神经网络

Self-Healing Robust Neural Networks via Closed-Loop Control

论文作者

Chen, Zhuotong, Li, Qianxiao, Zhang, Zheng

论文摘要

尽管神经网络的应用广泛，但人们对其脆弱性问题的担忧越来越多。尽管已经开发了许多攻击和防御技术，但这项工作从一个新角度研究了鲁棒性问题：我们可以设计一个可以自动检测和解决脆弱性问题的自我修复神经网络吗？典型的自我修复机制是人体的免疫系统。这种受生物学启发的想法已用于许多工程设计中，但很少在深度学习中进行研究。本文考虑了神经网络的训练后自我修复，并提出了闭环控制公式，以自动检测和修复由各种攻击或扰动引起的错误。我们提供基于保证金的分析，以解释该公式如何改善分类器的鲁棒性。为了加快拟议的自我修复网络的推断，我们通过改善Pontryagin最大基于原理的求解器来解决控制问题。最后，我们提出了具有非线性激活函数的神经网络拟议框架的错误估计。我们验证了几个网络体系结构的性能，以防止各种扰动。由于自我修复方法不需要有关数据扰动/攻击的A-Priori信息，因此它可以处理一类不可预见的扰动。

Despite the wide applications of neural networks, there have been increasing concerns about their vulnerability issue. While numerous attack and defense techniques have been developed, this work investigates the robustness issue from a new angle: can we design a self-healing neural network that can automatically detect and fix the vulnerability issue by itself? A typical self-healing mechanism is the immune system of a human body. This biology-inspired idea has been used in many engineering designs but is rarely investigated in deep learning. This paper considers the post-training self-healing of a neural network, and proposes a closed-loop control formulation to automatically detect and fix the errors caused by various attacks or perturbations. We provide a margin-based analysis to explain how this formulation can improve the robustness of a classifier. To speed up the inference of the proposed self-healing network, we solve the control problem via improving the Pontryagin Maximum Principle-based solver. Lastly, we present an error estimation of the proposed framework for neural networks with nonlinear activation functions. We validate the performance on several network architectures against various perturbations. Since the self-healing method does not need a-priori information about data perturbations/attacks, it can handle a broad class of unforeseen perturbations.

下载PDF全文

下载文献需遵守相关版权规定

论文标题