硬件安全性的强化学习：机遇，发展和挑战

论文标题

硬件安全性的强化学习：机遇，发展和挑战

Reinforcement Learning for Hardware Security: Opportunities, Developments, and Challenges

论文作者

Patnaik, Satwik, Gohil, Vasudev, Guo, Hao, Jeyavijayan, Rajendran

论文摘要

强化学习（RL）是一种机器学习范式，自主代理人通过与基础环境进行互动来学习最佳决策序列。 RL引导的工作流在解开电子设计自动化问题方面所证明的诺言鼓励硬件安全研究人员利用自动RL代理解决域特异性问题。从硬件安全性的角度来看，这种自主代理人可以在未知的对抗环境中产生最佳动作。另一方面，综合电路供应链的持续全球化迫使芯片制造成为离岸，不信任的实体，从而增加了对硬件安全性的担忧。此外，未知的对抗环境和不断增加的设计复杂性使防守者发现攻击者（又称硬件特洛伊木马）进行的微妙修改具有挑战性。在此简介中，我们概述了RL代理在检测硬件Trojans时的开发，这是最具挑战性的硬件安全问题之一。此外，我们概述了潜在的机会，并提出了应用RL解决硬件安全问题的挑战。

Reinforcement learning (RL) is a machine learning paradigm where an autonomous agent learns to make an optimal sequence of decisions by interacting with the underlying environment. The promise demonstrated by RL-guided workflows in unraveling electronic design automation problems has encouraged hardware security researchers to utilize autonomous RL agents in solving domain-specific problems. From the perspective of hardware security, such autonomous agents are appealing as they can generate optimal actions in an unknown adversarial environment. On the other hand, the continued globalization of the integrated circuit supply chain has forced chip fabrication to off-shore, untrustworthy entities, leading to increased concerns about the security of the hardware. Furthermore, the unknown adversarial environment and increasing design complexity make it challenging for defenders to detect subtle modifications made by attackers (a.k.a. hardware Trojans). In this brief, we outline the development of RL agents in detecting hardware Trojans, one of the most challenging hardware security problems. Additionally, we outline potential opportunities and enlist the challenges of applying RL to solve hardware security problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题