论文标题
随机动态信息流动跟踪游戏使用监督学习来检测高级持续威胁
Stochastic Dynamic Information Flow Tracking Game using Supervised Learning for Detecting Advanced Persistent Threats
论文作者
论文摘要
先进的持久威胁(APTS)是由成熟的攻击者组织长时间的网络攻击。尽管合适的活动是隐形的,但它们与系统组件相互作用,这些相互作用会导致信息流。动态信息流跟踪(DIFT)已被提出是使用信息流检测APT的有效方法之一。但是,使用DIFT的广泛安全性分析导致绩效开销显着提高,而fivt产生的假阳性和假阴性的高率很高。在本文中,我们将APT和DIFT之间的战略相互作用建模为一种非合作随机游戏。游戏在从系统日志中提取的信息流图(IFG)构建的状态空间上展开。游戏中APT的目的是选择IFG中的过渡,以从攻击点到攻击目标找到IFG中的最佳路径。另一方面,DIFT的目的是动态选择IFG中的节点,以执行用于检测APT的安全分析。我们的游戏模型具有不完美的信息,因为玩家没有有关对手的行动的信息。我们考虑游戏的两种情况(i)当玩家都知道虚假阳性和假阴性率时,并且(ii)两个玩家都不知道虚假阳性和错误阴性率。案例(i)转化为具有完整信息的游戏模型,我们提出了一种基于价值的迭代算法并证明收敛性。案例(ii)转化为具有未知过渡概率的游戏。在这种情况下,我们提出了整合神经网络的层次监督学习(HSL)算法,以预测游戏的价值向量,并将其与策略迭代算法进行计算近似平衡。我们在实际攻击数据集上实施了算法,并验证了方法的性能。
Advanced persistent threats (APTs) are organized prolonged cyberattacks by sophisticated attackers. Although APT activities are stealthy, they interact with the system components and these interactions lead to information flows. Dynamic Information Flow Tracking (DIFT) has been proposed as one of the effective ways to detect APTs using the information flows. However, wide range security analysis using DIFT results in a significant increase in performance overhead and high rates of false-positives and false-negatives generated by DIFT. In this paper, we model the strategic interaction between APT and DIFT as a non-cooperative stochastic game. The game unfolds on a state space constructed from an information flow graph (IFG) that is extracted from the system log. The objective of the APT in the game is to choose transitions in the IFG to find an optimal path in the IFG from an entry point of the attack to an attack target. On the other hand, the objective of DIFT is to dynamically select nodes in the IFG to perform security analysis for detecting APT. Our game model has imperfect information as the players do not have information about the actions of the opponent. We consider two scenarios of the game (i) when the false-positive and false-negative rates are known to both players and (ii) when the false-positive and false-negative rates are unknown to both players. Case (i) translates to a game model with complete information and we propose a value iteration-based algorithm and prove the convergence. Case (ii) translates to a game with unknown transition probabilities. In this case, we propose Hierarchical Supervised Learning (HSL) algorithm that integrates a neural network, to predict the value vector of the game, with a policy iteration algorithm to compute an approximate equilibrium. We implemented our algorithms on real attack datasets and validated the performance of our approach.