论文标题
Deep Vulman:深入强化学习支持的网络脆弱性管理框架
Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework
论文作者
论文摘要
网络脆弱性管理是网络安全操作中心(CSOC)的关键功能,该中心有助于保护组织免受计算机和网络系统上的网络攻击。对手比CSOC具有不对称的优势,因为与安全团队在资源受限的环境中减轻它们相比,这些系统中的缺陷次数的增长速度明显更高。当前的方法是确定性和一次性决策方法,在优先级和选择缓解漏洞时,这些方法不考虑未来的不确定性。这些方法也受到资源的亚最佳分布的约束,没有灵活性来调整其对脆弱性到达波动的响应的灵活性。我们提出了一个新颖的框架,深深的瓦尔曼,由深入的强化学习代理和整数编程方法组成,以填补网络脆弱性管理过程中的这一空白。我们的顺序决策框架首先确定在给定系统状态下在不确定性下分配的近乎最理想的资源,然后确定缓解优先级的漏洞实例的最佳集合。我们提出的框架在一年内观察到的模拟和现实世界漏洞数据上,在模拟和现实世界漏洞数据上选择重要的组织特定漏洞的优先级优于当前方法。
Cyber vulnerability management is a critical function of a cybersecurity operations center (CSOC) that helps protect organizations against cyber-attacks on their computer and network systems. Adversaries hold an asymmetric advantage over the CSOC, as the number of deficiencies in these systems is increasing at a significantly higher rate compared to the expansion rate of the security teams to mitigate them in a resource-constrained environment. The current approaches are deterministic and one-time decision-making methods, which do not consider future uncertainties when prioritizing and selecting vulnerabilities for mitigation. These approaches are also constrained by the sub-optimal distribution of resources, providing no flexibility to adjust their response to fluctuations in vulnerability arrivals. We propose a novel framework, Deep VULMAN, consisting of a deep reinforcement learning agent and an integer programming method to fill this gap in the cyber vulnerability management process. Our sequential decision-making framework, first, determines the near-optimal amount of resources to be allocated for mitigation under uncertainty for a given system state and then determines the optimal set of prioritized vulnerability instances for mitigation. Our proposed framework outperforms the current methods in prioritizing the selection of important organization-specific vulnerabilities, on both simulated and real-world vulnerability data, observed over a one-year period.