论文标题
通过定向集团树的因果关系的主动结构学习
Active Structure Learning of Causal DAGs via Directed Clique Tree
论文作者
论文摘要
越来越多的工作已经开始研究干预设计,以实现因果关系的有效结构学习(DAGS)。一个典型的设置是一种有因果关系的设置,即没有潜在混杂因素,选择偏见或反馈的系统,当观察等效类别(EC)的基本图被视为输入,并且认为干预措施是无噪声的。大多数现有作品都集中在最差的或平均案例下限上,以实现DAG所需的干预措施数量。这些最糟糕的案例下限仅确定基本图中最大的集团可能会使学习真实的DAG很难。在这项工作中,我们为单个节点干预措施开发了一个通用的下限,该措施确定最大的集团始终是结构学习的根本障碍。具体而言,我们通过定向群集将DAG分解为独立定向的组件,并使用它证明将EC中的任何DAG定向所需的单节点干预措施的数量至少是必需图的每个链分量中最大群集的一半。此外,我们提出了一种两阶段的干预设计算法,该算法在弦骨架上的某些条件下,将最佳的干预措施与最大数量数量的最佳干预措施相匹配。我们通过合成实验表明,与大多数相关工作相比,我们的算法可以扩展到更大的图表,并且比其他可伸缩方法获得了更好的最差案例性能。可以在https://github.com/csquires/dct-policy上找到重新创建这些结果的代码库
A growing body of work has begun to study intervention design for efficient structure learning of causal directed acyclic graphs (DAGs). A typical setting is a causally sufficient setting, i.e. a system with no latent confounders, selection bias, or feedback, when the essential graph of the observational equivalence class (EC) is given as an input and interventions are assumed to be noiseless. Most existing works focus on worst-case or average-case lower bounds for the number of interventions required to orient a DAG. These worst-case lower bounds only establish that the largest clique in the essential graph could make it difficult to learn the true DAG. In this work, we develop a universal lower bound for single-node interventions that establishes that the largest clique is always a fundamental impediment to structure learning. Specifically, we present a decomposition of a DAG into independently orientable components through directed clique trees and use it to prove that the number of single-node interventions necessary to orient any DAG in an EC is at least the sum of half the size of the largest cliques in each chain component of the essential graph. Moreover, we present a two-phase intervention design algorithm that, under certain conditions on the chordal skeleton, matches the optimal number of interventions up to a multiplicative logarithmic factor in the number of maximal cliques. We show via synthetic experiments that our algorithm can scale to much larger graphs than most of the related work and achieves better worst-case performance than other scalable approaches. A code base to recreate these results can be found at https://github.com/csquires/dct-policy