论文标题

通过不对称的反事实学习政策学习

Policy Learning with Asymmetric Counterfactual Utilities

论文作者

Ben-Michael, Eli, Imai, Kosuke, Jiang, Zhichao

论文摘要

即使在医学和公共政策等高风险设置中,数据驱动的决策也起着重要作用。从观察到的数据中学习最佳策略需要仔细提出效用函数,该功能的期望值在人群中最大化。尽管研究人员通常使用依赖于观察到的结果的实用程序,但在许多情况下,决策者的效用函数的特征是所有行动中的潜在结果的联合集合。例如,“不伤害”的希波克拉底原则意味着,造成死亡的成本在未经治疗的情况下生存的患者的成本大于放弃挽救生命的治疗的成本。我们考虑使用该形式的不对称反事实功能的最佳政策学习,这些功能考虑了潜在结果的联合集合。我们表明,不对称的反事实实用程序导致了无法识别的预期效用函数,因此我们首先部分识别它。利用统计决策理论,我们通过将最大预期效用损失相对于不同的替代政策来得出最小的决策规则。我们表明,通过解决中间分类问题,可以从观察到的数据中学习最小值损失决策规则,并确定有限的样本过多的预期效用效用该过程受这些中间分类器的遗憾所限制。我们将此概念框架和方法应用于可能是否使用肺高血压患者是否使用正确的心脏导管插入术的决定。

Data-driven decision making plays an important role even in high stakes settings like medicine and public policy. Learning optimal policies from observed data requires a careful formulation of the utility function whose expected value is maximized across a population. Although researchers typically use utilities that depend on observed outcomes alone, in many settings the decision maker's utility function is more properly characterized by the joint set of potential outcomes under all actions. For example, the Hippocratic principle to "do no harm" implies that the cost of causing death to a patient who would otherwise survive without treatment is greater than the cost of forgoing life-saving treatment. We consider optimal policy learning with asymmetric counterfactual utility functions of this form that consider the joint set of potential outcomes. We show that asymmetric counterfactual utilities lead to an unidentifiable expected utility function, and so we first partially identify it. Drawing on statistical decision theory, we then derive minimax decision rules by minimizing the maximum expected utility loss relative to different alternative policies. We show that one can learn minimax loss decision rules from observed data by solving intermediate classification problems, and establish that the finite sample excess expected utility loss of this procedure is bounded by the regret of these intermediate classifiers. We apply this conceptual framework and methodology to the decision about whether or not to use right heart catheterization for patients with possible pulmonary hypertension.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源