论文标题
通过敏感属性预测变量估算和控制均衡的赔率
Estimating and Controlling for Equalized Odds via Sensitive Attribute Predictors
论文作者
论文摘要
随着在现实世界中的高风险决策设置中使用机器学习模型的使用,我们必须能够审核和控制这些模型可能会对某些群体表现出任何潜在的公平性行为,这一点非常重要。为此,人们自然需要访问敏感属性,例如人口统计学,性别或其他决定组成员资格的潜在敏感特征。不幸的是,在许多情况下,此信息通常不可用。在这项工作中,我们研究了众所周知的\ emph {均衡的赔率}(eod)公平的定义。在没有敏感属性的设置中,我们首先为EOD违反预测变量提供紧密而可计算的上限。这些界限准确反映了最严重的EOD违规行为。其次,我们演示了如何通过一种新的后处理校正方法来证明人们如何控制最坏情况。当直接控制EOD相对于预测的敏感属性时,我们的结果表征是 - 何时不是 - 在控制最坏情况时最佳。我们的结果在比以前的工作温和的假设下保持不变,我们通过有关合成和实际数据集的实验来说明这些结果。
As the use of machine learning models in real world high-stakes decision settings continues to grow, it is highly important that we are able to audit and control for any potential fairness violations these models may exhibit towards certain groups. To do so, one naturally requires access to sensitive attributes, such as demographics, gender, or other potentially sensitive features that determine group membership. Unfortunately, in many settings, this information is often unavailable. In this work we study the well known \emph{equalized odds} (EOD) definition of fairness. In a setting without sensitive attributes, we first provide tight and computable upper bounds for the EOD violation of a predictor. These bounds precisely reflect the worst possible EOD violation. Second, we demonstrate how one can provably control the worst-case EOD by a new post-processing correction method. Our results characterize when directly controlling for EOD with respect to the predicted sensitive attributes is -- and when is not -- optimal when it comes to controlling worst-case EOD. Our results hold under assumptions that are milder than previous works, and we illustrate these results with experiments on synthetic and real datasets.