论文标题
在离散数据中的局部可识别性和测量误差
Partial Identifiability in Discrete Data With Measurement Error
论文作者
论文摘要
当数据包含测量误差时,有必要做出将观察到的错误数据与未观察到的真实现象相关的假设。这些假设应基于实质性的理由是合理的,但通常是出于数学便利性的动机,以确切地识别推理的目标。我们采用这样的观点,即比在可疑的假设下提出界限要比进行确切识别的观点。为此,我们演示了涉及离散变量的广泛建模假设(包括常见的测量误差和条件独立性假设)如何表示为模型参数的线性约束。然后,我们使用线性编程技术在此类模型中的测量误差下产生尖锐的界限,以实现事实和反事实分布。我们还提出了一种在非线性模型上获得外界的程序。我们的方法在许多重要的设置中产生急剧的界限(例如具有测量误差的仪器变量方案),以前尚无界限。
When data contains measurement errors, it is necessary to make assumptions relating the observed, erroneous data to the unobserved true phenomena of interest. These assumptions should be justifiable on substantive grounds, but are often motivated by mathematical convenience, for the sake of exactly identifying the target of inference. We adopt the view that it is preferable to present bounds under justifiable assumptions than to pursue exact identification under dubious ones. To that end, we demonstrate how a broad class of modeling assumptions involving discrete variables, including common measurement error and conditional independence assumptions, can be expressed as linear constraints on the parameters of the model. We then use linear programming techniques to produce sharp bounds for factual and counterfactual distributions under measurement error in such models. We additionally propose a procedure for obtaining outer bounds on non-linear models. Our method yields sharp bounds in a number of important settings -- such as the instrumental variable scenario with measurement error -- for which no bounds were previously known.