使用电子健康记录进行风险预测的复合密度网络

论文标题

使用电子健康记录进行风险预测的复合密度网络

Compound Density Networks for Risk Prediction using Electronic Health Records

论文作者

Liu, Yuxi, Qin, Shaowen, Zhang, Zhenhao, Shao, Wei

论文摘要

由于患者状况和治疗需求的变化，电子健康记录（EHR）表现出大量丢失的数据。缺失价值的插补被认为是应对这一挑战的有效方法。现有的工作将插补方法和预测模型分为基于EHR的机器学习系统的两个独立部分。我们通过利用复合密度网络（CDNET）提出了一种集成的端到端方法，该方法允许插入方法和预测模型在单个框架中调整在一起。 CDNET由一个封闭式复发单元（GRU），混合物密度网络（MDN）和正则注意网络（RAN）组成。 GRU用作对EHR数据进行建模的潜在变量模型。 MDN旨在采样GRU产生的潜在变量。该运行是适用于不太可靠的估算值的正规化程序。 CDNET的架构使GRU和MDN迭代地利用彼此的输出来估算缺失值，从而导致更准确，更健壮的预测。我们验证了模仿数据集上的死亡率预测任务的CDNET。我们的模型以大幅度的利润优于最先进的模型。我们还从经验上表明，正规化值是出色预测性能的关键因素。对预测不确定性的分析表明，我们的模型可以捕获核心和认知不确定性，这使模型用户可以更好地了解模型结果。

Electronic Health Records (EHRs) exhibit a high amount of missing data due to variations of patient conditions and treatment needs. Imputation of missing values has been considered an effective approach to deal with this challenge. Existing work separates imputation method and prediction model as two independent parts of an EHR-based machine learning system. We propose an integrated end-to-end approach by utilizing a Compound Density Network (CDNet) that allows the imputation method and prediction model to be tuned together within a single framework. CDNet consists of a Gated recurrent unit (GRU), a Mixture Density Network (MDN), and a Regularized Attention Network (RAN). The GRU is used as a latent variable model to model EHR data. The MDN is designed to sample latent variables generated by GRU. The RAN serves as a regularizer for less reliable imputed values. The architecture of CDNet enables GRU and MDN to iteratively leverage the output of each other to impute missing values, leading to a more accurate and robust prediction. We validate CDNet on the mortality prediction task on the MIMIC-III dataset. Our model outperforms state-of-the-art models by significant margins. We also empirically show that regularizing imputed values is a key factor for superior prediction performance. Analysis of prediction uncertainty shows that our model can capture both aleatoric and epistemic uncertainties, which offers model users a better understanding of the model results.

下载PDF全文

下载文献需遵守相关版权规定

论文标题