深度学习以量化胸部射线照相中的肺水肿

论文标题

深度学习以量化胸部射线照相中的肺水肿

Deep Learning to Quantify Pulmonary Edema in Chest Radiographs

论文作者

Horng, Steven, Liao, Ruizhi, Wang, Xin, Dalal, Sandeep, Golland, Polina, Berkowitz, Seth J

论文摘要

目的：开发机器学习模型以对胸部X光片上的肺水肿的严重程度分类。材料和方法：在这项回顾性研究中，包括来自Mimic-CXR胸部X光片数据集的64,581例（平均年龄为51.71; 54.51％女性）患者的369,071张胸部X光片和相关的放射学报告。该数据集分为具有充血性心力衰竭（CHF）的患者。从CHF患者中提取了相关放射学报告中的肺水肿严重性标签，为四个不同的序数水平：0，无水肿； 1，血管充血； 2，间质水肿；和3，肺泡水肿。使用两种方法开发了深度学习模型：使用各种自动编码器的半监督模型和使用密集的神经网络进行预训练的监督学习模型。对两个模型进行了接收器操作特征曲线分析。结果：针对半监督模型的无水肿区分肺泡水肿的接收器操作特征曲线（AUC）的面积为0.99，预训练模型为0.87。该算法的性能与对肺水肿的较温和状态的难度成反比（分别为半监督模型和预训练模型的AUC）：2对0、0.88和0.81； 1对0、0.79和0.66; 3对1、0.93和0.82; 2对1、0.69和0.73; 3对2、0.88和0.63。结论：深度学习模型接受了大型胸部X光片数据集的训练，并且可以在高性能的胸部X光片上对肺水肿的严重程度进行评分。

Purpose: To develop a machine learning model to classify the severity grades of pulmonary edema on chest radiographs. Materials and Methods: In this retrospective study, 369,071 chest radiographs and associated radiology reports from 64,581 (mean age, 51.71; 54.51% women) patients from the MIMIC-CXR chest radiograph dataset were included. This dataset was split into patients with and without congestive heart failure (CHF). Pulmonary edema severity labels from the associated radiology reports were extracted from patients with CHF as four different ordinal levels: 0, no edema; 1, vascular congestion; 2, interstitial edema; and 3, alveolar edema. Deep learning models were developed using two approaches: a semi-supervised model using a variational autoencoder and a pre-trained supervised learning model using a dense neural network. Receiver operating characteristic curve analysis was performed on both models. Results: The area under the receiver operating characteristic curve (AUC) for differentiating alveolar edema from no edema was 0.99 for the semi-supervised model and 0.87 for the pre-trained models. Performance of the algorithm was inversely related to the difficulty in categorizing milder states of pulmonary edema (shown as AUCs for semi-supervised model and pre-trained model, respectively): 2 versus 0, 0.88 and 0.81; 1 versus 0, 0.79 and 0.66; 3 versus 1, 0.93 and 0.82; 2 versus 1, 0.69 and 0.73; and, 3 versus 2, 0.88 and 0.63. Conclusion: Deep learning models were trained on a large chest radiograph dataset and could grade the severity of pulmonary edema on chest radiographs with high performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题