从1亿次医学图像中的自学学习

论文标题

从1亿次医学图像中的自学学习

Self-supervised Learning from 100 Million Medical Images

论文作者

Ghesu, Florin C., Georgescu, Bogdan, Mansoor, Awais, Yoo, Youngjin, Neumann, Dominik, Patel, Pragneshkumar, Vishwanath, R. S., Balter, James M., Cao, Yue, Grbic, Sasa, Comaniciu, Dorin

论文摘要

构建准确，健壮的人工智能系统进行医学图像评估不仅需要先进的深度学习模型的研究和设计，而且还需要创建大型和精选的带注释的培训示例。但是，构建此类数据集通常是非常昂贵的 - 由于注释任务的复杂性质以及解释医学图像所需的高水平（例如，专家放射科医生）。为了应对这一限制，我们提出了一种基于对比度学习和在线特征群集的自我监督学习丰富图像特征的方法。为此，我们利用了超过100,000,000种各种方式的医学图像的大型培训数据集，包括射线照相，计算机断层扫描（CT），磁共振（MR）成像和超声检查。我们建议使用这些功能来指导在各种下游任务的监督和混合自我监督/监督制度中进行模型培训。我们强调了这种策略在X射线照相，CT和MR中挑战图像评估问题的许多优势：1）与最先进的表现相比，准确性显着提高（例如，AUC提高了3-7％的3-7％，可检测到胸部X线射线照相扫描和大脑CT上的出血性检测中的异常和出血）; 2）与不使用训练相比，在训练过程中，模型收敛的加速度高达85％（例如，在训练MR扫描中检测脑转移模型的模型时，有83％）； 3）增加对各种图像增强的鲁棒性增加，例如强度变化，旋转或缩放反映现场看到的数据变化。

Building accurate and robust artificial intelligence systems for medical image assessment requires not only the research and design of advanced deep learning models but also the creation of large and curated sets of annotated training examples. Constructing such datasets, however, is often very costly -- due to the complex nature of annotation tasks and the high level of expertise required for the interpretation of medical images (e.g., expert radiologists). To counter this limitation, we propose a method for self-supervised learning of rich image features based on contrastive learning and online feature clustering. For this purpose we leverage large training datasets of over 100,000,000 medical images of various modalities, including radiography, computed tomography (CT), magnetic resonance (MR) imaging and ultrasonography. We propose to use these features to guide model training in supervised and hybrid self-supervised/supervised regime on various downstream tasks. We highlight a number of advantages of this strategy on challenging image assessment problems in radiography, CT and MR: 1) Significant increase in accuracy compared to the state-of-the-art (e.g., AUC boost of 3-7% for detection of abnormalities from chest radiography scans and hemorrhage detection on brain CT); 2) Acceleration of model convergence during training by up to 85% compared to using no pretraining (e.g., 83% when training a model for detection of brain metastases in MR scans); 3) Increase in robustness to various image augmentations, such as intensity variations, rotations or scaling reflective of data variation seen in the field.

下载PDF全文

下载文献需遵守相关版权规定

论文标题