通过转移学习自动为组织图像像病理学家一样为图像评分

论文标题

通过转移学习自动为组织图像像病理学家一样为图像评分

Automatically Score Tissue Images Like a Pathologist by Transfer Learning

论文作者

Yan, Iris

论文摘要

癌症是世界上第二大死亡原因。尽早诊断癌症可以挽救许多生命。病理学家必须手动查看组织微阵列（TMA）图像，以鉴定肿瘤，这些肿瘤可能耗时，不一致且主观。现有的自动算法要么没有达到病理学家的准确性水平，要么需要大量的人类参与。一个主要的挑战是，具有不同形状，大小和位置的TMA图像可以具有相同的分数。在TMA图像中学习染色模式需要大量图像，由于医疗组织的隐私和监管问题，这些图像受到严重限制。来自不同癌症类型的TMA图像可能具有某些共同特征，但是将它们结合起来直接损害了由于其染色模式异质性而引起的准确性。转移学习是一种新兴的学习范式，可以从类似问题中借用强度。但是，现有方法通常需要从类似的学习问题中进行大量样本，而不同癌症类型的TMA图像通常以较小的样本量获得，并且进一步的现有算法仅限于从一个类似问题中转移学习。我们提出了一种新的转移学习算法，该算法可以从多个相关问题中学习，其中每个问题都有一个很小的样本，并且与原始问题的分布可能大不相同。拟议的算法使打破关键精度屏障（病理学家的75％精度水平）成为可能，据报道，来自斯坦福组织微阵列数据库的乳腺癌TMA图像的准确性为75.9％。转移学习理论和聚类技术中经验证据的最新发展得到了支持。这将使病理学家可以自信地采用自动算法在实时识别肿瘤方面持续识别肿瘤。

Cancer is the second leading cause of death in the world. Diagnosing cancer early on can save many lives. Pathologists have to look at tissue microarray (TMA) images manually to identify tumors, which can be time-consuming, inconsistent and subjective. Existing automatic algorithms either have not achieved the accuracy level of a pathologist or require substantial human involvements. A major challenge is that TMA images with different shapes, sizes, and locations can have the same score. Learning staining patterns in TMA images requires a huge number of images, which are severely limited due to privacy and regulation concerns in medical organizations. TMA images from different cancer types may share certain common characteristics, but combining them directly harms the accuracy due to heterogeneity in their staining patterns. Transfer learning is an emerging learning paradigm that allows borrowing strength from similar problems. However, existing approaches typically require a large sample from similar learning problems, while TMA images of different cancer types are often available in small sample size and further existing algorithms are limited to transfer learning from one similar problem. We propose a new transfer learning algorithm that could learn from multiple related problems, where each problem has a small sample and can have a substantially different distribution from the original one. The proposed algorithm has made it possible to break the critical accuracy barrier (the 75% accuracy level of pathologists), with a reported accuracy of 75.9% on breast cancer TMA images from the Stanford Tissue Microarray Database. It is supported by recent developments in transfer learning theory and empirical evidence in clustering technology. This will allow pathologists to confidently adopt automatic algorithms in recognizing tumors consistently with a higher accuracy in real time.

下载PDF全文

下载文献需遵守相关版权规定

论文标题