论文标题
半监督的机器学习模型,用于分析传输电子显微镜图像的纳米线形态
Semi-supervised machine learning model for analysis of nanowire morphologies from transmission electron microscopy images
论文作者
论文摘要
在材料科学领域,显微镜是第一个且通常仅用于结构表征的方法。对可以自动化显微镜图像的分析和解释的机器学习方法的开发越来越感兴趣。通常,对机器学习模型进行培训需要大量具有相关结构标签的图像,但是,手动标记图像需要域知识,并且容易受到人为错误和主观性的影响。为了克服这些局限性,我们提出了一种半监督的转移学习方法,该方法使用少量标记的显微镜图像进行训练,并像在明显更大的图像数据集中训练的方法一样有效地执行。具体来说,我们使用自我监督的学习方法训练图像编码器,并使用编码器来传输不同下游图像任务(分类和分割),并使用最少数量的标记图像进行培训来传递该编码器。我们测试了两种自我监督学习方法的转移学习能力:传输电子显微镜(TEM)图像的Simclr和Barlow-Twins。我们详细说明了该机器学习工作流程如何应用于蛋白质纳米线的TEM图像如何实现纳米线形态的自动分类(例如,单纳米线,纳米线,纳米线捆,相位分离)以及可以作为定量纳米域域尺寸和形状分析的基础的分段任务。我们还将机器学习工作流程的应用扩展到纳米颗粒形态的分类以及从TEM图像中鉴定不同类型病毒的分类。
In the field of materials science, microscopy is the first and often only accessible method for structural characterization. There is a growing interest in the development of machine learning methods that can automate the analysis and interpretation of microscopy images. Typically training of machine learning models requires large numbers of images with associated structural labels, however, manual labeling of images requires domain knowledge and is prone to human error and subjectivity. To overcome these limitations, we present a semi-supervised transfer learning approach that uses a small number of labeled microscopy images for training and performs as effectively as methods trained on significantly larger image datasets. Specifically, we train an image encoder with unlabeled images using self-supervised learning methods and use that encoder for transfer learning of different downstream image tasks (classification and segmentation) with a minimal number of labeled images for training. We test the transfer learning ability of two self-supervised learning methods: SimCLR and Barlow-Twins on transmission electron microscopy (TEM) images. We demonstrate in detail how this machine learning workflow applied to TEM images of protein nanowires enables automated classification of nanowire morphologies (e.g., single nanowires, nanowire bundles, phase separated) as well as segmentation tasks that can serve as groundwork for quantification of nanowire domain sizes and shape analysis. We also extend the application of the machine learning workflow to classification of nanoparticle morphologies and identification of different type of viruses from TEM images.