3D Frustum超声波中导管分割的弱监督学习

论文标题

3D Frustum超声波中导管分割的弱监督学习

Weakly-supervised Learning For Catheter Segmentation in 3D Frustum Ultrasound

论文作者

Yang, Hongxu, Shan, Caifeng, Kolen, Alexander F., de With, Peter H. N.

论文摘要

3D超声（US）中的准确有效的导管分割对于心脏干预至关重要。当前，最新的分割算法基于卷积神经网络（CNN），在标准的笛卡尔体积数据中实现了出色的性能。然而，这些方法遇到了低效率和GPU不友好图像大小的挑战。因此，这种困难和昂贵的硬件要求成为一种瓶颈，以建立用于实际临床应用的准确有效的分割模型。在本文中，我们提出了一种新型的基于Flustum超声的导管分割方法。具体而言，Frustum Ultrasound是一个基于极坐标的图像，它包含标准笛卡尔图像的相同信息，但尺寸要小得多，它比传统的笛卡尔图像要克服效率的瓶颈。然而，不规则和变形的粉丝图像为准确的体素水平注释提供了更多的努力。为了解决这一限制，提出了一个弱监督的学习框架，只需要覆盖利益区域的3D边界框注释来培训CNN。尽管边界框注释包括噪声和不准确的注释对模型的误导，但它由提议的伪标签生成的方案解决。训练体素的标签是通过将类激活图与线过滤结合结合在一起来生成的，这在训练过程中是迭代更新的。我们的实验结果表明，所提出的方法以每卷0.25秒的效率达到了最先进的性能。更重要的是，Frustum图像分割为3D US图像中的分割提供了更快，更便宜的解决方案，可满足临床应用的需求。

Accurate and efficient catheter segmentation in 3D ultrasound (US) is essential for cardiac intervention. Currently, the state-of-the-art segmentation algorithms are based on convolutional neural networks (CNNs), which achieved remarkable performances in a standard Cartesian volumetric data. Nevertheless, these approaches suffer the challenges of low efficiency and GPU unfriendly image size. Therefore, such difficulties and expensive hardware requirements become a bottleneck to build accurate and efficient segmentation models for real clinical application. In this paper, we propose a novel Frustum ultrasound based catheter segmentation method. Specifically, Frustum ultrasound is a polar coordinate based image, which includes same information of standard Cartesian image but has much smaller size, which overcomes the bottleneck of efficiency than conventional Cartesian images. Nevertheless, the irregular and deformed Frustum images lead to more efforts for accurate voxel-level annotation. To address this limitation, a weakly supervised learning framework is proposed, which only needs 3D bounding box annotations overlaying the region-of-interest to training the CNNs. Although the bounding box annotation includes noise and inaccurate annotation to mislead to model, it is addressed by the proposed pseudo label generated scheme. The labels of training voxels are generated by incorporating class activation maps with line filtering, which is iteratively updated during the training. Our experimental results show the proposed method achieved the state-of-the-art performance with an efficiency of 0.25 second per volume. More crucially, the Frustum image segmentation provides a much faster and cheaper solution for segmentation in 3D US image, which meet the demands of clinical applications.

下载PDF全文

下载文献需遵守相关版权规定

论文标题