Covsegnet：用于改进Covid-19胸部CT扫描病变分割的多码编码器结构

论文标题

Covsegnet：用于改进Covid-19胸部CT扫描病变分割的多码编码器结构

CovSegNet: A Multi Encoder-Decoder Architecture for Improved Lesion Segmentation of COVID-19 Chest CT Scans

论文作者

Mahmud, Tanvir, Rahman, Md Awsafur, Fattah, Shaikh Anowarul, Kung, Sun-Yuan

论文摘要

胸部CT扫描的自动肺部病变分割被认为是旨在准确诊断和严重程度测量的关键阶段。传统的U形编码器架构及其变体在汇总/UPS采样操作中的上下文信息减少，而编码和解码的特征图之间的语义差异增加，并引起了消失的梯度梯度问题的消失梯度问题，从而导致了子次数绩效。此外，由于计算复杂性的指数呈增加，因此使用3D CT-ALLIME运行会构成进一步的局限性。在本文中，提出了一种使用高效的神经网络体系结构（即Covsegnet）来克服这些限制，提出了一种自动化的Covid-19病变细分方案。此外，还引入了两阶段训练方案，其中采用了更深层次的2D网络来产生Roi增强的CT-CT-VOLUME，然后使用较浅的3D NetWork，以进一步增强，并在不增加计算负担的情况下使用更多的上下文信息。除了传统的UNET垂直扩展外，我们还引入了水平扩展，并使用多阶段编码器模块来实现最佳性能。此外，将多尺度特征图集成到量表过渡过程中，以克服上下文信息的丢失。此外，引入了多尺度融合模块，采用金字塔融合方案引入，以减少随后的编码器/解码器模块之间的语义差距，同时促进并行优化以进行有效的梯度传播。在三个公开可获得的数据集中，已经取得了出色的表现，这些数据集在很大程度上要超过其他最先进的方法。可以轻松扩展所提出的方案，以实现各种应用中的最佳分割性能。

Automatic lung lesions segmentation of chest CT scans is considered a pivotal stage towards accurate diagnosis and severity measurement of COVID-19. Traditional U-shaped encoder-decoder architecture and its variants suffer from diminutions of contextual information in pooling/upsampling operations with increased semantic gaps among encoded and decoded feature maps as well as instigate vanishing gradient problems for its sequential gradient propagation that result in sub-optimal performance. Moreover, operating with 3D CT-volume poses further limitations due to the exponential increase of computational complexity making the optimization difficult. In this paper, an automated COVID-19 lesion segmentation scheme is proposed utilizing a highly efficient neural network architecture, namely CovSegNet, to overcome these limitations. Additionally, a two-phase training scheme is introduced where a deeper 2D-network is employed for generating ROI-enhanced CT-volume followed by a shallower 3D-network for further enhancement with more contextual information without increasing computational burden. Along with the traditional vertical expansion of Unet, we have introduced horizontal expansion with multi-stage encoder-decoder modules for achieving optimum performance. Additionally, multi-scale feature maps are integrated into the scale transition process to overcome the loss of contextual information. Moreover, a multi-scale fusion module is introduced with a pyramid fusion scheme to reduce the semantic gaps between subsequent encoder/decoder modules while facilitating the parallel optimization for efficient gradient propagation. Outstanding performances have been achieved in three publicly available datasets that largely outperform other state-of-the-art approaches. The proposed scheme can be easily extended for achieving optimum segmentation performances in a wide variety of applications.

下载PDF全文

下载文献需遵守相关版权规定

论文标题