介入内窥镜镜面及其对图像对应的影响的时间学习方法

论文标题

介入内窥镜镜面及其对图像对应的影响的时间学习方法

A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its effect on Image Correspondence

论文作者

Daher, Rema, Vasconcelos, Francisco, Stoyanov, Danail

论文摘要

视频流可用于在广泛的过程中指导微创手术和诊断程序，并且已经开发了许多计算机辅助技术来自动分析它们。这些方法可以为外科医生提供其他信息，例如病变检测，仪器导航或解剖3D形状建模。但是，由于存在不规则的光模式，例如镜面突出显示的反射，并不总是能够可靠地检测到这些模式的必要图像特征。在本文中，我们旨在使用机器学习从内窥镜视频中删除镜面亮点。我们建议使用时间生成对抗网络（GAN）在镜面下对隐藏的解剖结构进行涂漆，从而在空间上和相邻框架中推断出它们在同一位置不存在的框架。这是使用胃内窥镜检查（Hyper-Kvasir）的体内数据以一种完全无监督的方式来实现的，该数据依赖于自动检测镜面亮点。系统评估通过直接比较以及其他机器学习技术通过消融研究来显示出显着改善，该研究描述了网络时间和转移学习组件的重要性。我们的系统对不同手术设置和程序的普遍性也对胃内窥镜检查和前体内猪数据（Serv-CT，Scred，Scared）进行定性评估。我们还评估了基于3D重建和摄像机运动估计的计算机视觉任务的效果，即立体声差异，光流和稀疏点功能匹配。这些在定量和定性上进行了定量和定性评估，结果在新的综合分析中显示了对这些任务的镜面介绍的积极作用。

Video streams are utilised to guide minimally-invasive surgery and diagnostic procedures in a wide range of procedures, and many computer assisted techniques have been developed to automatically analyse them. These approaches can provide additional information to the surgeon such as lesion detection, instrument navigation, or anatomy 3D shape modeling. However, the necessary image features to recognise these patterns are not always reliably detected due to the presence of irregular light patterns such as specular highlight reflections. In this paper, we aim at removing specular highlights from endoscopic videos using machine learning. We propose using a temporal generative adversarial network (GAN) to inpaint the hidden anatomy under specularities, inferring its appearance spatially and from neighbouring frames where they are not present in the same location. This is achieved using in-vivo data of gastric endoscopy (Hyper-Kvasir) in a fully unsupervised manner that relies on automatic detection of specular highlights. System evaluations show significant improvements to traditional methods through direct comparison as well as other machine learning techniques through an ablation study that depicts the importance of the network's temporal and transfer learning components. The generalizability of our system to different surgical setups and procedures was also evaluated qualitatively on in-vivo data of gastric endoscopy and ex-vivo porcine data (SERV-CT, SCARED). We also assess the effect of our method in computer vision tasks that underpin 3D reconstruction and camera motion estimation, namely stereo disparity, optical flow, and sparse point feature matching. These are evaluated quantitatively and qualitatively and results show a positive effect of specular highlight inpainting on these tasks in a novel comprehensive analysis.

下载PDF全文

下载文献需遵守相关版权规定

论文标题