实例语义细分受益于生成对抗网络

论文标题

实例语义细分受益于生成对抗网络

Instance Semantic Segmentation Benefits from Generative Adversarial Networks

论文作者

Le, Quang H., Youcef-Toumi, Kamal, Tsetserukou, Dzmitry, Jahanian, Ali

论文摘要

在重建掩码的实例分割网络的设计中，分割通常被视为其字面定义 - 分配每个像素一个标签。这导致将问题视为一个模板，其目标是最大程度地减少重建和地面真相像素之间的损失。将重建网络重新考虑为生成器，我们定义了将口罩作为gans游戏框架的问题：细分网络生成掩码，歧视者网络决定掩模的质量。为了演示这款游戏，我们在Mask R-CNN中展示了对一般细分框架的有效修改。我们发现，在特征空间中玩游戏比导致歧视器和发电机之间稳定训练的像素空间更有效，预测对象坐标应通过预测对象的上下文区域来代替对象坐标，总体而言，对抗性损失有助于性能并消除每个不同数据域的任何自定义设置的需求。我们在各个领域测试我们的框架，并报告手机回收，自动驾驶，大规模对象检测和医疗腺体。我们观察到甘恩产生的面具，这些口罩是较脆的边界，杂物，小物体和细节，处于常规形状或异质和融合形状的领域。我们用于复制结果的代码可公开获得。

In design of instance segmentation networks that reconstruct masks, segmentation is often taken as its literal definition -- assigning each pixel a label. This has led to thinking the problem as a template matching one with the goal of minimizing the loss between the reconstructed and the ground truth pixels. Rethinking reconstruction networks as a generator, we define the problem of predicting masks as a GANs game framework: A segmentation network generates the masks, and a discriminator network decides on the quality of the masks. To demonstrate this game, we show effective modifications on the general segmentation framework in Mask R-CNN. We find that playing the game in feature space is more effective than the pixel space leading to stable training between the discriminator and the generator, predicting object coordinates should be replaced by predicting contextual regions for objects, and overall the adversarial loss helps the performance and removes the need for any custom settings per different data domain. We test our framework in various domains and report on cellphone recycling, autonomous driving, large-scale object detection, and medical glands. We observe in general GANs yield masks that account for crispier boundaries, clutter, small objects, and details, being in domain of regular shapes or heterogeneous and coalescing shapes. Our code for reproducing the results is available publicly.

下载PDF全文

下载文献需遵守相关版权规定

论文标题