自我监督的调整以进行几次分段

论文标题

自我监督的调整以进行几次分段

Self-Supervised Tuning for Few-Shot Segmentation

论文作者

Zhu, Kai, Zhai, Wei, Zha, Zheng-Jun, Cao, Yang

论文摘要

很少的射击分段目的是将类别标签分配给每个图像像素，并具有很少的带注释的样本。这是一项具有挑战性的任务，因为只有在稀疏注释定义的潜在特征的指导下才能实现密集的预测。当从支持图像中提取的视觉特征被边缘化在嵌入空间中时，现有的元学习方法倾向于生成特定判别描述符。为了解决这个问题，本文提出了一个自适应调整框架，其中根据自我分割方案动态调整潜在特征在不同情节中的分布，增强了标签预测的特定于类别的描述符。具体而言，首先将新颖的自我监督的内环被设计为基础学习者，以从支持图像中提取基本的语义特征。然后，通过通过获得的特征反向传播自我监督的损失来计算梯度图，并利用作为增强嵌入空间中相应元素的指导。最后，通过从不同情节中不断学习的能力，基于优化的元学习者被采用为我们提出的框架的外循环，以逐步完善分割结果。基准Pascal- $ 5^{i} $和可可$ 20^{i} $数据集的广泛实验证明了我们所提出的方法比最先进的优越性。

Few-shot segmentation aims at assigning a category label to each image pixel with few annotated samples. It is a challenging task since the dense prediction can only be achieved under the guidance of latent features defined by sparse annotations. Existing meta-learning method tends to fail in generating category-specifically discriminative descriptor when the visual features extracted from support images are marginalized in embedding space. To address this issue, this paper presents an adaptive tuning framework, in which the distribution of latent features across different episodes is dynamically adjusted based on a self-segmentation scheme, augmenting category-specific descriptors for label prediction. Specifically, a novel self-supervised inner-loop is firstly devised as the base learner to extract the underlying semantic features from the support image. Then, gradient maps are calculated by back-propagating self-supervised loss through the obtained features, and leveraged as guidance for augmenting the corresponding elements in embedding space. Finally, with the ability to continuously learn from different episodes, an optimization-based meta-learner is adopted as outer loop of our proposed framework to gradually refine the segmentation results. Extensive experiments on benchmark PASCAL-$5^{i}$ and COCO-$20^{i}$ datasets demonstrate the superiority of our proposed method over state-of-the-art.

下载PDF全文

下载文献需遵守相关版权规定

论文标题