无监督的地标从未配对的数据中学习

论文标题

无监督的地标从未配对的数据中学习

Unsupervised Landmark Learning from Unpaired Data

论文作者

Xu, Yinghao, Yang, Ceyuan, Liu, Ziwei, Dai, Bo, Zhou, Bolei

论文摘要

最近无监督的地标学习的尝试利用了外观相似但姿势不同的合成图像对。这些方法通过鼓励原始图像与从交换外观和姿势重建的图像之间的一致性来学习地标。虽然合成图像对是通过应用预定义的转换创建的，但它们无法完全反映外观和姿势中的真实差异。在本文中，我们旨在为从自然图像收集中采样的不配对数据（即未对齐的图像对）学习地标的可能性，以便它们在外观和姿势上都可以不同。为此，我们提出了一个跨图像周期一致性框架（$ c^3 $），该框架两次应用交换重建策略以获得最终的监督。此外，进一步引入了一个跨图像流量模块，以在图像跨图像之间施加估计的地标之间。通过全面的实验，我们提出的框架被证明超过了强大的基线。除了定量结果外，我们还对我们学到的模型提供了可视化和解释，这不仅验证了学习地标的有效性，而且还会导致重要的见解，这些见解对未来的研究有益。

Recent attempts for unsupervised landmark learning leverage synthesized image pairs that are similar in appearance but different in poses. These methods learn landmarks by encouraging the consistency between the original images and the images reconstructed from swapped appearances and poses. While synthesized image pairs are created by applying pre-defined transformations, they can not fully reflect the real variances in both appearances and poses. In this paper, we aim to open the possibility of learning landmarks on unpaired data (i.e. unaligned image pairs) sampled from a natural image collection, so that they can be different in both appearances and poses. To this end, we propose a cross-image cycle consistency framework ($C^3$) which applies the swapping-reconstruction strategy twice to obtain the final supervision. Moreover, a cross-image flow module is further introduced to impose the equivariance between estimated landmarks across images. Through comprehensive experiments, our proposed framework is shown to outperform strong baselines by a large margin. Besides quantitative results, we also provide visualization and interpretation on our learned models, which not only verifies the effectiveness of the learned landmarks, but also leads to important insights that are beneficial for future research.

下载PDF全文

下载文献需遵守相关版权规定

论文标题