论文标题
量化的一个损失:与离散的Wasserstein分布匹配的深度哈希
One Loss for Quantization: Deep Hashing with Discrete Wasserstein Distributional Matching
论文作者
论文摘要
图像哈希是一种原则上的近似邻居方法,可以在大量图像集合中找到与查询相似的项目。哈希旨在学习将图像映射到二进制向量的二元输出函数。为了获得最佳的检索性能,生成具有低量化误差的平衡哈希码,以弥合学习阶段的持续放松和推理阶段离散量化之间的差距很重要。但是,在现有的深层监督方法中,很难实现编码平衡和低量化误差,并且涉及多个损失。我们认为这是因为这些方法中现有的量化方法是启发式构造的,并且无法实现这些目标。本文考虑了一种学习量化约束的替代方法。学习平衡代码具有低量化误差的任务被重新构建,因为将连续代码的学习分布与预定义的离散分布相匹配。这相当于最大程度地减少两个分布之间的距离。然后,我们通过利用哈希函数的离散属性提出了一个计算有效的分布距离。该分布距离是有效的距离,并且具有较低的时间和样本复杂性。提出的单损失量化目标可以集成到任何现有的监督哈希方法中,以改善代码平衡和量化错误。实验证实,所提出的方法基本上改善了几种代表性的哈希〜方法的性能。
Image hashing is a principled approximate nearest neighbor approach to find similar items to a query in a large collection of images. Hashing aims to learn a binary-output function that maps an image to a binary vector. For optimal retrieval performance, producing balanced hash codes with low-quantization error to bridge the gap between the learning stage's continuous relaxation and the inference stage's discrete quantization is important. However, in the existing deep supervised hashing methods, coding balance and low-quantization error are difficult to achieve and involve several losses. We argue that this is because the existing quantization approaches in these methods are heuristically constructed and not effective to achieve these objectives. This paper considers an alternative approach to learning the quantization constraints. The task of learning balanced codes with low quantization error is re-formulated as matching the learned distribution of the continuous codes to a pre-defined discrete, uniform distribution. This is equivalent to minimizing the distance between two distributions. We then propose a computationally efficient distributional distance by leveraging the discrete property of the hash functions. This distributional distance is a valid distance and enjoys lower time and sample complexities. The proposed single-loss quantization objective can be integrated into any existing supervised hashing method to improve code balance and quantization error. Experiments confirm that the proposed approach substantially improves the performance of several representative hashing~methods.