论文标题
DRKF:蒸馏的旋转内核融合,以在局部特征匹配中有效旋转不变描述符
DRKF: Distilled Rotated Kernel Fusion for Efficient Rotation Invariant Descriptors in Local Feature Matching
论文作者
论文摘要
在旋转较大的变化存在下,局部特征描述符的性能会降低。为了解决这个问题,我们提出了一种有效的学习旋转不变描述符的方法。具体而言,我们提出了旋转的内核融合(RKF),该融合将旋转在卷积内核上施加旋转,以提高CNN的固有性质。由于可以通过随后的重新参数来处理RKF,因此在推理阶段不会引入额外的计算成本。此外,我们提出了多面特征聚合(MOFA),该特征聚合(MOFA)聚集了从输入图像的多个旋转版本中提取的特征,并可以通过利用蒸馏策略来提供辅助知识来训练RKF。我们将蒸馏RKF型号称为DRKF。除了对公共数据集HPatches的旋转版本的评估外,我们还贡献了一个名为Diversebev的新数据集,该数据集在无人机飞行过程中收集,并包含具有较大视点变化和摄像机旋转的Bird's Eye View Image图像。广泛的实验表明,当暴露于较大的旋转变化时,我们的方法可以胜过其他最先进的技术。
The performance of local feature descriptors degrades in the presence of large rotation variations. To address this issue, we present an efficient approach to learning rotation invariant descriptors. Specifically, we propose Rotated Kernel Fusion (RKF) which imposes rotations on the convolution kernel to improve the inherent nature of CNN. Since RKF can be processed by the subsequent re-parameterization, no extra computational costs will be introduced in the inference stage. Moreover, we present Multi-oriented Feature Aggregation (MOFA) which aggregates features extracted from multiple rotated versions of the input image and can provide auxiliary knowledge for the training of RKF by leveraging the distillation strategy. We refer to the distilled RKF model as DRKF. Besides the evaluation on a rotation-augmented version of the public dataset HPatches, we also contribute a new dataset named DiverseBEV which is collected during the drone's flight and consists of bird's eye view images with large viewpoint changes and camera rotations. Extensive experiments show that our method can outperform other state-of-the-art techniques when exposed to large rotation variations.