匹配它们：视觉上可解释的几片图像分类

论文标题

匹配它们：视觉上可解释的几片图像分类

Match Them Up: Visually Explainable Few-shot Image Classification

论文作者

Wang, Bowen, Li, Liangzhi, Verma, Manisha, Nakashima, Yuta, Kawasaki, Ryo, Nagahara, Hajime

论文摘要

很少有学习方法（FSL）方法通常是基于一个假设，即可以从基础（SEED）类别获得预训练的知识，并且可以很好地转移到新颖（看不见的）类别中。但是，没有保证，尤其是对于后一部分。在大多数FSL方法中，此问题导致推理过程的未知性质，这阻碍了其在某些风险敏感领域的应用。在本文中，我们使用骨干模型和新出现的可解释分类器产生的视觉表示形式揭示了一种用于图像分类的FSL的新方法。加权表示仅包括最少数量的可区分特征，可视化的权重可以作为FSL过程的信息提示。最后，一个歧视器将比较支持集和查询集中每对图像的表示。分数最高的对将决定分类结果。实验结果证明，所提出的方法可以在三个主流数据集上实现良好的准确性和令人满意的解释性。

Few-shot learning (FSL) approaches are usually based on an assumption that the pre-trained knowledge can be obtained from base (seen) categories and can be well transferred to novel (unseen) categories. However, there is no guarantee, especially for the latter part. This issue leads to the unknown nature of the inference process in most FSL methods, which hampers its application in some risk-sensitive areas. In this paper, we reveal a new way to perform FSL for image classification, using visual representations from the backbone model and weights generated by a newly-emerged explainable classifier. The weighted representations only include a minimum number of distinguishable features and the visualized weights can serve as an informative hint for the FSL process. Finally, a discriminator will compare the representations of each pair of the images in the support set and the query set. Pairs with the highest scores will decide the classification results. Experimental results prove that the proposed method can achieve both good accuracy and satisfactory explainability on three mainstream datasets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题