联合自我监督学习的特征相关引导的知识转移

论文标题

联合自我监督学习的特征相关引导的知识转移

Feature Correlation-guided Knowledge Transfer for Federated Self-supervised Learning

论文作者

Liu, Yi, Guo, Song, Zhang, Jie, Zhou, Qihua, Wang, Yingchun, Zhao, Xiaohan

论文摘要

为了消除对传统联邦学习（FL）监督模型培训的全标签数据的要求，对FL的应用（SSL）方法广泛关注，以解决标签稀缺问题。以前关于联合SSL的工作通常分为两类：基于参数的模型聚合（即FedAvg，适用于同质案例）或基于数据的特征共享（即知识蒸馏，适用于异质案例），以实现多个未遗嘱未遗嘱的客户之间的知识传递。尽管取得了进展，但所有这些都不可避免地依赖于一些假设，例如均质模型或其他公共数据集的存在，这阻碍了培训框架的普遍性，以实现更一般的场景。因此，在本文中，我们提出了一种具有基于特征相关的聚合（FEDFOA）的新颖而通用的方法，称为联邦自我监督的学习，以一种沟通效率和隐私的方式来应对上述限制。我们的见解是利用功能相关性来对齐功能映射，并在其本地培训过程中校准跨客户的本地模型更新。更具体地说，我们设计了一种基于分解的方法来从局部表示中提取交叉功能关系矩阵。然后，关系矩阵可以被视为语义信息的载体，以执行聚合阶段。我们证明FedFOA是一种模型不足的训练框架，很容易与最新的无监督FL方法兼容。广泛的经验实验表明，我们提出的方法的表现优于最先进的方法。

To eliminate the requirement of fully-labeled data for supervised model training in traditional Federated Learning (FL), extensive attention has been paid to the application of Self-supervised Learning (SSL) approaches on FL to tackle the label scarcity problem. Previous works on Federated SSL generally fall into two categories: parameter-based model aggregation (i.e., FedAvg, applicable to homogeneous cases) or data-based feature sharing (i.e., knowledge distillation, applicable to heterogeneous cases) to achieve knowledge transfer among multiple unlabeled clients. Despite the progress, all of them inevitably rely on some assumptions, such as homogeneous models or the existence of an additional public dataset, which hinder the universality of the training frameworks for more general scenarios. Therefore, in this paper, we propose a novel and general method named Federated Self-supervised Learning with Feature-correlation based Aggregation (FedFoA) to tackle the above limitations in a communication-efficient and privacy-preserving manner. Our insight is to utilize feature correlation to align the feature mappings and calibrate the local model updates across clients during their local training process. More specifically, we design a factorization-based method to extract the cross-feature relation matrix from the local representations. Then, the relation matrix can be regarded as a carrier of semantic information to perform the aggregation phase. We prove that FedFoA is a model-agnostic training framework and can be easily compatible with state-of-the-art unsupervised FL methods. Extensive empirical experiments demonstrate that our proposed approach outperforms the state-of-the-art methods by a significant margin.

下载PDF全文

下载文献需遵守相关版权规定

论文标题