异步：与基于欧几里得距离的自适应体重聚集的异步联合学习

论文标题

异步：与基于欧几里得距离的自适应体重聚集的异步联合学习

AsyncFedED: Asynchronous Federated Learning with Euclidean Distance based Adaptive Weight Aggregation

论文作者

Wang, Qiyuan, Yang, Qianqian, He, Shibo, Shi, Zhiguo, Chen, Jiming

论文摘要

在异步联合学习框架中，服务器一旦从客户端收到更新，而不是等待所有更新到达同步设置，则该服务器将更新全局模型。这允许具有多种计算能力的异质设备可以训练本地模型而无需暂停，从而加快了训练过程。但是，它引入了陈旧的模型问题，其中新到达更新是根据一组比当前的全局模型更古老的陈旧权重计算得出的，这些陈旧的重量可能会损害模型的收敛性。在本文中，我们提出了一个异步联合学习框架，该框架具有建议的自适应重量聚集算法，称为异步。据我们所知，这种聚合方法是第一个将到达梯度的彻底稳定性的方法，该方法是通过陈旧模型和当前全局模型之间的欧几里得距离以及已执行的本地时代的数量来衡量的。假设一般的非凸损失函数，我们从理论上证明了所提出的方法的收敛性。数值结果验证了与三个被考虑任务的现有方法相比，根据收敛速率和模型准确性所提出的异步的有效性。

In an asynchronous federated learning framework, the server updates the global model once it receives an update from a client instead of waiting for all the updates to arrive as in the synchronous setting. This allows heterogeneous devices with varied computing power to train the local models without pausing, thereby speeding up the training process. However, it introduces the stale model problem, where the newly arrived update was calculated based on a set of stale weights that are older than the current global model, which may hurt the convergence of the model. In this paper, we present an asynchronous federated learning framework with a proposed adaptive weight aggregation algorithm, referred to as AsyncFedED. To the best of our knowledge this aggregation method is the first to take the staleness of the arrived gradients, measured by the Euclidean distance between the stale model and the current global model, and the number of local epochs that have been performed, into account. Assuming general non-convex loss functions, we prove the convergence of the proposed method theoretically. Numerical results validate the effectiveness of the proposed AsyncFedED in terms of the convergence rate and model accuracy compared to the existing methods for three considered tasks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题