论文标题

标志框架用于联合学习算法的比较分析

FLAGS Framework for Comparative Analysis of Federated Learning Algorithms

论文作者

Lodhi, Ahnaf Hannan, Akgün, Barış, Özkasap, Öznur

论文摘要

联合学习(FL)已成为分布式机器学习的关键选择。 FL的最新工作最初集中于集中式聚合,强调了更大的分散化,以适应高度异质的网络边缘。其中,分层,设备对设备和八卦联合学习(分别为HFL,D2DFL \&GFL)可以将其视为采用基本聚合策略的基础FL算法。随后提出了许多FL算法,共同采用了多个基本聚合方案。然而,现有的研究将FL算法对各种条件进行了对,并对这些算法的性能进行了分析,主要是针对联邦平均(FedAvg)。这项工作巩固了FL景观,并通过针对广泛的运营条件进行全面的交叉评估,对主要FL算法进行客观分析。除了三种基础FL算法外,这项工作还分析了六种衍生算法。为了启用统一的评估,一个名为标志的多FL框架:已开发出用于快速配置多种FL算法的联合学习算法模拟。我们的实验表明,完全分散的FL算法在多个工作条件下(包括异步聚集和Stragglers的存在)实现了可比的精度。此外,分散的FL还可以在嘈杂的环境中运行,并且本地更新速率相当较高。但是,极度偏斜的数据分布对分散FL的影响比集中式变体更加不利。结果表明,可能没有必要将设备限制为单个FL算法。相反,多FL节点可能会以更高的效率运行。

Federated Learning (FL) has become a key choice for distributed machine learning. Initially focused on centralized aggregation, recent works in FL have emphasized greater decentralization to adapt to the highly heterogeneous network edge. Among these, Hierarchical, Device-to-Device and Gossip Federated Learning (HFL, D2DFL \& GFL respectively) can be considered as foundational FL algorithms employing fundamental aggregation strategies. A number of FL algorithms were subsequently proposed employing multiple fundamental aggregation schemes jointly. Existing research, however, subjects the FL algorithms to varied conditions and gauges the performance of these algorithms mainly against Federated Averaging (FedAvg) only. This work consolidates the FL landscape and offers an objective analysis of the major FL algorithms through a comprehensive cross-evaluation for a wide range of operating conditions. In addition to the three foundational FL algorithms, this work also analyzes six derived algorithms. To enable a uniform assessment, a multi-FL framework named FLAGS: Federated Learning AlGorithms Simulation has been developed for rapid configuration of multiple FL algorithms. Our experiments indicate that fully decentralized FL algorithms achieve comparable accuracy under multiple operating conditions, including asynchronous aggregation and the presence of stragglers. Furthermore, decentralized FL can also operate in noisy environments and with a comparably higher local update rate. However, the impact of extremely skewed data distributions on decentralized FL is much more adverse than on centralized variants. The results indicate that it may not be necessary to restrict the devices to a single FL algorithm; rather, multi-FL nodes may operate with greater efficiency.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源