论文标题
Fedeval:联合学习的整体评估框架
FedEval: A Holistic Evaluation Framework for Federated Learning
论文作者
论文摘要
联合学习(FL)已被广泛接受为无需收集原始数据的保护隐私机器学习的解决方案。尽管过去几年中提出的新技术确实发展了FL领域,但不幸的是,这些作品中提出的评估结果的完整性不足,并且由于不一致的评估指标和实验环境而几乎不可比拟。在本文中,我们提出了一个名为Fedeval的全面评估框架,并对七种最先进的FL算法进行了基准研究。具体而言,我们首先介绍了称为Fedeval-Core的核心评估分类学模型,该模型涵盖了FL的四个基本评估方面:隐私,鲁棒性,有效性和效率,以及各种定义明确的指标和实验环境。基于Fedeval-Core,我们进一步开发了一个具有标准化评估设置和易于使用的接口的FL评估平台。然后,我们在包括FedSGD,FedSGD,FedAvg,FedProx,FedOpt,FedOpt,FedStc,Secagg和Heagg在内的七种著名的FL算法之间提供了深入的基准测试研究。我们全面分析了这些算法的优势和缺点,并进一步确定了针对不同算法的合适的实践方案,这很少是通过先前的工作来完成的。最后,我们挖掘了一系列收藏见解和未来的研究方向,这对FL地区的研究人员非常有帮助。
Federated Learning (FL) has been widely accepted as the solution for privacy-preserving machine learning without collecting raw data. While new technologies proposed in the past few years do evolve the FL area, unfortunately, the evaluation results presented in these works fall short in integrity and are hardly comparable because of the inconsistent evaluation metrics and experimental settings. In this paper, we propose a holistic evaluation framework for FL called FedEval, and present a benchmarking study on seven state-of-the-art FL algorithms. Specifically, we first introduce the core evaluation taxonomy model, called FedEval-Core, which covers four essential evaluation aspects for FL: Privacy, Robustness, Effectiveness, and Efficiency, with various well-defined metrics and experimental settings. Based on the FedEval-Core, we further develop an FL evaluation platform with standardized evaluation settings and easy-to-use interfaces. We then provide an in-depth benchmarking study between the seven well-known FL algorithms, including FedSGD, FedAvg, FedProx, FedOpt, FedSTC, SecAgg, and HEAgg. We comprehensively analyze the advantages and disadvantages of these algorithms and further identify the suitable practical scenarios for different algorithms, which is rarely done by prior work. Lastly, we excavate a set of take-away insights and future research directions, which are very helpful for researchers in the FL area.