关于使用未编码的组键的信息理论安全聚合

论文标题

关于使用未编码的组键的信息理论安全聚合

On the Information Theoretic Secure Aggregation with Uncoded Groupwise Keys

论文作者

Wan, Kai, Yao, Xin, Sun, Hua, Ji, Mingyue, Caire, Giuseppe

论文摘要

安全汇总是联合学习的核心组成部分，是从中央服务器的分布式用户汇总的本地训练模型。此类汇总的``安全''性质包括以下事实：除了聚合的本地模型外，不得将有关本地用户数据的信息泄漏到服务器。为了保证安全性，可以在用户之间共享一些密钥。在密钥共享阶段之后，每个用户都会掩盖其训练的模型，然后将其发送到服务器。本文遵循Zhao and Sun最初提出的信息理论安全聚合问题，目的是表征模型聚合阶段K用户的最低通信成本。由于用户辍学，服务器可能无法从用户接收所有消息。安全的聚合方案应容忍最多$ K-U $用户的辍学。最佳通信成本的特征是Zhao和Sun，但假设用户存储的键可能是具有任意依赖性的任何随机变量。关于未编码的群键更方便的动机，可以共享，并且可以在联合学习以外的大量应用中使用，在本文中，我们假设密钥变量是相互独立的，并且每个键都由一组S用户共享。据我们所知，所有现有的安全聚合计划都为用户分配了编码密钥。我们表明，如果$ s> k-u $，具有未编码的GroupWise键的新的安全聚合方案可以实现与使用编码键的最佳方案相同的最佳通信成本；如果$ s \ leq k-u $，则无编码的GroupWise密钥共享严格是最佳的。最后，我们还将我们提出的安全汇总方案实施到亚马逊EC2中，然后将其与现有的安全汇总方案和离线密钥共享进行比较。

Secure aggregation, which is a core component of federated learning, aggregates locally trained models from distributed users at a central server. The ``secure'' nature of such aggregation consists of the fact that no information about the local users' data must be leaked to the server except the aggregated local models. In order to guarantee security, some keys may be shared among the users. After the key sharing phase, each user masks its trained model which is then sent to the server. This paper follows the information theoretic secure aggregation problem originally formulated by Zhao and Sun, with the objective to characterize the minimum communication cost from the K users in the model aggregation phase. Due to user dropouts, the server may not receive all messages from the users. A secure aggregation scheme should tolerate the dropouts of at most $K-U$ users. The optimal communication cost is characterized by Zhao and Sun, but with the assumption that the keys stored by the users could be any random variables with arbitrary dependency. On the motivation that uncoded groupwise keys are more convenient to be shared and could be used in large range of applications besides federated learning, in this paper we assume the key variables are mutually independent and each key is shared by a group of S users. To the best of our knowledge, all existing secure aggregation schemes assign coded keys to the users. We show that if $S> K-U$, a new secure aggregation scheme with uncoded groupwise keys can achieve the same optimal communication cost as the best scheme with coded keys; if $S \leq K-U$, uncoded groupwise key sharing is strictly sub-optimal. Finally, we also implement our proposed secure aggregation scheme into Amazon EC2, which are then compared with the existing secure aggregation schemes with offline key sharing.

下载PDF全文

下载文献需遵守相关版权规定

论文标题