论文标题
隐私权合成智能电表数据
Privacy-Preserving Synthetic Smart Meters Data
论文作者
论文摘要
功耗数据非常有用,因为它允许优化电网,检测异常和防止故障,除了对多样化的研究目的有用。但是,使用功耗数据引起了重大隐私问题,因为此数据通常属于电力公司的客户。作为解决方案,我们提出了一种生成忠实模仿原件但与客户及其身份的合成功耗样本的方法。我们的方法基于生成对抗网络(GAN)。我们的贡献是双重的。首先,我们专注于生成数据的质量,这不是一项琐碎的任务,因为没有标准评估方法可用。然后,我们研究了提供给我们神经网络培训集的成员提供的隐私保证。作为对隐私的最低要求,我们要求我们的神经网络对会员推理攻击具有牢固的态度,因为除了自行呈现隐私威胁外,它们还为进一步攻击提供了门户。我们发现,在算法提供的隐私和性能之间存在妥协。
Power consumption data is very useful as it allows to optimize power grids, detect anomalies and prevent failures, on top of being useful for diverse research purposes. However, the use of power consumption data raises significant privacy concerns, as this data usually belongs to clients of a power company. As a solution, we propose a method to generate synthetic power consumption samples that faithfully imitate the originals, but are detached from the clients and their identities. Our method is based on Generative Adversarial Networks (GANs). Our contribution is twofold. First, we focus on the quality of the generated data, which is not a trivial task as no standard evaluation methods are available. Then, we study the privacy guarantees provided to members of the training set of our neural network. As a minimum requirement for privacy, we demand our neural network to be robust to membership inference attacks, as these provide a gateway for further attacks in addition to presenting a privacy threat on their own. We find that there is a compromise to be made between the privacy and the performance provided by the algorithm.