北京邮电大学学报

  • EI核心期刊

北京邮电大学学报

• •    

基于自适应梯度裁剪的差分隐私保护算法

周亚建1,由永桥2,王宇2,孙良友3,顾正龙4   

  1. 1. 北京邮电大学网络空间安全学院
    2. 烟台市大数据中心
    3. 烟台海颐软件股份有限公司
    4. 北京邮电大学灾备技术国家工程研究中心
  • 收稿日期:2024-05-07 修回日期:2024-09-09 发布日期:2024-11-22
  • 通讯作者: 周亚建

A Differential Privacy Protection Algorithm based on Adaptive Gradient Clipping Of GAN

  • Received:2024-05-07 Revised:2024-09-09 Published:2024-11-22
  • Contact: Yajian ZHOU

摘要: 本文提出一种基于GAN的自适应梯度裁剪差分隐私保护算法DPGAN-AGC(Differential Privacy GAN based on Adaptive Gradient Clipping),能够在保证数据可用性的同时实现隐私保护。该算法以DPSGD(Differential Privacy Stochastic Gradient Descent)模型为基础,在GAN的博弈学习过程中通过差分隐私机制向梯度注入高斯噪声,并根据当前的梯度值及公开训练样本集,迭代更新梯度裁剪阈值,从而灵活地控制注入噪声的规模,提升生成数据的可用性。同时,DPGAN-AGC算法利用时刻统计法实现对隐私损失的自动跟踪计算。DPGAN-AGC算法不仅实现了模型效用和安全性的均衡,而且解决了噪声设计对模型效用影响较大等问题。实验结果表明,相对DPSGD模型,DPGAN-AGC生成的图像质量更好,表现为更高的Inception Score;生成的数据集可用性更高,以之训练所得CNN分类器的准确率可提升2%-8%;抗成员推理攻击(Membership Inference Attacks,MIAs)能力更强,MIA成功率更接近50%。

关键词: 隐私保护, 生成对抗网络, 差分隐私, 梯度裁剪

Abstract: A differential privacy protection algorithm, DPGAN-AGC (Differential Privacy GAN based on Adaptive Gradient Clipping), is proposed on the basis of DPSGD (Differential Privacy Stochastic Gradient Descent) model, in order to better balance privacy preserving and data availability. When Gaussian noise is injected into the gradient of GAN through the differential privacy mechanism during its game-theoretic learning process, an adaptive strategy of updating gradient threshold is adopted, which can iteratively optimize the threshold of gradient clipping based on the public sample set as well as current gradient value. As a result, it is advantageous to control the scale of injected noise flexibly and improve the availability of generated data evidently. Meanwhile, the Moments Accountant is introduced to automatically track loss of the privacy. The DPGAN-AGC algorithm can not only balance utility and security properties of generative models, but also provide a solution to existing problems including inappropriate impact of noise designing, etc. Experimental results show that the proposed DPGAN-AGC outperforms DPSGD in several behaviors including better image quality in terms of higher Inception Score, higher availability of generated data demonstrated by a 2%-8% improvement in classification accuracy, and stronger resistance to Membership Inference Attacks (MIAs) measured by closer success rate to 50%.

Key words: privacy preserving, Generative Adversarial Networks, Differential privacy, Gradient Clipping

中图分类号: