Journal of Information Security Research ›› 2024, Vol. 10 ›› Issue (10): 967-.

• Technology and Application •

Privacy-Preserving Scheme for SVM Training Based on Mini-batch Stochastic Gradient Descent

Wang Jiechang1, Liu Yuling2, Zhang Ping3,4, Liu Muhua3, and Zhao Xinhui1

  1. Sports Big Data Center, Physical Education College of Zhengzhou University, Zhengzhou 450044
  2. Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100085
  3. School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang, Henan 471023
  4. Intelligent System Science and Technology Innovation Center, Longmen Laboratory, Luoyang, Henan 471023
  • Online: 2024-10-15  Published: 2024-10-26
  • Corresponding author: Liu Yuling, PhD, professor-level senior engineer, doctoral supervisor. Main research interests: network security situation awareness and cybersecurity big data analysis. liuyuling@iie.ac.cn
  • About the authors:
    Wang Jiechang, MS, lecturer. Main research interests: cryptography, privacy-preserving machine learning, and blockchain security. wangjiechang@126.com
    Liu Yuling, PhD, professor-level senior engineer, doctoral supervisor. Main research interests: network security situation awareness and cybersecurity big data analysis. liuyuling@iie.ac.cn
    Zhang Ping, PhD, professor. Main research interests: cryptography and information security. zhangping76@126.com
    Liu Muhua, PhD, associate professor. Main research interests: cryptography and information security. lxk0379@126.com
    Zhao Xinhui, PhD, associate professor. Main research interests: information security. xinhui_zhao@126.com

Privacypreserving Scheme for SVM Training Based on Minibatch SGD

Wang Jiechang1, Liu Yuling2, Zhang Ping3,4, Liu Muhua3, and Zhao Xinhui1   

  1. 1(Sports Big Data Center, Physical Education College of Zhengzhou University, Zhengzhou 450044)
    2(Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100085)
    3(School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang, Henan 471023)
    4(Intelligent System Science and Technology Innovation Center, Longmen Laboratory, Luoyang, Henan 471023)
  • Online:2024-10-15 Published:2024-10-26



Abstract: When a support vector machine (SVM) is used to process sensitive data, privacy protection is essential. Existing SVM privacy-preserving schemes train with the batch gradient descent (BGD) algorithm and therefore incur a huge computational overhead. To address this problem, this paper proposes a privacy-preserving scheme for SVM training based on mini-batch stochastic gradient descent (mini-batch SGD). First, an SVM training algorithm based on mini-batch SGD is designed. On this basis, the model weights are perturbed multiplicatively, with the hardness assumption of the integer factorization problem guaranteeing model privacy; the data are encrypted with a homomorphic cryptosystem before SVM training is performed, and a homomorphic hash function is then applied for verification. Finally, the complete SVM privacy-preserving scheme is constructed. Against the stated security threats, data privacy, model privacy, and model correctness are demonstrated. Simulation experiments and analysis show that the proposed scheme achieves classification performance close to that of existing schemes while saving 92.4% of the computation time on average.
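The training step underlying the scheme is standard mini-batch SGD on the L2-regularized hinge loss. As a minimal plaintext illustration of that step only (a sketch, not the paper's protocol: the multiplicative perturbation, homomorphic encryption, and verification layers are omitted, and all hyperparameter values are assumed for the example):

```python
import random

def train_svm_minibatch_sgd(X, y, lr=0.01, lam=0.01, batch_size=8, epochs=100, seed=0):
    """Train a linear SVM (hinge loss + L2 regularization) with mini-batch SGD.

    X: list of feature vectors; y: labels in {-1, +1}.
    Returns the weight vector w and bias b.
    """
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    w = [0.0] * d
    b = 0.0
    idx = list(range(n))
    for _ in range(epochs):
        rng.shuffle(idx)                      # new random mini-batches each epoch
        for start in range(0, n, batch_size):
            batch = idx[start:start + batch_size]
            gw = [lam * wj for wj in w]       # gradient of (lam/2) * ||w||^2
            gb = 0.0
            for i in batch:
                margin = y[i] * (sum(wj * xj for wj, xj in zip(w, X[i])) + b)
                if margin < 1:                # hinge-loss subgradient on margin violators
                    for j in range(d):
                        gw[j] -= y[i] * X[i][j] / len(batch)
                    gb -= y[i] / len(batch)
            w = [wj - lr * gj for wj, gj in zip(w, gw)]
            b -= lr * gb
    return w, b

def predict(w, b, x):
    """Sign of the decision function."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1
```

Each update touches only one mini-batch rather than the full dataset, which is the source of the computational savings over BGD-based schemes reported in the abstract.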

Key words: Minibatch SGD, SVM, homomorphic encryption, homomorphic hash function, privacypreserving
