信息安全研究 ›› 2025, Vol. 11 ›› Issue (7): 670-.

• 学术论文 • 上一篇    

融合语义的个性化差分隐私轨迹发布方案

张牙1刘凤春1,2,3杨光辉1,2,3张春英1,2,3任静1,2   

  1. 1(华北理工大学理学院河北唐山063210)
    2(河北省数据科学与应用重点实验室(华北理工大学)河北唐山063210)
    3(唐山市工程计算重点实验室(华北理工大学)河北唐山063210)
  • 出版日期:2025-07-29 发布日期:2025-07-29
  • 通讯作者: 张牙 硕士.主要研究方向为大数据安全与隐私保护、数据挖掘. zy422314@163.com
  • 作者简介:张牙 硕士.主要研究方向为大数据安全与隐私保护、数据挖掘. zy422314@163.com 刘凤春 博士,教授.主要研究方向为网络安全、数据挖掘和机器学习. lnobliu@ncst.edu.cn 杨光辉 博士,讲师.主要研究方向为网络与空间安全、数据挖掘和深度学习. yangguanghui@ncst.edu.cn 张春英 博士,教授,CCF高级会员.主要研究方向为网络与空间安全、人工智能和复杂网络智能分析. hblg_zcy@126.com 任静 硕士,助教.主要研究方向为数据挖掘、粗糙集. renjing@ncst.edu.cn

Personalized Differential Privacy Trajectory Publishing Scheme  Fusing Semantic

Zhang Ya1, Liu Fengchun1,2,3, Yang Guanghui1,2,3, Zhang Chunying1,2,3 , and Ren Jing1,2   

  1. 1(College of Science, North China University of Science and Technology, Tangshan, Hebei 063210)
    2(Hebei Key Laboratory of Data Science and Application (North China University of Science and Technology), Tangshan, Hebei 063210)
    3(The Key Laboratory of Engineering Computing in Tangshan City (North China University of Science and Technology), Tangshan, Hebei 063210)
  • Online:2025-07-29 Published:2025-07-29

摘要: 轨迹数据库中包含大量用户的信息,直接将其发布可能会导致个人敏感信息的泄露.用户的位置语义信息中包含大量日常活动和访问偏好信息,现有个性化差分隐私轨迹发布方案对于位置点隐私级别的判定未考虑位置点间的语义信息,仍然存在隐私性和数据可用性之间的不平衡问题.为解决上述问题,提出一种融合语义的个性化差分隐私轨迹发布方案(PRTDP),根据用户自身轨迹的移动特性进行动态隐私级别判定.首先,提出敏感位置点判定算法.利用DBSCAN聚类算法得到用户敏感位置点.接着,提出一种个性化隐私级别划分算法.基于位置点间的语义信息构建敏感位置点关系有向图模型,设计改进的PageRank算法确定位置点的隐私级别,将相应隐私级别的拉普拉斯噪声加入轨迹数据中并发布.PRTDP方案能够有效地保护用户的敏感信息,并提高轨迹数据的可用性,实验证明该方案在隐私保护程度、可用性和时间效率3个方面优于现有方案NFRP算法和FPT算法.

关键词: 个性化差分隐私, 轨迹隐私保护, PageRank算法, 轨迹数据发布, 隐私预算

Abstract: Trajectory databases contain massive information, and direct release may lead to the disclosure of personal sensitive information. The location semantic information of users encompasses abundant details about daily activities and access preferences. The existing personalized differential privacy trajectory publishing scheme does not consider the semantic information between location points in determining the privacy level, and there is still an imbalance between privacy and data availability. To solve the above problems, a semantically integrated personalized differential privacy trajectory publishing scheme (PRTDP) is proposed, which determines the dynamic privacy level according to the mobile characteristics of the user’s own trajectory. Firstly, an algorithm for determining sensitive location points is proposed. The DBSCAN clustering algorithm is used to obtain the user’s sensitive location points. Then, a personalized privacy level partitioning algorithm is proposed. By leveraging the semantic information between the location points, we construct a digraph model of the sensitive location point relationships and design an enhanced PageRank algorithm to determine the privacy level of the location points. Laplace noise corresponding to the privacy level is added to the trajectory data before publication. PRTDP scheme can effectively protect the sensitive information of users while enhancing trajectory data usability of trajectory data. Experiments show that the scheme outperforms the existing schemes NFRP algorithm and FPT algorithm in three dimensions: privacy protection degree, availability and time efficiency.

Key words: personalized differential privacy, trajectory privacy protection, PageRank algorithm, trajectory data publication, privacy budget

中图分类号: