基于本地化差分隐私的多表星形连接查询
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP311

基金项目:

国家自然科学基金(62072156)


Multi-table Star-JOIN Queries Based on Local Differential Privacy
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    基于本地化差分隐私多关系表示上的Star-JOIN查询已得到研究者广泛关注. 现有基于OLH机制与层次树结构的Star-JOIN查询算法存在根节点泄露隐私风险、τ-截断机制没有给出如何选择合适τ值等问题. 针对现有算法存在的不足, 提出一种有效且满足本地化差分隐私的Star-JOIN查询算法LPRR-JOIN (longitudinal path random response for join). 该算法充分利用层次树的纵向路径结构与GRR机制, 设计一种纵向本地扰动算法LPRR, 该算法以所有属性纵向路径上的节点组合作为扰动值域. 每个用户把自身元组映射到相应节点组合中, 再利用GRR机制对映射后的元组进行本地扰动. 为了避免事实表上存在的频率攻击, LPRR-JOIN算法允许每个用户利用阈值τ本地截断自身元组个数, 大于τ条元组删减、小于τ条元组补充. 为了寻找合适的τ值, LPRR-JOIN算法利用τ-截断带来的偏差与扰动方差构造总体误差函数, 通过优化误差目标函数获得τ值; 其次结合用户分组策略获得τ值的总体分布, 再利用中位数获得合适的τ值. LPRR-JOIN算法与现有算法在3种多关系数据集上进行比较, 实验结果表明其响应查询算法优于同类算法.

    Abstract:

    Star-JOIN queries based on local differential privacy (LDP) have attracted a lot of attention from researchers in recent years. Existing Star-JOIN queries based on the OLH mechanism and hierarchical tree structures face issues such as privacy leakage risks at the root node and the lack of guidance on selecting an appropriate τ value for the τ-truncation mechanism. To remedy the shortcomings of the existing algorithms, this study proposes an effective Star-JOIN query algorithm, longitudinal path random response for join (LPRR-JOIN), to satisfy the requirements of LDP. In the LPRR-JOIN algorithm, full advantage is taken of the longitudinal path structure of the hierarchical tree and the GRR mechanism to propose an algorithm called LPRR to perturb users’ tuples. This algorithm utilizes the combinations of nodes along the longitudinal paths of all attributes as the perturbation domain. In the LPRR-JOIN algorithm, tuples are mapped by each user to corresponding node combinations, followed by local perturbation of the mapped tuples using the GRR mechanism. To guard against frequency attacks on the fact table, the algorithm permits users to locally truncate the count of their tuples based on a threshold τ, where tuples are deleted if their count exceeds τ and supplemented if it falls below τ. Two solutions are proposed within LPRR-JOIN to compute the optimal τ value. The first is to solve the optimization equation over bias caused by τ-truncation and perturbation variance due to LPRR. The second is to obtain the distribution of τ under the constraints of LDP and compute the median value from the distribution. The LPRR-JOIN algorithm employs an overall error function constructed from the bias and perturbation variance resulting from τ-truncation to derive an optimal τ value through the optimization of the error objective function. Additionally, by integrating a user grouping strategy, the algorithm ascertains the overall distribution of τ values and identifies a suitable τ value using the median. When compared with current algorithms across three diverse multi-relational datasets, experimental outcomes demonstrate the superiority of the LPRR-JOIN algorithm in query response performance.

    参考文献
    相似文献
    引证文献
引用本文

张啸剑,曹小杰,王宁,孟小峰.基于本地化差分隐私的多表星形连接查询.软件学报,,():1-21

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2023-08-06
  • 最后修改日期:2023-11-08
  • 录用日期:
  • 在线发布日期: 2024-11-01
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号