eDPRF: Efficient Differential Privacy Random Forest Training Algorithm

doi:10.13328/j.cnki.jos.007332

2025, Vol. 36, No. 7, pp. 1-18. DOI:10.13328/j.cnki.jos.007332

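Corresponding author: ZHAO Chen-Bin, E-mail: chenbinzhao96@whu.edu.cn
CLC number: TP18
Funding: National Natural Science Foundation of China (61702341); Shenzhen Technology University Stable Support Program for Higher Education Institutions of Shenzhen (SZWD2021012); Shenzhen Technology University Graduate School-Enterprise Cooperation Research Fund (20223108010009)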


Authors:

WANG Shu-Lan (王树兰)
College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China
QIU Yao (邱瑶)
College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China
ZHAO Chen-Bin (赵陈斌)
Key Laboratory of Aerospace Information Security and Trusted Computing of Ministry of Education (School of Cyber Science and Engineering, Wuhan University), Wuhan 430072, China
ZOU Jia-Xu (邹家须)
College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China
WANG Cai-Fen (王彩芬)
College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China

Abstract:

Differential privacy, owing to its strong privacy guarantees, has been applied to the random forest algorithm to address its privacy leakage problem. However, applying differential privacy directly to random forests severely degrades the model's classification accuracy. To balance privacy protection against model accuracy, this study proposes eDPRF (efficient differential privacy random forest), an efficient differentially private random forest training algorithm. Specifically, the study designs a decision tree construction method based on the permute-and-flip mechanism. It exploits the mechanism's efficient query output and designs corresponding utility functions so that split features and labels are output accurately, which improves the tree model's ability to learn from the data under perturbation. In addition, the study designs a privacy budget allocation strategy based on the composition theorem: training subsets are drawn by sampling without replacement and the budget inside each tree is adjusted in a differentiated way, which increases the query budget available to tree nodes. Finally, theoretical analysis and experimental evaluation show that, under the same privacy budget, the proposed algorithm achieves higher classification accuracy than comparable algorithms.
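The permute-and-flip mechanism named in the abstract can be illustrated with a minimal Python sketch. This is the generic mechanism as it is commonly described, not the eDPRF construction itself: the function name `permute_and_flip`, the `sensitivity` parameter, and the example utility scores are assumptions for illustration, and the paper's actual utility functions for split features and labels are not given on this page.

```python
import math
import random


def permute_and_flip(utilities, epsilon, sensitivity=1.0, rng=None):
    """Select an index privately with the generic permute-and-flip mechanism.

    utilities   -- one utility score per candidate (hypothetical values here)
    epsilon     -- privacy budget spent on this single selection
    sensitivity -- assumed bound on how much one record can change a score
    """
    rng = rng or random.Random()
    best = max(utilities)

    # Visit the candidates in a uniformly random order.
    order = list(range(len(utilities)))
    rng.shuffle(order)

    for idx in order:
        # Acceptance probability decays exponentially with the gap to the best score.
        accept = math.exp(epsilon * (utilities[idx] - best) / (2.0 * sensitivity))
        # rng.random() lies in [0, 1), so a best-scoring candidate (accept == 1.0)
        # is always accepted and the loop is guaranteed to return.
        if rng.random() < accept:
            return idx

    raise AssertionError("unreachable: a best-scoring candidate always accepts")
```

For example, `permute_and_flip([0.12, 0.47, 0.31], epsilon=1.0)` returns the index of one candidate split, favoring higher scores more strongly as `epsilon` grows.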

Key words: random forest; differential privacy; privacy budget; permute and flip; perturbation method
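The budget allocation idea in the abstract rests on the standard composition theorems: queries on the same records add up their budgets (sequential composition), while queries on disjoint records cost only the largest single budget (parallel composition). The sketch below shows one way these could be combined, under loudly stated assumptions: that sampling without replacement yields disjoint per-tree subsets, and that the "differentiated" adjustment is a simple split between leaf labels and split levels. The names `disjoint_subsets`, `split_tree_budget`, and the `leaf_share` parameter are hypothetical and do not come from the paper.

```python
import random


def disjoint_subsets(n_samples, n_trees, seed=0):
    """Partition sample indices into n_trees disjoint subsets.

    Assumption: because no record appears in two trees, parallel composition
    lets every tree spend the full forest budget epsilon.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[t::n_trees] for t in range(n_trees)]


def split_tree_budget(epsilon, depth, leaf_share=0.5):
    """Divide one tree's budget between split levels and leaf labels.

    Nodes at the same depth hold disjoint records, so each level is charged
    once; levels along a root-to-leaf path compose sequentially.
    leaf_share is a hypothetical knob, not a value taken from the paper.
    """
    if depth < 1 or not 0.0 < leaf_share < 1.0:
        raise ValueError("depth must be >= 1 and leaf_share must lie in (0, 1)")
    leaf_eps = epsilon * leaf_share           # budget for the leaf label query
    level_eps = (epsilon - leaf_eps) / depth  # budget for each split level
    return level_eps, leaf_eps
```

With `epsilon=1.0`, `depth=4`, and `leaf_share=0.5`, each split level receives 0.125 and the leaf query receives 0.5, so the total along any root-to-leaf path stays at 1.0.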

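Cite this article

WANG Shu-Lan, QIU Yao, ZHAO Chen-Bin, ZOU Jia-Xu, WANG Cai-Fen. eDPRF: Efficient differential privacy random forest training algorithm. Journal of Software (软件学报), 2025, 36(7): 1-18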

History

Received: 2024-07-10
Last revised: 2024-10-15
Published online: 2024-12-10
