Abstract: Differential privacy, with its strong privacy guarantees, has been applied to random forest algorithms to address the problem of privacy leakage. However, directly applying differential privacy to a random forest severely reduces the classification accuracy of the model. To ease this tension between privacy protection and model accuracy, this paper proposes a novel differentially private random forest training algorithm, called eDPRF. Specifically, we design a decision tree construction method based on the permute-and-flip mechanism, exploiting its efficient query output to design corresponding utility functions that accurately select split features and leaf labels. In addition, we design a privacy budget allocation strategy based on the composition theorem, which improves the budget utilization of each node by drawing training subsets via sampling without replacement and adjusting the internal budgets in a differentiated manner. Finally, the privacy analysis and experimental results show that the proposed algorithm outperforms comparable algorithms in classification accuracy under the same privacy budget.
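To make the selection step concrete, the sketch below shows a generic permute-and-flip selection routine (McKenna and Sheldon, 2020) of the kind the abstract refers to for choosing split features or leaf labels. The utility function, sensitivity value, and candidate names here are illustrative assumptions, not the exact formulation used by eDPRF; the exponent uses the standard 2Δ scaling for non-monotonic utilities.

```python
import math
import random

def permute_and_flip(candidates, utility, epsilon, sensitivity):
    """Differentially private selection via permute-and-flip:
    visit candidates in uniformly random order and accept each
    with probability exp(eps * (q(r) - q_max) / (2 * sensitivity))."""
    scores = {c: utility(c) for c in candidates}
    q_max = max(scores.values())
    order = list(candidates)
    random.shuffle(order)
    for c in order:
        accept_prob = math.exp(epsilon * (scores[c] - q_max) / (2.0 * sensitivity))
        if random.random() <= accept_prob:
            return c
    # Unreachable in theory: a maximizer is accepted with probability 1.
    return order[-1]

# Illustrative usage: pick a split feature by (hypothetical) information gain
# scores, spending a per-node budget of eps_node with sensitivity delta_q.
gain = {"age": 0.31, "income": 0.27, "zipcode": 0.05}
split = permute_and_flip(list(gain), lambda f: gain[f], epsilon=0.1, sensitivity=1.0)
print("chosen split feature:", split)
```

Compared with the exponential mechanism, permute-and-flip never assigns a candidate a higher selection probability for the same budget to a worse candidate, which is the "efficient query output" advantage the abstract alludes to.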