Differentially Private Algorithm for Graphical Bandits

doi:10.13328/j.cnki.jos.006386

微信服务号

微信订阅号

2025-5-11- 11

Home > Archive>Volume 33, Issue 9, 2022 >3223-3235. DOI:10.13328/j.cnki.jos.006386

PDF HTML XML Export Cite reminder

Differentially Private Algorithm for Graphical Bandits
DOI:
                        10.13328/j.cnki.jos.006386
                    
Author:
                        LU Shi-YinLU Shi-Yin
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
WANG Guang-HuiWANG Guang-Hui
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
QIU Zi-HaoQIU Zi-Hao
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHANG Li-JunZHANG Li-Jun
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:TP181
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Graphical bandit is an important model for sequential decision making under uncertainty and has been applied in various real-world scenarios such as social network, electronic commerce, and recommendation system. Existing work on graphical bandits only investigates how to identify the best arm rapidly so as to minimize the cumulative regret while ignoring the privacy protection issue arising in many real-world applications. To overcome this deficiency, a differentially private algorithm is proposed, termed as graph-based arm elimination with differential privacy (GAP), for graphical bandits. On the one hand, GAP updates the arm selection strategy based on empirical mean rewards of arms in an epoch manner. The empirical mean rewards are perturbed by Laplace noise, which makes it hard for malicious attackers to infer rewards of arms from the output of the algorithm, and thus protects the privacy. On the other hand, in each epoch, GAP carefully constructs an independent set of the feedback graph and only explores arms in the independent set, which effectively utilize the information in the graph feedback. It is proved that GAP is differential private and its regret bound matches the theoretical lower bound. Experimental results on synthetic datasets demonstrate that GAP can effectively protect the privacy and achieve cumulative regret comparable to that of existing non-private graphical bandits algorithm.

Key words:graphical bandits;differential privacy;sequential decision making under uncertainty;independent set;Laplace noise

Get Citation

卢世银,王广辉,邱梓豪,张利军.一种满足差分隐私的图赌博机算法.软件学报,2022,33(9):3223-3235

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:March 31,2021
Revised:May 04,2021
Adopted:
Online: June 15,2022
Published: September 06,2022

You are the first2043763Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History