Streaming Histogram Publication Method with Differential Privacy
Author:
Affiliation:

Fund Project:

National Natural Science Foundation of China (61502146, 61303017, 61202285); National High-Tech R&D Program of China (863) (2013AA013204); Research Fund for the Doctoral Program of Higher Education of China (20130004130001); Basic Research Program of He’nan Science and Technology Department (152300410091); Research Program of the Higher Education of He’nan Educational Committee (16A520002)

  • Article
  • | |
  • Metrics
  • |
  • Reference [17]
  • |
  • Related
  • |
  • Cited by
  • | |
  • Comments
    Abstract:

    Various approaches have been proposed to release histogram on static datasets with differential privacy, while little work exist that handle dynamic datasets. Those existing static approaches are ill-suited for the practical applications on data stream due to the inherent complexity of publishing streaming histograms. With this consideration, this paper addresses the challenge by proposing a partitioning-based method, called SHP (streaming histogram publication), which partitions the count values of each sliding window into different groups for releasing the final histogram. In view of different global sensitivity of queries adopted by this paper, three incremental utility-based mechanisms for adding Laplace noise are proposed to achieve differential privacy. The three mechanisms are sliding window mechanism, time point mechanism, and adaptive sampling mechanism, respectively. In the third mechanism, SHP relies on the adaptive sampling method to predict the next arriving count value at non-sampling time points. If the difference between the predicted value and the true value is less than a user-defined threshold, then it releases the predicted value, otherwise, releases the true value. This mechanism can save privacy budget in terms of sampling interval updates. Experimental results or real datasets show that the utility of SHP based on sampling is better than the other two mechanisms.

    Reference
    [1] Acs G, Castelluccia C, Chen R. Differentially private histogram publishing through lossy compression. In: Proc. of the 11th IEEE Int'l Conf. on Data Mining. Piscatawayi: IEEE, 2012. 84-95. [doi: 10.1109/ICDM.2012.80]
    [2] Kellaris G, Papadopoulos S. Practical differential privacy via grouping and smoothing. Proc. of the VLDB Endowment, 2013,6(5): 301-312. [doi: 10.14778/2535573.2488337]
    [3] Xu J, Zhang Z, Xiao X, Yang Y, Yu G. Differentially private histogram publication. In: Proc. of IEEE the 28th Int'l Conf. on Data Engineering. Piscatawayi: IEEE, 2012. 32-43. [doi: 10.1109/ICDE.2012.48]
    [4] Dwork C, McSherry F, Nissim K, Smith A. Calibrating noise to sensitivity in private data analysis. In: Proc. of the 3rd Theory of Cryptography Conf. Berlin: Springer-Verlag, 2006. 265-284. [doi: 10.1007/11681878_14]
    [5] Hay M, Rastogi V, Miklau G, Suciu D. Boosting the accuracy of differentially private histograms through consistency. Proc. of the VLDB Endowment, 2010,3(1):1021-1032. [doi: 10.14778/1920841.1920970]
    [6] Xiao X, Xiong L, Yuan C. Differential privacy via wavelet transforms. IEEE Trans. on Knowledge and Data Engineering, 2010, 23(8):1200-1214. [doi: 10.1109/TKDE.2010.247]
    [7] Li C, Hay M, Rastogi V, Miklau G, McGregor A. Optimizing linear counting queries under differential privacy. In: Proc. of the 29th ACM Symp. on Principles of Database Systems. New York: ACM Press, 2010. 123-134. [doi: 10.1145/1807085.1807104]
    [8] Li C, Miklau G. An adaptive mechanism for accurate query answering under differential privacy. Proc. of the VLDB Endowment, 2012,5(6):514-525. [doi: 10.14778/2168651.2168653]
    [9] Yuan G, Zhang Z, Winslett M, Xiao X, Yang Y, Hao Z. Low-Rank mechanism: optimizing batch queries under differential privacy. Proc. of the VLDB Endowment, 2012,5(11):1352-1363. [doi: 10.14778/2350229.2350252]
    [10] Dwork C, Naor M, Pitassi T, Rothblum GN. Differential privacy under continual observation. In: Proc. of the 42nd ACM Symp. on Theory of Computing. New York: ACM Press, 2010. 715-724. [doi: 10.1145/1806689.1806787]
    [11] Chan THH, Shi E, Song D. Private and continual release of statistics. ACM Trans. on Information and System Security, 2011,14(3): 26-49. [doi: 10.1145/2043621.2043626]
    [12] Fan L, Xiong L. An adaptive approach to real-time aggregate monitoring with differential privacy. IEEE Trans. on Knowledge and Data Engineering, 2014,26(9):1041-4347. [doi: 10.1109/TKDE.2013.96]
    [13] Cao J, Xiao Q, Ghinita G, Li N, Bertino E, Tan KL. Efficient and accurate strategies for differentially-private sliding window queries. In: Proc. of the 16th Int'l Conf. on Extending Database Technology. New York: ACM Press, 2013. 191-202. [doi: 10.114 5/2452376. 2452400]
    [14] Kellaris G, Papadopoulos S, Xiao X, Papadias D. Differentially private event sequences over infinite streams. Proc. of the VLDB Endowment, 2014,7(12):1155-1166. [doi: 10.14778/2732977.2732989]
    [15] Friedman A, Sharfman I, Keren D, Schuster A. Privacy-Preserving distributed stream monitoring. In: Proc. of the 21st Annual Network and Distributed System Security Symp. Washington: The Internet Society, 2014. [doi: 10.14722/ndss.2014.23128]
    [16] Jagadish HV, Koudas N, Muthukrishnan S, Poosala V, Sevcik KC, Suel T. Optimal histograms with quality guarantees. In: Proc. of the 24th Conf. of Very Large Databases. San Francisco: Morgan Kaufmann Publishers, 1998. 275-286.
    [17] McSherry F. Privacy integrated queries: An extensible platform for privacy-preserving data analysis. In: Proc. of the ACM SIGMOD Int'l Conf. on Management of Data. New York: ACM Presss, 2009. 19-30. [doi: 10.1145/1559845.1559850]
    Related
    Cited by
Get Citation

张啸剑,孟小峰.基于差分隐私的流式直方图发布方法.软件学报,2016,27(2):381-393

Copy
Share
Article Metrics
  • Abstract:4304
  • PDF: 6897
  • HTML: 1450
  • Cited by: 0
History
  • Received:March 30,2014
  • Online: February 03,2016
You are the first2050467Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063