基于强化联邦GNN的个性化公共安全突发事件检测
作者:
通讯作者:

杜军平,E-mail:junpingdu@126.com

基金项目:

国家自然科学基金(62192784, U22B2038, 62172056, 62272058)


Personalized Public Safety Event Detection Based on Reinforcement Federated GNN
Author:
  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献 [75]
  • |
  • 相似文献 [20]
  • |
  • 引证文献
  • | |
  • 文章评论
    摘要:

    近年来, 将公共安全数据转换为图的形式, 通过图神经网络(GNN)构造节点表示应用于下游任务的方法, 充分利用了公共安全数据的实体与关联信息, 取得了较好的效果. 为了提高模型的有效性, 需要大量的高质量数据, 但是高质量的数据通常归属于政府、公司和组织, 很难通过数据集中的方式使模型学习到有效的事件检测模型.由于各数据拥有方的关注主题与收集时间不同, 数据之间存在Non-IID的问题. 传统的假设一个全局模型可以适合所有客户端的方法难以解决此类问题. 提出了基于强化联邦图神经网络的个性化公共安全突发事件检测方法PPSED, 各客户端采用多方协作的方式训练个性化的模型来解决本地的突发事件检测任务. 设计了联邦公共安全突发事件检测模型的本地训练与梯度量化模块,采用基于图采样的minibatch机制的GraphSage构造公共安全突发事件检测本地模型, 以减小数据Non-IID的影响, 采用梯度量化方法减小梯度通信的消耗. 设计了基于随机图嵌入的客户端状态感知模块, 在保护隐私的同时, 更好地保留客户端模型有价值的梯度信息. 设计了强化联邦图神经网络的个性化梯度聚合与量化策略, 采用DDPG拟合个性化联邦学习梯度聚合加权策略, 并根据权重决定是否对梯度进行量化, 对模型的性能与通信压力进行平衡. 通过在微博平台收集的公共安全数据集和3个公开的图数据集进行了大量的实验, 实验结果表明了所提方法的有效性.

    Abstract:

    In recent years, the method of transforming public safety data into graph form and constructing node representations through graph neural networks for training and inference of downstream tasks has fully exploited the entity and association information of public safety data, achieving excellent results. Nevertheless, to enhance the effectiveness of the model, a large amount of high-quality data is needed, which is usually held by governments, companies, and organizations, making it difficult to learn an effective event detection model through data centralization. Moreover, due to different focuses and collection times of the data from various parties, there is a Non-IID (independent and identically distributed) problem among the data. Traditional methods that assume a global model can accommodate all clients are challenging to solve such issues. Therefore, this study proposes personalized public safety event detection (PPSED) method based on a reinforcement federated graph neural network. In this method, each client trains a personalized and more robust model through multi-party collaboration to solve local event detection tasks. A local training and gradient quantization module is designed for the federated public safety emergency event detection model and trained GraphSage through a minibatch mechanism based on graph sampling to construct a local model for public safety event detection. This approach reduces the impact of Non-IID data and supports the gradient quantization method to lower the consumption of gradient communication. A client state awareness module is also designed based on random graph embedding, which better retains the valuable information of the client model while protecting privacy. Furthermore, a personalized gradient aggregation and quantization strategy are designed for the federated graph neural network. Deep deterministic policy gradient (DDPG) is used to fit a personalized federated learning gradient aggregation weighting strategy, and it is determined whether the gradient can be quantized based on the weight, balancing the model's performance, and communication pressure. This study demonstrated the effectiveness of the method through extensive experiments on a public safety dataset collected from the Weibo platform and three public graph datasets.

    参考文献
    [1] Cao Y, Peng H, Wu J, et al. Knowledge-preserving incremental social event detection via heterogeneous GNNs. In:Proc. of the Web Conf. 2021. 3383-3395.
    [2] Liu Z, Yang Y, Huang Z, et al. Event early embedding:Predicting event volume dynamics at early stage. In:Proc. of the 40th Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval. 2017. 997-1000.
    [3] Fedoryszak M, Frederick B, Rajaram V, et al. Real-time event detection on social data streams. In:Proc. of the 25th ACM SIGKDD Int'l Conf. on Knowledge Discovery & Data Mining. 2019. 2774-2782.
    [4] Aggarwal CC, Subbian K. Event detection in social streams. In:Proc. of the 2012 SIAM Int'l Conf. on Data Mining. Society for Industrial and Applied Mathematics, 2012. 624-635.
    [5] Peng H, Li J, Song Y, et al. Streaming social event detection and evolution discovery in heterogeneous information networks. ACM Trans. on Knowledge Discovery from Data, 2021, 15(5):1-33.
    [6] Cui W, Du J, Wang D, et al. Extended search method based on a semantic hashtag graph combining social and conceptual information. World Wide Web, 2019, 22:2589-2610.
    [7] Hamilton W, Ying Z, Leskovec J. Inductive representation learning on large graphs. In:Advances in Neural Information Proccesing Systems, Vol.30. 2017
    [8] Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. arXiv:1609.02907, 2016.
    [9] Bo D, Wang X, Liu Y, et al. A survey on spectral graph neural networks. arXiv:2302.05631, 2023.
    [10] Baek J, Kang M, Hwang SJ. Accurate learning of graph representations with graph multiset pooling. arXiv:2102.11533, 2021.
    [11] Wu Z, Pan S, Chen F, et al. A comprehensive survey on graph neural networks. IEEE Trans. on Neural Networks and Learning Systems, 2021, 32(1):4-24.
    [12] Perozzi B, Al-Rfou R, Skiena S. DeepWalk:Online learning of social representations. In:Proc. of the 20th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. 2014. 701-710.
    [13] Grover A, Leskovec J. Node2vec:Scalable feature learning for networks. In:Proc. of the 22nd ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. 2016. 855-864.
    [14] Yang JX, Du JP, Shao YX, et al. Construction method of intellectual-property-oriented scientific and technological resources portrait. Ruan Jian Xue Bao/Journal of Software, 2022, 33(4):1439-1450 (in Chinese with English abstract). https://www.jos.org.cn/1000-9825/6483.htm[doi:10.13328/j.cnki.jos.006483]
    [15] Peng H, Li J, Gong Q, et al. Fine-grained event categorization with heterogeneous graph convolutional networks. arXiv:1906. 04580, 2019.
    [16] Liu J, Ong GP, Chen X. GraphSAGE-based traffic speed forecasting for segment network with sparse data. IEEE Trans. on Intelligent Transportation Systems, 2020, 23(3):1755-1766.
    [17] Bongini P, Bianchini M, Scarselli F. Molecular generative graph neural networks for drug discovery. Neurocomputing, 2021, 450:242-252.
    [18] Chen JS, Meng XW, Ji WY, et al. POI recommendation based on multidimensional context-aware graph embedding model. Ruan Jian Xue Bao/Journal of Software, 2020, 31(12):3700-3715 (in Chinese with English abstract). http://www.jos.org.cn/1000-9825/5855.htm[doi:10.13328/j.cnki.jos.005855]
    [19] Shi C, Han X, Song L, et al. Deep collaborative filtering with multi-aspect information in heterogeneous networks. IEEE Trans. on Knowledge and Data Engineering, 2019, 33(4):1413-1425.
    [20] Zhang Y, Shi Y, Zhou Z, et al. Efficient and secure skyline queries over vertical data federation. IEEE Trans. on Knowledge and Data Engineering, 2022, 35(9):9269-9280.
    [21] Pan X, Tong Y, Xue C, et al. Hu-Fu:A data federation system for secure spatial queries. Proc. of the VLDB Endowment, 2022, 15(12):3582-3585.
    [22] Shen X, Dai Q, Chung FL, et al. Adversarial deep network embedding for cross-network node classification. Proc. of the AAAI Conf. on Artificial Intelligence, 2020, 34(3):2991-2999.
    [23] Guan Z, Li Y, Xue Z, et al. Federated graph neural network for cross-graph node classification. In:Proc. of the 2021 IEEE 7th Int'l Conf. on Cloud Computing and Intelligent Systems (CCIS). IEEE, 2021. 418-422.
    [24] Li Q, He B, Song D. Model-contrastive federated learning. In:Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2021. 10713-10722.
    [25] Wang H, Kaplan Z, Niu D, et al. Optimizing Federated learning on non-iid data with reinforcement learning. In:Proc. of the IEEE Conf. on Computer Communications (INFOCOM 2020). IEEE, 2020. 1698-1707.
    [26] McMahan B, Moore E, Ramage D, et al. Communication-efficient learning of deep networks from decentralized data. In:Proc. of the Artificial Intelligence and Statistics. 2017. 1273-1282.
    [27] He C, Balasubramanian K, Ceyani E, et al. FedGraphNN:A Federated learning system and benchmark for graph neural networks. arXiv:2104.07145, 2021.
    [28] He C, Ceyani E, Balasubramanian K, et al. SpreadGNN:Serverless multi-task Federated learning for graph neural networks. arXiv:2106.02743, 2021.
    [29] Baek J, Jeong W, Jin J, et al. Personalized subgraph Federated learning. arXiv:2206.10206, 2022.
    [30] Huang Y, Chu L, Zhou Z, et al. Personalized cross-silo Federated learning on non-iid data. Proc. of the AAAI Conf. on Artificial Intelligence, 2021, 35(9):7865-7873.
    [31] Schneider J, Vlachos M. Mass personalization of deep learning. arXiv:1909.02803, 2019.
    [32] Li T, Sahu AK, Zaheer M, et al. Federated optimization in heterogeneous networks. Proc. of the Machine Learning and Systems, 2020, 2:429-450.
    [33] Kober J, Bagnell JA, Peters J. Reinforcement learning in robotics:A survey. The Int'l Journal of Robotics Research, 2013, 32(11):1238-1274.
    [34] Sun Q, Li J, Peng H, et al. SUGAR:Subgraph neural network with reinforcement pooling and self-supervised mutual information mechanism. In:Proc. of the Web Conf. 2021. 2081-2091.
    [35] Yang M, Li C, Sun F, et al. Be relevant, non-redundant, and timely:Deep reinforcement learning for real-time event summarization. Proc. of the AAAI Conf. on Artificial Intelligence, 2020, 34(5):9410-9417.
    [36] Zhou H, Yin H, Zheng H, et al. A survey on multi-modal social event detection. Knowledge-based Systems, 2020, 195:105695.
    [37] Allan J. Introduction to Topic Detection and Tracking. Topic Detection and Tracking:Event-based Information Organization. Boston:Springer, 2002. 1-16.
    [38] Angel A, Koudas N, Sarkas N, et al. Dense subgraph maintenance under streaming edge weight updates for real-time story identification. The VLDB Journal, 2014, 23:175-199.
    [39] Yu W, Li J, Bhuiyan MZA, et al. Ring:Real-time emerging anomaly monitoring system over text streams. IEEE Trans. on Big Data, 2017, 5(4):506-519.
    [40] Allan J. Topic Detection and Tracking:Event-based Information Organization. Springer Publishing Company, Incorporated, 2002.
    [41] Peng H, Zhang R, Li S, et al. Reinforced, incremental and cross-lingual event detection from social messages. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2022, 45(1):980-998.
    [42] Sun M, Zhao S, Gilvary C, et al. Graph convolutional networks for computational drug development and discovery. Briefings in Bioinformatics, 2020, 21(3):919-935.
    [43] Rong Y, Bian Y, Xu T, et al. Self-supervised graph transformer on large-scale molecular data. In:Advances in Neural Information Proccesing Systems, Vol.33. 2020. 12559-12571.
    [44] Xiao ST, Shao YX, Li YW, et al. LECF:Recommendation via learnable edge collaborative filtering. Science China Information Sciences, 2022, 65(1):112101.
    [45] Li Y, Yuan Y, Wang Y, et al. Distributed multimodal path queries. IEEE Trans. on Knowledge and Data Engineering, 2020, 34(7):3196-3210.
    [46] Veličković P, Cucurull G, Casanova A, et al. Graph attention networks. arXiv:1710.10903, 2017.
    [47] Li H, Shao Y, Du J, et al. An I/O-efficient disk-based graph system for scalable second-order random walk of large graphs. arXiv:2203.16123, 2022.
    [48] Dai H, Li H, Tian T, et al. Adversarial attack on graph structured data. In:Proc. of the Int'l Conf. on Machine Learning. 2018. 1115-1124.
    [49] Chen L, Li JT, Peng QB, et al. Understanding structural vulnerability in graph convolutional networks. In:Proc. of the IJCAI. 2021. 2249-2255.
    [50] Zhu D, Zhang Z, Cui P, et al. Robust graph convolutional networks against adversarial attacks. In:Proc. of the 25th ACM SIGKDD Int'l Conf. on Knowledge Discovery & Data Mining. 2019. 1399-1407.
    [51] Li Q, Wen Z, Wu Z, et al. A survey on Federated learning systems:Vision, hype and reality for data privacy and protection. IEEE Trans. on Knowledge and Data Engineering, 2021, 35(4):3347-3366.
    [52] Arivazhagan MG, Aggarwal V, Singh AK, et al. Federated learning with personalization layers. arXiv:1912.00818, 2019.
    [53] Wang B, Li A, Pang M, et al. GraphFL:A Federated learning framework for semi-supervised node classification on graphs. In:Proc. of the ICDM IEEE Int'l Conf. on Data Mining. 2022. 498-507.
    [54] Scardapane S, Spinelli I, Di Lorenzo P. Distributed training of graph convolutional networks. IEEE Trans. on Signal and Information Processing over Networks, 2020, 7:87-100.
    [55] Wan C, Li Y, Li A, et al. BNS-GCN:Efficient full-graph training of graph convolutional networks with partition-parallelism and random boundary node sampling. Proc. of the Machine Learning and Systems, 2022, 4:673-693.
    [56] Yao Y, Jin W, Ravi S, et al. FedGCN:Convergence-communication tradeoffs in Federated training of graph convolutional networks. arXiv:2201.12433, 2023.
    [57] Zhang K, Yang C, Li X, et al. Subgraph Federated learning with missing neighbor generation. In:Advances in Neural Information Proccessing Systems, Vol.34. 2021. 6671-6682.
    [58] Watkins CJCH, Dayan P. Q-learning. Machine Learning, 1992, 8:279-292.
    [59] Mnih V, Kavukcuoglu K, Silver D, et al. Playing atari with deep reinforcement learning. arXiv:1312.5602, 2013.
    [60] Sutton RS, McAllester D, Singh S, et al. Policy gradient methods for reinforcement learning with function approximation. In:Advances in Neural Information Processing Systems, Vol.12. 1999.1057-1063.
    [61] Silver D, Lever G, Heess N, et al. Deterministic policy gradient algorithms. In:Proc. of the 31st Int'l Conf. on Machine Learning. 2014. 387-395.
    [62] Lillicrap TP, Hunt JJ, Pritzel A, et al. Continuous control with deep reinforcement learning. arXiv:1509.02971, 2019.
    [63] Banner R, Hubara I, Hoffer E, et al. Scalable methods for 8-bit training of neural networks. In:Advances in Neural Information Processing Systems, Vol.31. 2018.
    [64] Li Y, Li W, Xue Z. Federated learning with stochastic quantization. Int'l Journal of Intelligent Systems, 2022, 37(12):11600-11621.
    [65] Zhang XX, Zhu ZF, Zhao YW, et al. Prototype learning in machine learning:A literature review. Ruan Jian Xue Bao/Journal of Software, 2022, 33(10):3732-3753 (in Chinese with English abstract). http://www.jos.org.cn/1000-9825/6365.htm[doi:10.13328/j.cnki.jos.006365]
    [66] Tan Y, Long G, Liu L, et al. FedProto:Federated prototype learning across heterogeneous clients. Proc. of the AAAI Conf. on Artificial Intelligence, 2022, 36(8):8432-8440.
    [67] Abadi M, Chu A, Goodfellow I, et al. Deep learning with differential privacy. In:Proc. of the ACM SIGSAC Conf. on Computer and Communications Security. 2016. 308-318.
    [68] Xu R, Baracaldo N, Zhou Y, et al. HybridAlpha:An efficient approach for privacy-preserving Federated learning. In:Proc. of the ACM Workshop on Artificial Intelligence and Security. 2019. 13-23.
    [69] Sen P, Namata G, Bilgic M, et al. Collective classification in network data. AI Magazine, 2008, 29(3):93-106.
    [70] Shchur O, Mumme M, Bojchevski A, et al. Pitfalls of graph neural network evaluation. arXiv:1811.05868, 2019.
    [71] Blondel VD, Guillaume JL, Lambiotte R, et al. Fast unfolding of communities in large networks. Journal of Statistical Mechanics:Theory and Experiment, 2008(10):P10008.
    附中文参考文献:
    [14] 杨佳鑫, 杜军平, 邵蓥侠, 等. 面向知识产权的科技资源画像构建方法. 软件学报, 2022, 33(4):1439-1450. https://www.jos.org.cn/1000-9825/6483.htm[doi:10.13328/j.cnki.jos.006483]
    [18] 陈劲松, 孟祥武, 纪威宇, 等. 基于多维上下文感知图嵌入模型的兴趣点推荐. 软件学报, 2020, 31(12):3700-3715. http://www.jos.org.cn/1000-9825/5855.htm[doi:10.13328/j.cnki.jos.005855]
    [65] 张幸幸, 朱振峰, 赵亚威, 等. 机器学习中原型学习研究进展. 软件学报, 2022, 33(10):3732-3753. http://www.jos.org.cn/1000-9825/6365.htm[doi:10.13328/j.cnki.jos.006365]
    引证文献
    网友评论
    网友评论
    分享到微博
    发 布
引用本文

管泽礼,杜军平,薛哲,王沛文,潘圳辉,王晓阳.基于强化联邦GNN的个性化公共安全突发事件检测.软件学报,2024,35(4):1774-1789

复制
分享
文章指标
  • 点击次数:955
  • 下载次数: 3539
  • HTML阅读次数: 1342
  • 引用次数: 0
历史
  • 收稿日期:2023-05-15
  • 最后修改日期:2023-07-07
  • 在线发布日期: 2023-09-11
  • 出版日期: 2024-04-06
文章二维码
您是第19727454位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号