Malicious Domain Name Detection Method Based on Graph Contrastive Learning

doi:10.13328/j.cnki.jos.006964


Journal of Software, Volume 35, Issue 10, 2024, pp. 4837-4858

Authors:

ZHANG Zhen
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China

ZHANG San-Feng
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China; Key Laboratory of Computer Network and Information Integration of Ministry of Education (Southeast University), Nanjing 211189, China

YANG Wang
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China; Key Laboratory of Computer Network and Information Integration of Ministry of Education (Southeast University), Nanjing 211189, China

CLC Number: TP393

Abstract:

Domain names play an important role in cybercrime. Existing malicious domain name detection methods struggle to exploit rich topology and attribute information and also require large amounts of labeled data, which limits detection performance and raises costs. To address these problems, this study proposes a malicious domain name detection method based on graph contrastive learning. Domain names and IP addresses are taken as two types of nodes in a heterogeneous graph, and a feature matrix is built for each node type from the node attributes. Three types of meta paths are constructed based on the inclusion relationship between domain names, a similarity measure, and the correspondence between domain names and IP addresses. In the pre-training stage, a contrastive learning model based on an asymmetric encoder is applied, which avoids the damage to graph structure and semantics caused by graph data augmentation and reduces the demand for computing resources. With the inductive graph neural network encoders HeteroSAGE and HeteroGAT, a node-centric mini-batch training strategy is adopted to explore the aggregation relationship between a target node and its neighbor nodes, which addresses the poor applicability of transductive graph neural networks in dynamic scenarios. The downstream classification task is evaluated with both logistic regression and random forest classifiers for comparison. Experimental results on publicly available datasets show that detection performance improves by two to six percentage points over related work.
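
The graph construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy resolution records, the two-element feature vectors (name length and digit count), and all function names are assumptions made for illustration. It covers only the domain-IP correspondence and inclusion meta paths, because the abstract does not specify the similarity measure, and it replaces the HeteroSAGE encoder with a plain GraphSAGE-style mean aggregation step.

```python
from collections import defaultdict

# Hypothetical DNS resolution records: domain name -> set of resolved IPs.
resolutions = {
    "a.example.com": {"10.0.0.1"},
    "example.com": {"10.0.0.1", "10.0.0.2"},
    "bad-site.net": {"10.0.0.2"},
    "other.org": {"10.0.0.3"},
}


def shared_ip_neighbors(resolutions):
    """Domain-IP-domain meta path: link domains that resolve to a common IP."""
    ip_to_domains = defaultdict(set)
    for domain, ips in resolutions.items():
        for ip in ips:
            ip_to_domains[ip].add(domain)
    neighbors = {domain: set() for domain in resolutions}
    for domains in ip_to_domains.values():
        for domain in domains:
            neighbors[domain] |= domains - {domain}
    return neighbors


def inclusion_neighbors(domains):
    """Inclusion meta path: link a domain with any listed parent domain."""
    neighbors = {domain: set() for domain in domains}
    for child in domains:
        for parent in domains:
            if child != parent and child.endswith("." + parent):
                neighbors[child].add(parent)
                neighbors[parent].add(child)
    return neighbors


def mean_aggregate(features, neighbors, node):
    """Concatenate a node's own features with the mean of its neighbors'."""
    own = features[node]
    nbrs = sorted(neighbors[node])
    if not nbrs:
        # Isolated node: use a zero vector in place of the neighbor mean.
        mean = [0.0] * len(own)
    else:
        mean = [
            sum(features[n][i] for n in nbrs) / len(nbrs)
            for i in range(len(own))
        ]
    return own + mean


# Assumed toy attributes: [name length, number of digits in the name].
features = {
    d: [float(len(d)), float(sum(c.isdigit() for c in d))]
    for d in resolutions
}

shared = shared_ip_neighbors(resolutions)
included = inclusion_neighbors(resolutions)
embedding = mean_aggregate(features, shared, "a.example.com")
```

Because the aggregation depends only on a node's local neighborhood, the same function can be applied to a domain that was not present when the neighbor sets were first built, which is the property that makes inductive encoders suited to dynamic scenarios.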

Key words: malicious domain name detection; attribute heterogeneous graph; graph neural network (GNN); asymmetric coding; self-supervised learning

Citation:

ZHANG Zhen, ZHANG San-Feng, YANG Wang. Malicious domain name detection method based on graph contrastive learning. Journal of Software, 2024, 35(10): 4837-4858

History

Received: September 06, 2022
Revised: January 17, 2023
Online: September 13, 2023
Published: October 06, 2024
