Merging Event Logs for Process Mining with a Hybrid Artificial Immune Algorithm

About the authors:

Xu Yang (徐杨, 1970-), male, born in Wuhan, Hubei, Ph.D., lecturer; research interests: distributed computing, process modeling, process analysis. Tang Deyou (汤德佑, 1976-), male, Ph.D., associate professor, CCF professional member; research interests: data provenance, databases, high-performance computing. Yuan Feng (袁峰, 1977-), male, Ph.D., associate research fellow; research interests: Internet of Things, cloud computing, big data. Li Dong (李东, 1970-), male, Ph.D., professor, doctoral supervisor, CCF professional member; research interests: big data and cloud computing, business process management. Lin Qi (林琪, 1991-), male, M.S.; research interests: process modeling, process analysis.

Corresponding author:

Li Dong, E-mail: cslidong@scut.edu.cn

CLC number:

TP181

Fund Project:

National Natural Science Foundation of China (71090403); Science and Technology Planning Projects of Guangdong Province (2014B090901001, 2015B010103002, 2016B090918062, 2016B050502001); Science and Technology Planning Projects of Guangzhou City (201604010127); Special Funds on "985 Project" Disciplinary Construction in School of Software Engineering of South China University of Technology (x2rjD615015Ⅲ)

    Abstract:

    Process mining is an active research topic at the intersection of process management and data mining. In real business environments, the execution data of a process is often recorded in several separate event logs, which must be merged into a single event log before current process mining techniques, which assume a single log, can be applied. Merging is challenging because cases in different logs may be related many-to-many and because information needed for the merge may be missing. This paper gives a formal definition of the event log merging problem, shows that it is a search and optimization problem, and proposes a merging approach based on a hybrid artificial immune algorithm. The approach generates the initial population with a heuristic method and builds on the clonal selection principle of artificial immune systems: candidate solutions undergo iterations of clonal selection, hypermutation, and receptor editing to reach the "best" merging solution, which allows logs with many-to-many case relationships to be merged. The affinity function evaluates individuals along two case-level dimensions: the occurrence frequency of process execution paths ("quantity" matching) and the temporal relation between process cases ("time" matching). An immune memory and a simulated annealing mechanism preserve the diversity of each new generation and reduce the chance of premature convergence to a local optimum. Experimental results show that the approach can merge event logs with many-to-many case relationships and that, compared with random generation, the heuristic initial population speeds up immune evolution. The paper also discusses how data can be partitioned for distributed merging of large-scale event logs, with the aim of improving merging performance.
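The clonal selection loop outlined in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract gives no formulas, so every name and parameter here (`affinity`, `mutate`, `clonal_selection`, the temperature schedule) is hypothetical. A case is assumed to be a `(start, end)` time pair, a candidate solution is a set of `(i, j)` index pairs linking case `i` of one log to case `j` of the other (so many-to-many links are allowed), and the toy affinity covers only a simple "time" check; the frequency factor, receptor editing, and the heuristic initial population are omitted.

```python
import math
import random


def affinity(solution, log_a, log_b):
    """Toy 'time' affinity (an assumption, not the paper's function):
    +1 for each pair whose B-case starts inside the A-case's time span,
    -1 for every other pair."""
    score = 0
    for i, j in solution:
        a_start, a_end = log_a[i]
        b_start, _ = log_b[j]
        score += 1 if a_start <= b_start <= a_end else -1
    return score


def mutate(solution, n_a, n_b, rng):
    """Hypermutation stand-in: flip one random (i, j) pair in or out."""
    pair = (rng.randrange(n_a), rng.randrange(n_b))
    return solution ^ {pair}  # frozenset ^ set yields a frozenset


def clonal_selection(log_a, log_b, generations=200, pop_size=6,
                     clones=4, temp=1.0, cooling=0.97, seed=0):
    rng = random.Random(seed)
    n_a, n_b = len(log_a), len(log_b)

    def fit(solution):
        return affinity(solution, log_a, log_b)

    # Placeholder start: empty solutions (the paper uses a heuristic instead).
    population = [frozenset() for _ in range(pop_size)]
    memory = population[0]  # immune memory: best solution seen so far

    for _ in range(generations):
        next_population = []
        for antibody in population:
            # Clone the antibody, mutate each clone, keep the best clone.
            mutated = [mutate(antibody, n_a, n_b, rng) for _ in range(clones)]
            best_clone = max(mutated, key=fit)
            delta = fit(best_clone) - fit(antibody)
            # Simulated annealing rule: always accept improvements,
            # accept a worse clone with probability exp(delta / temp).
            if delta >= 0 or rng.random() < math.exp(delta / temp):
                antibody = best_clone
            next_population.append(antibody)
        population = next_population

        generation_best = max(population, key=fit)
        if fit(generation_best) > fit(memory):
            memory = generation_best
        temp = max(temp * cooling, 1e-6)  # cool down, avoid division by zero

    return memory
```

Calling `clonal_selection(log_a, log_b)` with two lists of `(start, end)` pairs returns the best pairing found. The memory keeps the best solution even when the annealing step accepts a worse clone, which is the role the abstract assigns to the immune memory.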

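Cite this article

Xu Yang, Yuan Feng, Lin Qi, Tang Deyou, Li Dong. Merging event logs for process mining with a hybrid artificial immune algorithm. Journal of Software (软件学报), 2018, 29(2): 396-416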

History
  • Received: 2016-10-10
  • Last revised: 2016-12-12
  • Published online: 2017-03-27