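Approach to Mining Length-Two Loops From the Log Without “aba” Pattern
Authors: Lin Leilei, Zhou Hua, Dai Fei, Zhu Rui, Li Tong
Author biographies: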

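Lin Leilei (1989-), male, born in Wanning, Hainan, master's degree; main research areas: software engineering, process management, system analysis and integration. Zhu Rui (1987-), male, Ph.D., lecturer, CCF professional member; main research areas: software process, process mining. Zhou Hua (1963-), male, Ph.D., research professor, doctoral supervisor; main research areas: software engineering, system analysis and integration. Li Tong (1963-), male, Ph.D., professor, doctoral supervisor, CCF senior member; main research area: software engineering. Dai Fei (1982-), male, Ph.D., associate professor, CCF professional member; main research areas: software engineering, business process management.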

Corresponding author:

Dai Fei, E-mail: daifei@swfu.edu.cn

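Funding:

National Natural Science Foundation of China (61462095, 61702442, 61662085); Yunnan Province Natural Science Foundation (2016FB102); Talent Project of Yunnan Province (C6143002); Open Fund Project of Key Laboratory of Software Engineering of Yunnan Province (2017SE201, 2016SE202); Yunnan Provincial Department of Education Science Research Fund (2017YJS107, 2017ZZX227); Graduate Innovation Project of Yunnan University (YDY17095)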



Abstract:

Existing process mining algorithms rely on the "aba" pattern to mine length-two loops, yet this pattern does not necessarily appear in logs that satisfy local completeness. To address this, the classic α-algorithm is extended into the αL+-algorithm, which mines length-two loops from logs that do not contain the "aba" pattern. First, an order vector matrix over the tasks in the log is built to abstract the variant structures of length-two loops. Then, from a global perspective, length-two loops are distinguished from concurrency according to the number of occurrences and the positions of events in traces. Finally, proximity and circuit abstraction are introduced to eliminate the interference caused by loops of the same type on concurrent branches. Experimental results show that, in comparison with existing mining algorithms, the αL+-algorithm can mine length-two loops from logs both with and without the "aba" pattern. In addition, the algorithm has been implemented and integrated into the open-source ProM framework.
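
To make the role of the "aba" pattern concrete, the following is a minimal Python sketch of a simplified pattern-based check, not the αL+-algorithm itself. It assumes a log is a list of traces, each written as a string with one character per task; the function names, the classification rule, and both toy logs are hypothetical illustrations, and the exact conditions used by published algorithms may differ. The sketch shows how two tasks that follow each other in both directions are read as a loop only when an "xyx" substring occurs, and otherwise look like concurrency.

```python
def directly_follows(log):
    """Pairs (x, y) such that y immediately follows x in some trace."""
    return {(t[i], t[i + 1]) for t in log for i in range(len(t) - 1)}


def has_aba(log, x, y):
    """True if some trace contains the substring x y x or y x y."""
    for t in log:
        for i in range(len(t) - 2):
            if t[i] == t[i + 2] and {t[i], t[i + 1]} == {x, y} and x != y:
                return True
    return False


def classify(log, x, y):
    """Simplified, assumed rule: mutual succession plus "aba" evidence
    is read as a length-two loop; mutual succession alone as concurrency."""
    df = directly_follows(log)
    if (x, y) in df and (y, x) in df:
        return "length-two loop" if has_aba(log, x, y) else "parallel"
    return "not mutually following"


# Hypothetical log in which the loop between b and c repeats back to back.
log_with_aba = ["abcbcbd"]

# Hypothetical log in which a concurrent task e always interleaves with the
# loop, so b and c follow each other in both directions but "bcb" and "cbc"
# never occur.
log_without_aba = ["abecbd", "abcebd"]

print(classify(log_with_aba, "b", "c"))
print(classify(log_without_aba, "b", "c"))
```

On the second log the pattern-based rule reports concurrency for b and c even though b occurs twice in every trace. Occurrence counts and positions of this kind are the sort of global evidence the abstract says is used to separate length-two loops from concurrency.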

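Cite this article

Lin Leilei, Zhou Hua, Dai Fei, Zhu Rui, Li Tong. Approach to mining length-two loops from the log without "aba" pattern. Ruan Jian Xue Bao/Journal of Software, 2018,29(11):3278-3294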

History
  • Received: 2017-07-20
  • Last revised: 2017-09-16
  • Accepted: 2017-11-14
  • Published online: 2017-12-05