一种适用于多样性环境的业务流程挖掘方法
作者:
基金项目:

国家自然科学基金(60873115); 教育部-中国移动科研基金(MCM20123011); 上海市科技发展基金(13dz2260200, 13511504300); 上海中医药大学预算内项目(2013JW30)


Process Mining Approach for Diverse Application Environments
Author:
  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献 [26]
  • |
  • 相似文献 [20]
  • | | |
  • 文章评论
    摘要:

    从运行日志挖掘业务流程模型的流程挖掘方法研究方兴未艾,然而,复杂多变的运行环境使流程日志也不可避免地呈现出多样性.传统的流程挖掘算法各有其适用对象,因此,如何挑选适合多样性流程日志的流程挖掘算法成为了一项挑战.提出一种适用于多样性环境的业务流程挖掘方法SoFi(survival of fittest integrator).该方法基于领域知识对日志进行分类,使用多种现有的挖掘算法对每一类子日志产生一组流程模型作为遗传算法的初始种群,借助遗传算法的优化能力,从中整合得到高质量的业务流程模型.针对模拟日志和某通信公司真实日志的实验结果表明:相对于任何单一的挖掘算法,SoFi产生的流程模型具有更高的综合质量,即重现度、精确度、通用性和简单性.

    Abstract:

    Mining business process models from running logs is in its ascendant. Inevitably, the ever changing operational environment makes these log records diverse. Considering every mining algorithm has its pros and cons, this paper focuses on the challenge to apply a best mining algorithm against diverse logs. A novel approach, SoFi (survival of fittest integrator), is proposed to mine business process models effectively in such a diverse environment. SoFi tackles the diversity issue by utilizing domain knowledge to classify the cases in a log and applying various mining algorithms on these categories to obtain comprehensive process models as candidates for optimization. A genetic algorithm (GA) based optimizer takes these candidates as initial population for purpose of both genetic quality as well as genetic diversity. Under the principle of survival of fittest, the GA optimizer can aggregate best process fragments with context into the final process model for the entire log. Experiments on synthetic data and real cases from a telecommunication firm demonstrate the effectiveness of SoFi and comprehensive quality of mined process models in terms of replay fitness, accuracy, generalization, and simplicity.

    参考文献
    [1] Van der Aalst WMP. Process Mining: Discovery, Conformance and Enhancement of Business Processes. Springer-Verlag, 2011.
    [2] Van der Aalst WMP, De Medeiros AKA, Weijters AJMM. Genetic process mining. In: Proc. of the Applications and Theory of Petri Nets 2005. LNCS 3536, Springer-Verlag, 2005. 48-69.
    [3] Van der Aalst W, Weijters T, Maruster L. Workflow mining: Discovering process models from event logs. IEEE Trans. on Knowledge and Data Engineering, 2004,16(9):1128-1142. [doi: 10.1109/TKDE.2004.47]
    [4] Wen LJ, Wang JM, Sun JG. Detecting implicit dependencies between tasks from event logs. In: Proc. of the Frontiers of WWW Research and Development-APWeb 2006. Springer-Verlag, 2006. 591-603. [doi: 10.1007/11610113_52]
    [5] Weijters AJMM, Ribeiro JTS. Flexible heuristics miner (FHM). In: Proc. of the IEEE Symp. on Computational Intelligence and Data Mining (CIDM). IEEE, 2011. 310-317. [doi: 10.1109/CIDM.2011.5949453]
    [6] Weijters AJMM, van der Aalst WMP. Rediscovering workflow models from event-based data using little thumb. Integrated Computer Aided Engineering, 2003,10(2):151-162.
    [7] Van Dongen BF, Van der Aalst WMP. Multi-Phase process mining: Aggregating instance graphs into EPCs and Petri nets. In: Proc. of the 2nd Int'l Workshop on Applications of Petri Nets to Coordination, Workflow and Business Process Management (PNCWB). Citeseer, 2005.
    [8] Van Dongen BF, Van der Aalst WMP. Multi-Phase process mining: Building instance graphs. In: Proc. of the Conceptual Modeling–ER 2004. Springer-Verlag, 2004. 362-376. [doi: 10.1007/978-3-540-30464-7_29]
    [9] De Medeiros AKA, Weijters AJMM, Van der Aalst WMP. Genetic process mining: an experimental evaluation. Data Mining and Knowledge Discovery, 2007,14(2):245-304. [doi: 10.1007/s10618-006-0061-7]
    [10] De Medeiros AKA, Weijters AJMM, Van der Aalst WMP. Genetic process mining: A basic approach and its challenges. In: Proc. of the Business Process Management Workshops. Springer-Verlag, 2006. 203-215. [doi: 10.1007/11678564_18]
    [11] Song M, Günther CW, Van der Aalst WMP. Trace clustering in process mining. In: Proc. of the Business Process Management Workshops. Springer-Verlag, 2009. 109-120. [doi: 10.1007/978-3-642-00328-8_11]
    [12] Bose RPJC, Van der Aalst WMP. Trace clustering based on conserved patterns: Towards achieving better process models. In: Proc. of the Business Process Management Workshops. Springer-Verlag, 2010. 170-181. [doi: 10.1007/978-3-642-12186-9_16]
    [13] Bose RPJC, Van der Aalst WMP. Context aware trace clustering: Towards improving process mining results. In: Proc. of the SDM. SIAM, 2009. 401-412.
    [14] Kumaran S, Liu R, Wu FY. On the duality of information-centric and activity-centric models of business processes. In: Proc. of the Advanced Information Systems Engineering. Springer-Verlag, 2008. 32-47. [doi: 10.1007/978-3-540-69534-9_3]
    [15] Nigam A, Caswell NS. Business artifacts: An approach to operational specification. IBM Systems Journal, 2003,42(3):428-445. [doi: 10.1147/sj.423.0428]
    [16] Cortadella J, Kishinevsky M, Lavagno L, Yakovlev A. Deriving Petri nets from finite transition systems. IEEE Trans. on Computers, 1998,47(8):859-882. [doi: 10.1109/12.707587]
    [17] Buijs J, Van Dongen BF, Van der Aalst WMP. A genetic algorithm for discovering process trees. In: Proc. of the 2012 IEEE Congress on Evolutionary Computation (CEC). IEEE, 2012. 1-8. [doi: 10.1109/CEC.2012.6256458]
    [18] Aalst WMP, Medeiros AKA, Weijters AJMM. Genetic process mining. In: Proc. of the Applications and Theory of Petri Nets. 2005. 985-985. [doi: 10.1007/11494744_5]
    [19] Buijs J, Van Dongen BF, Van der Aalst WMP. On the role of fitness, precision, generalization and simplicity in process discovery. In: Proc. of the Move to Meaningful Internet Systems: OTM 2012. Springer-Verlag, 2012. 305-322. [doi: 10.1007/978-3-642- 33606-5_19]
    [20] Adriansyah A, Van Dongen BF, Van der Aalst WMP. Conformance checking using cost-based fitness analysis. In: Proc. of the 2011 15th IEEE Int'l Enterprise Distributed Object Computing Conf. (EDOC). IEEE, 2011. 55-64. [doi: 10.1109/EDOC.2011.12]
    [21] Adriansyah A, Munoz-Gama J, Carmona J, Van Dongen BF, Van der Aalst WMP. Alignment based precision checking. In: Proc. of the Business Process Management Workshops. Springer-Verlag, 2013. 137-149. [doi: 10.1007/978-3-642-36285-9_15]
    [22] Van Dongen BF, De Medeiros AKA, Verbeek HMW, Weijters AJMM, Aalst WMP. The ProM framework: A new era in process mining tool support. In: Proc. of the Applications and Theory of Petri Nets 2005. Springer-Verlag, 2005. 444-454. [doi: 10.1007/ 11494744_25]
    [23] Burattin A, Sperduti A. PLG: A framework for the generation of business process models and their execution logs. In: Proc. of the Business Process Management Workshops. Springer-Verlag, 2011. 214-219. [doi: 10.1007/978-3-642-20511-8_20]
    [24] Eck M. Alignment-Based Process Model Repair and its Application to the Evolutionary Tree Miner., 2013
    [25] Zeng QT, Sun SX, Huan H, Liu C, Wang HQ. Cross-Organizational collaborative workflow mining from a multi-source log. In: Proc. of the Decision Support Systems. 2013. 1280-1301. [doi: 10.1016/j.dss.2012.12.001]
    [26] Rozinat A, Van Der Aalst WMP. Conformance checking of processes based on monitoring real behavior. Information Systems, 2008,33(1):64-95. [doi: 10.1016/j.is.2007.07.001]
    引证文献
    网友评论
    网友评论
    分享到微博
    发 布
引用本文

杨丽琴,康国胜,郭立鹏,田朝阳,张亮,张笑楠,高翔.一种适用于多样性环境的业务流程挖掘方法.软件学报,2015,26(3):550-561

复制
分享
文章指标
  • 点击次数:5611
  • 下载次数: 6765
  • HTML阅读次数: 2458
  • 引用次数: 0
历史
  • 收稿日期:2014-07-01
  • 最后修改日期:2014-11-21
  • 在线发布日期: 2015-03-03
文章二维码
您是第19831005位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号