Optimization Framework for DFG-based Automated Process Discovery Approaches
AuthorAugusto, A; Dumas, M; La Rosa, M; Leemans, S; Vanden Broucke, S
Source TitleSoftware and System Modeling
University of Melbourne Author/sAugusto, Adriano
AffiliationComputing and Information Systems
Document TypeJournal Article
CitationsAugusto, A., Dumas, M., La Rosa, M., Leemans, S. & Vanden Broucke, S. (2020). Optimization Framework for DFG-based Automated Process Discovery Approaches. Software and Systems Modeling
Access StatusThis item is embargoed and will be available on 2022-12-10
The problem of automatically discovering business process models from event logs has been intensely investigated in the past two decades, leading to a wide range of approaches that strike various trade-offs between accuracy, model complexity, and execution time. A few studies have suggested that the accuracy of automated process discovery approaches can be enhanced by means of metaheuristic optimization techniques. However, these studies have remained at the level of proposals without validation on real-life datasets or they have only considered one metaheuristic in isolation. This article presents a metaheuristic optimization framework for automated process discovery. The key idea of the framework is to construct a Directly-Follows Graph (DFG) from the event log, to perturb this DFG so as to generate new candidate solutions, and to apply a DFG-based automated process discovery approach in order to derive a process model from each DFG. The framework can be instantiated by linking it to an automated process discovery approach, an optimization metaheuristic, and the quality measure to be optimized (e.g. fitness, precision, F-score). The article considers several instantiations of the framework corresponding to four optimization metaheuristics, three automated process discovery approaches (Inductive Miner – directly follows, Fodina, and Split Miner), and one accuracy measure (Markovian F-score). These framework instances are compared using a set of 20 real-life event logs. The evaluation shows that metaheuristic optimization consistently yields visible improvements in F-score for all the three automated process discovery approaches, at the cost of execution times in the order of minutes, versus seconds for the baseline approaches.
- Click on "Export Reference in RIS Format" and choose "open with... Endnote".
- Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References