Optimization Framework for DFG-based Automated Process Discovery Approaches
Citations
Altmetric
Author
Augusto, A; Dumas, M; La Rosa, M; Leemans, S; Vanden Broucke, SDate
2020Source Title
Software and System ModelingPublisher
Springer VerlagUniversity of Melbourne Author/s
Augusto, AdrianoAffiliation
Computing and Information SystemsMetadata
Show full item recordDocument Type
Journal ArticleCitations
Augusto, A., Dumas, M., La Rosa, M., Leemans, S. & Vanden Broucke, S. (2020). Optimization Framework for DFG-based Automated Process Discovery Approaches. Software and Systems ModelingAccess Status
This item is embargoed and will be available on 2022-12-10Abstract
The problem of automatically discovering business process models from event logs has been intensely investigated in the past two decades, leading to a wide range of approaches that strike various trade-offs between accuracy, model complexity, and execution time. A few studies have suggested that the accuracy of automated process discovery approaches can be enhanced by means of metaheuristic optimization techniques. However, these studies have remained at the level of proposals without validation on real-life datasets or they have only considered one metaheuristic in isolation. This article presents a metaheuristic optimization framework for automated process discovery. The key idea of the framework is to construct a Directly-Follows Graph (DFG) from the event log, to perturb this DFG so as to generate new candidate solutions, and to apply a DFG-based automated process discovery approach in order to derive a process model from each DFG. The framework can be instantiated by linking it to an automated process discovery approach, an optimization metaheuristic, and the quality measure to be optimized (e.g. fitness, precision, F-score). The article considers several instantiations of the framework corresponding to four optimization metaheuristics, three automated process discovery approaches (Inductive Miner – directly follows, Fodina, and Split Miner), and one accuracy measure (Markovian F-score). These framework instances are compared using a set of 20 real-life event logs. The evaluation shows that metaheuristic optimization consistently yields visible improvements in F-score for all the three automated process discovery approaches, at the cost of execution times in the order of minutes, versus seconds for the baseline
approaches.
Export Reference in RIS Format
Endnote
- Click on "Export Reference in RIS Format" and choose "open with... Endnote".
Refworks
- Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References