Efficient edge filtering of directly-follows graphs for process mining
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías da Información | gl |
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Departamento de Electrónica e Computación | gl |
| dc.contributor.author | Chapela Campa, David | |
| dc.contributor.author | Dumas, Marlon | |
| dc.contributor.author | Mucientes Molina, Manuel | |
| dc.contributor.author | Lama Penín, Manuel | |
| dc.date.accessioned | 2022-11-25T10:14:53Z | |
| dc.date.available | 2022-11-25T10:14:53Z | |
| dc.date.issued | 2022 | |
| dc.description.abstract | Automated process discovery is a process mining operation that takes as input an event log of a business process and generates a diagrammatic representation of the process. In this setting, a common diagrammatic representation generated by commercial tools is the directly-follows graph (DFG). In some real-life scenarios, the DFG of an event log contains hundreds of edges, hindering its understandability. To overcome this shortcoming, process mining tools generally offer the possibility of filtering the edges in the DFG. We study the problem of efficiently filtering the DFG extracted from an event log while retaining the most frequent relations. We formalize this problem as an optimization problem, specifically, the problem of finding a sound spanning subgraph of a DFG with a minimal number of edges and a maximal sum of edge frequencies. We show that this problem is an instance of an NP-hard problem and outline several polynomial-time heuristics to compute approximate solutions. Finally, we report on an evaluation of the efficiency and optimality of the proposed heuristics using 13 real-life event logs | gl |
| dc.description.peerreviewed | SI | gl |
| dc.description.sponsorship | We thank Luciano García-Baíuelos for proposing the idea of combining the results of Chu-Liu-Edmonds’ algorithm to filter a DFG. We also thank Adriano Augusto for providing us with the implementation of the Split Miner filtering technique. This research was funded by the Spanish Ministry of Economy and Competitiveness (TIN2017-84796-C2-1-R) and the Galician Ministry of Education, Culture and Universities (ED431G/08). These grants are co-funded by the European Regional Development Fund (ERDF/FEDER program). D. Chapela-Campa is supported by the Spanish Ministry of Education, under the FPU national plan (FPU16/04428 and EST19/00135). This research is also funded by the Estonian Research Council (grant PRG1226) | gl |
| dc.identifier.citation | Information Sciences 610 (2022). https://doi.org/10.1016/j.ins.2022.07.170 | gl |
| dc.identifier.doi | 10.1016/j.ins.2022.07.170 | |
| dc.identifier.essn | 0020-0255 | |
| dc.identifier.uri | http://hdl.handle.net/10347/29469 | |
| dc.language.iso | eng | gl |
| dc.publisher | Elsevier | gl |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/TIN2017-84796-C2-1-R/ES/APORTANDO INTELIGENCIA A LOS PROCESOS DE NEGOCIO MEDIANTE SOFT COMPUTING EN ESCENARIOS DE DATOS MASIVOS | gl |
| dc.relation.publisherversion | https://doi.org/10.1016/j.ins.2022.07.170 | gl |
| dc.rights | ©2022 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/) | gl |
| dc.rights | Atribución 4.0 Internacional | |
| dc.rights.accessRights | open access | gl |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Process mining | gl |
| dc.subject | Automated process discovery | gl |
| dc.subject | Directly-follows graph | gl |
| dc.subject | Edge filtering | gl |
| dc.title | Efficient edge filtering of directly-follows graphs for process mining | gl |
| dc.type | journal article | gl |
| dc.type.hasVersion | VoR | gl |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 21112b72-72a3-4a96-bda4-065e7e2bb262 | |
| relation.isAuthorOfPublication | 208dae76-e3a1-4dee-8254-35177f75e17c | |
| relation.isAuthorOfPublication.latestForDiscovery | 21112b72-72a3-4a96-bda4-065e7e2bb262 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 2022_infsci_chapela_efficient.pdf
- Size:
- 2.01 MB
- Format:
- Adobe Portable Document Format
- Description:
- Artigo de investigación