论文标题

对抗系统变体近似以量化过程模型概括

Adversarial System Variant Approximation to Quantify Process Model Generalization

论文作者

Theis, Julian, Darabi, Houshang

论文摘要

在过程挖掘中,使用过程发现算法从事件日志中提取过程模型,并通常使用多个质量维度评估。虽然测量提取的过程模型与事件日志的关系的指标是经过充分研究的,从而量化了过程模型可以描述其潜在系统的未观察到的行为的水平。在本文中,提出了一种基于深度学习的新方法,称为对抗系统变体近似(Avatar)来克服此问题。对事件日志中包含的变体进行了序列生成对抗网络的训练,目的是近似系统行为的基本变体分布。未观察到的逼真的变体直接从序列生成的对抗网络或利用大都市 - 悬挂算法中采样。然后,使用已建立的过程模型质量指标,基于观察到的和估计的未观察到的变体的实际观察和估计的未观察到的变体,对过程模型与其潜在的未知系统行为相关的程度进行了量化。在15个地面真实系统的受控实验中,证明了揭示现实的未观察到变体方面的显着性能改进。此外,对所提出的方法进行了实验测试和评估,以量化60个发现的过程模型相对于其系统的概括。

In process mining, process models are extracted from event logs using process discovery algorithms and are commonly assessed using multiple quality dimensions. While the metrics that measure the relationship of an extracted process model to its event log are well-studied, quantifying the level by which a process model can describe the unobserved behavior of its underlying system falls short in the literature. In this paper, a novel deep learning-based methodology called Adversarial System Variant Approximation (AVATAR) is proposed to overcome this issue. Sequence Generative Adversarial Networks are trained on the variants contained in an event log with the intention to approximate the underlying variant distribution of the system behavior. Unobserved realistic variants are sampled either directly from the Sequence Generative Adversarial Network or by leveraging the Metropolis-Hastings algorithm. The degree by which a process model relates to its underlying unknown system behavior is then quantified based on the realistic observed and estimated unobserved variants using established process model quality metrics. Significant performance improvements in revealing realistic unobserved variants are demonstrated in a controlled experiment on 15 ground truth systems. Additionally, the proposed methodology is experimentally tested and evaluated to quantify the generalization of 60 discovered process models with respect to their systems.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源