论文标题

结构化程序合成的过程发现

Process Discovery for Structured Program Synthesis

论文作者

Zhang, Dell, Kuhnle, Alexander, Richardson, Julian, Sensoy, Murat

论文摘要

过程挖掘的核心任务是过程发现,旨在从事件日志数据中学习准确的过程模型。在本文中,我们建议将(块)结构化程序直接用作目标过程模型,以建立与程序合成领域的连接,并促进从抽象过程模型到可执行过程的转换,例如机器人过程自动化。此外,我们开发了一种新型的自下而上的团聚方法,以发现这种结构化程序过程模型。与流行的自上而下的递归式矿工相比,我们提议的集聚矿工享有类似的理论保证,可以产生合理的过程模型(没有僵局和其他异常),同时表现出一些优势,例如避免进行无声活动和适应重复的活动。所提出的算法通过迭代地将一些图形重写规则应用于直接关注活动的绘画作用。对于直接关注真实世界的(稀疏),该算法相对于不同活动的数量具有二次计算复杂性。据我们所知,这是为程序综合目的而制作的第一个过程发现算法。 BPI-Challenge 2020数据集和Karel编程数据集的实验表明,我们提出的算法不仅可以根据传统的过程发现指标,而且还可以从少量的执行轨迹中找到真正的潜在结构性程序来胜过归纳矿工。

A core task in process mining is process discovery which aims to learn an accurate process model from event log data. In this paper, we propose to use (block-) structured programs directly as target process models so as to establish connections to the field of program synthesis and facilitate the translation from abstract process models to executable processes, e.g., for robotic process automation. Furthermore, we develop a novel bottom-up agglomerative approach to the discovery of such structured program process models. In comparison with the popular top-down recursive inductive miner, our proposed agglomerative miner enjoys the similar theoretical guarantee to produce sound process models (without deadlocks and other anomalies) while exhibiting some advantages like avoiding silent activities and accommodating duplicate activities. The proposed algorithm works by iteratively applying a few graph rewriting rules to the directly-follows-graph of activities. For real-world (sparse) directly-follows-graphs, the algorithm has quadratic computational complexity with respect to the number of distinct activities. To our knowledge, this is the first process discovery algorithm that is made for the purpose of program synthesis. Experiments on the BPI-Challenge 2020 dataset and the Karel programming dataset have demonstrated that our proposed algorithm can outperform the inductive miner not only according to the traditional process discovery metrics but also in terms of the effectiveness in finding out the true underlying structured program from a small number of its execution traces.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源