论文标题

PSPICE:复杂事件处理的部分匹配脱落

pSPICE: Partial Match Shedding for Complex Event Processing

论文作者

Slo, Ahmad, Bhowmik, Sukanya, Flaig, Albert, Rothermel, Kurt

论文摘要

复杂的事件处理(CEP)系统连续处理输入事件流以检测模式。随着时间的流逝,输入事件速率可能会波动并超越系统的功能。减少系统过载的一种方法是使用负载脱落。在本文中,我们提出了针对CEP系统的负载脱落策略,该策略丢弃了CEP操作员内部状态(又称部分匹配)的一部分,以维持给定的延迟绑定。这里的关键问题是要删除多少部分和哪些部分匹配,以便在最小化结果质量的降解时保持给定的潜伏期界限。在流处理域中,已经提出了不同的负载脱落策略,主要取决于单个元素的重要性。但是,随着CEP系统执行模式检测,事件的重要性也受流中其他事件的影响。我们的负载脱落策略使用马尔可夫链和马尔可夫奖励过程来预测部分匹配的效用/重要性,以确定要删除的效用。此外,我们以最小化负载脱落的开销的方式代表实用程序。此外,我们提供算法来决定何时开始放弃部分匹配以及要丢弃多少部分匹配。通过对三个现实世界数据集和几个代表性查询的广泛评估我们的方法,我们表明我们的负载策略对结果质量的不利影响大大远低于最先进的负载脱落策略的影响。

Complex event processing (CEP) systems continuously process input event streams to detect patterns. Over time, the input event rate might fluctuate and overshoot the system's capabilities. One way to reduce the overload on the system is to use load shedding. In this paper, we propose a load shedding strategy for CEP systems which drops a portion of the CEP operator's internal state (a.k.a. partial matches) to maintain a given latency bound. The crucial question here is how many and which partial matches to drop so that a given latency bound is maintained while minimizing the degradation in the quality of results. In the stream processing domain, different load shedding strategies have been proposed that mainly depend on the importance of individual tuples. However, as CEP systems perform pattern detection, the importance of events is also influenced by other events in the stream. Our load shedding strategy uses Markov chain and Markov reward process to predict the utility/importance of partial matches to determine the ones to be dropped. In addition, we represent the utility in a way that minimizes the overhead of load shedding. Furthermore, we provide algorithms to decide when to start dropping partial matches and how many partial matches to drop. By extensively evaluating our approach on three real-world datasets and several representative queries, we show that the adverse impact of our load shedding strategy on the quality of results is considerably less than the impact of state-of-the-art load shedding strategies.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源