WFBENCH：科学工作流程基准的自动生成

论文标题

WFBENCH：科学工作流程基准的自动生成

WfBench: Automated Generation of Scientific Workflow Benchmarks

论文作者

Coleman, Tainã, Casanova, Henri, Maheshwari, Ketan, Pottier, Loïc, Wilkinson, Sean R., Wozniak, Justin, Suter, Frédéric, Shankar, Mallikarjun, da Silva, Rafael Ferreira

论文摘要

具有高度计算要求的科学工作流的流行率要求其在各种分布式计算平台上执行，包括大规模的领导级高性能计算（HPC）群集。为了处理工作流执行的部署，监视和优化，在过去十年中已经开发了许多工作流程系统。需要工作流基准测试，可用于评估当前和未来软件堆栈和硬件平台上工作流程系统的性能。我们提出了一个现实的工作流基准规范的生成器，可以将其转换为基准代码，以使用当前的工作流系统执行。我们的方法生成具有任意性能特征（CPU，内存和I/O使用）的工作流程任务，并根据生产工作流程中看到的任务依赖性结构进行现实的任务依赖性结构。我们提出了实验结果，表明我们的方法生成了代表生产工作流程的基准，并进行了案例研究，以证明我们生成的基准测试的使用和实用性，以评估不同配置方案下工作流程系统的性能。

The prevalence of scientific workflows with high computational demands calls for their execution on various distributed computing platforms, including large-scale leadership-class high-performance computing (HPC) clusters. To handle the deployment, monitoring, and optimization of workflow executions, many workflow systems have been developed over the past decade. There is a need for workflow benchmarks that can be used to evaluate the performance of workflow systems on current and future software stacks and hardware platforms. We present a generator of realistic workflow benchmark specifications that can be translated into benchmark code to be executed with current workflow systems. Our approach generates workflow tasks with arbitrary performance characteristics (CPU, memory, and I/O usage) and with realistic task dependency structures based on those seen in production workflows. We present experimental results that show that our approach generates benchmarks that are representative of production workflows, and conduct a case study to demonstrate the use and usefulness of our generated benchmarks to evaluate the performance of workflow systems under different configuration scenarios.

下载PDF全文

下载文献需遵守相关版权规定

论文标题