PYSCHEDCL：利用异质数据并行系统中的并发

论文标题

PYSCHEDCL：利用异质数据并行系统中的并发

PySchedCL: Leveraging Concurrency in Heterogeneous Data-Parallel Systems

论文作者

Ghose, Anirban, Singh, Siddharth, Kulaharia, Vivek, Dokara, Lokesh, Maity, Srijeeta, Dey, Soumyajit

论文摘要

在过去的十年中，异质GPGPU平台展示的高性能计算功能已导致数据并行编程语言（例如CUDA和OPENCL）的普及。但是，这种语言涉及陡峭的学习曲线，并对异质平台中计算设备的基础架构有了广泛的了解。这导致了几个高性能计算框架的出现，这些计算框架提供了高级抽象，以简化异质平台上数据并行应用程序的开发。但是，此类框架做出的计划决定仅利用数据并行应用中的粗粒粒度并发。在本文中，我们提出了PyschedCl，该框架探讨了良好的并发意识的调度决策，以有效利用异质CPU/GPU体系结构的力量。％，现有HPC框架未提供的功能。我们通过对基于机器学习的推论应用进行广泛的实验评估来展示此类调度机制对现有粗粒颗粒动态调度方案的功效。

In the past decade, high performance compute capabilities exhibited by heterogeneous GPGPU platforms have led to the popularity of data parallel programming languages such as CUDA and OpenCL. Such languages, however, involve a steep learning curve as well as developing an extensive understanding of the underlying architecture of the compute devices in heterogeneous platforms. This has led to the emergence of several High Performance Computing frameworks which provide high-level abstractions for easing the development of data-parallel applications on heterogeneous platforms. However, the scheduling decisions undertaken by such frameworks only exploit coarse-grained concurrency in data parallel applications. In this paper, we propose PySchedCL, a framework which explores fine-grained concurrency aware scheduling decisions that harness the power of heterogeneous CPU/GPU architectures efficiently. %, a feature which is not provided by existing HPC frameworks. We showcase the efficacy of such scheduling mechanisms over existing coarse-grained dynamic scheduling schemes by conducting extensive experimental evaluations for a Machine Learning based inferencing application.

下载PDF全文

下载文献需遵守相关版权规定

论文标题