论文标题
DeepSocs:用于片上系统的神经调度程序(SOC)资源调度
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling
论文作者
论文摘要
在本文中,我们提出了一种新型的调度解决方案,用于一类片上系统(SOC)系统,在该系统中,必须有效地安排异质的芯片资源(DSP,FPGA,GPU等),以通过有指示的无环形图表示,以连续到达层次结构工作。传统上,启发式算法已被广泛用于许多资源调度领域,而最早的最早完成时间(HEFT)在多年来一直是广泛的异构资源调度域中的主要最新技术。尽管人们很长期以来,但众所周知,类似重的算法容易受到少量噪音的影响。我们的深入强化学习(DRL)基于SOC调度程序(DeepSocs)能够在动态环境变化下学习“最佳”任务订购,克服了基于规则的调度程序的脆弱性,例如在不同类型的工作中具有明显更高绩效的HEFT。我们〜使用实时异质的SOC计划模拟器来描述DeepSOCS设计过程,讨论主要挑战,并提出了两个新型的神经网络设计功能,这些功能均超过了较高的重量:(i)层次结构的工作和任务编织嵌入; (ii)在状态空间中有效使用实时任务信息。此外,我们〜引入有效的技术来解决我们环境中面临的两个基本挑战:延迟后果和联合行动。通过一项广泛的模拟研究,我们表明我们的DeepSOC表现出与在逼真的噪声条件下具有更高水平的鲁棒性的工作执行时间的明显更高。我们〜在讨论我们的DeepSocs神经调度程序的潜在改进的情况下进行了讨论。
In this paper, we~present a novel scheduling solution for a class of System-on-Chip (SoC) systems where heterogeneous chip resources (DSP, FPGA, GPU, etc.) must be efficiently scheduled for continuously arriving hierarchical jobs with their tasks represented by a directed acyclic graph. Traditionally, heuristic algorithms have been widely used for many resource scheduling domains, and Heterogeneous Earliest Finish Time (HEFT) has been a dominating state-of-the-art technique across a broad range of heterogeneous resource scheduling domains over many years. Despite their long-standing popularity, HEFT-like algorithms are known to be vulnerable to a small amount of noise added to the environment. Our Deep Reinforcement Learning (DRL)-based SoC Scheduler (DeepSoCS), capable of learning the "best" task ordering under dynamic environment changes, overcomes the brittleness of rule-based schedulers such as HEFT with significantly higher performance across different types of jobs. We~describe a DeepSoCS design process using a real-time heterogeneous SoC scheduling emulator, discuss major challenges, and present two novel neural network design features that lead to outperforming HEFT: (i) hierarchical job- and task-graph embedding; and (ii) efficient use of real-time task information in the state space. Furthermore, we~introduce effective techniques to address two fundamental challenges present in our environment: delayed consequences and joint actions. Through an extensive simulation study, we~show that our DeepSoCS exhibits the significantly higher performance of job execution time than that of HEFT with a higher level of robustness under realistic noise conditions. We~conclude with a discussion of the potential improvements for our DeepSoCS neural scheduler.