论文标题
利用数据局部性以提高异质服务器群集的性能
Exploiting Data Locality to Improve Performance of Heterogeneous Server Clusters
论文作者
论文摘要
我们考虑在存在数据区域的大规模异质服务器系统中的负载平衡,这对可以将任务分配给哪个服务器的任务施加了约束。这些约束是通过服务器和调度器处理各种到达流的分配的两部分图自然捕获的。当任务到达时,相应的调度程序将其分配给$ d \ geq 2 $在符合上述约束的随机选择的服务器中最短的服务器。服务器处理速度是异质的,它们取决于服务器类型。对于一类宽类的两分图,我们表征了适当规模的占用过程的极限,无论是在过程级别还是稳定状态下,随着系统大小变大。使用这样的特征,我们表明数据局部性约束可用于显着提高异质系统的性能。这与具有均匀服务器的系统中的完全灵活系统中的异质服务器或数据局部性约束形成鲜明对比,这两种服务器都被观察到降低系统性能。广泛的数值实验证实了理论结果。
We consider load balancing in large-scale heterogeneous server systems in the presence of data locality that imposes constraints on which tasks can be assigned to which servers. The constraints are naturally captured by a bipartite graph between the servers and the dispatchers handling assignments of various arrival flows. When a task arrives, the corresponding dispatcher assigns it to a server with the shortest queue among $d\geq 2$ randomly selected servers obeying the above constraints. Server processing speeds are heterogeneous and they depend on the server-type. For a broad class of bipartite graphs, we characterize the limit of the appropriately scaled occupancy process, both on the process-level and in steady state, as the system size becomes large. Using such a characterization, we show that data locality constraints can be used to significantly improve the performance of heterogeneous systems. This is in stark contrast to either heterogeneous servers in a full flexible system or data locality constraints in systems with homogeneous servers, both of which have been observed to degrade the system performance. Extensive numerical experiments corroborate the theoretical results.