论文标题

DCAF:在线服务系统的动态计算分配框架

DCAF: A Dynamic Computation Allocation Framework for Online Serving System

论文作者

Jiang, Biye, Zhang, Pengye, Chen, Rihan, Dai, Binding, Luo, Xinchen, Yang, Yin, Wang, Guan, Zhou, Guorui, Zhu, Xiaoqiang, Gai, Kun

论文摘要

现代大规模系统(例如推荐系统和在线广告系统)建立在计算密集型基础架构的基础上。这些应用程序的典型目标是最大化总收入,例如GMV〜(总商品量),在有限的计算资源下。通常,在线服务系统遵循多阶段的级联体系结构,该体系结构由多个阶段组成,包括检索,预先排名,排名等。这些阶段通常手动将资源分配给特定的计算功率预算,这需要服务配置才能相应地适应。结果,对于最大化总收入,现有系统很容易属于次优的解决方案。限制是由于面孔,尽管流量请求的价值差异很大,但在线服务系统仍然在它们之间花费平等的计算能力。 在本文中,我们介绍了一个新颖的想法,即在线服务系统可以对每个流量请求进行不同的处理,并根据其价值分配“个性化”计算资源。我们将此资源分配问题作为背包问题提出,并提出了动态计算分配框架〜(DCAF)。根据一些一般假设,DCAF可以从理论上保证系统可以在给定计算预算内最大化总收入。 DCAF带来了重大改进,并已在淘宝的展示广告系统中部署,以服务主要的流量。使用DCAF,我们能够通过减少20 \%的计算资源来维持相同的业务绩效。

Modern large-scale systems such as recommender system and online advertising system are built upon computation-intensive infrastructure. The typical objective in these applications is to maximize the total revenue, e.g. GMV~(Gross Merchandise Volume), under a limited computation resource. Usually, the online serving system follows a multi-stage cascade architecture, which consists of several stages including retrieval, pre-ranking, ranking, etc. These stages usually allocate resource manually with specific computing power budgets, which requires the serving configuration to adapt accordingly. As a result, the existing system easily falls into suboptimal solutions with respect to maximizing the total revenue. The limitation is due to the face that, although the value of traffic requests vary greatly, online serving system still spends equal computing power among them. In this paper, we introduce a novel idea that online serving system could treat each traffic request differently and allocate "personalized" computation resource based on its value. We formulate this resource allocation problem as a knapsack problem and propose a Dynamic Computation Allocation Framework~(DCAF). Under some general assumptions, DCAF can theoretically guarantee that the system can maximize the total revenue within given computation budget. DCAF brings significant improvement and has been deployed in the display advertising system of Taobao for serving the main traffic. With DCAF, we are able to maintain the same business performance with 20\% computation resource reduction.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源