论文标题
关于CTR预测的概念漂移的改编
On the Adaptation to Concept Drift for CTR Prediction
论文作者
论文摘要
点击率(CTR)预测是Web搜索,推荐系统和在线广告显示中的至关重要任务。在实际应用中,CTR模型通常与高速用户生成的数据流一起使用,其潜在的分布随着时间的推移而迅速变化。概念漂移问题不可避免地存在于这些流数据中,这可能导致及时性问题导致性能退化。为了确保模型的新鲜度,在现实世界的生产系统中已广泛采用增量学习。但是,增量更新很难在适应性捕获快速变化趋势和保留常识的概括能力之间达到CTR模型的平衡。在本文中,我们提出了专家的自适应混合物(Adamoe),这是一个新框架,可通过CTR预测数据流中的统计加权策略来减轻概念漂移问题。基准和现实世界中的工业数据集以及在线A/B测试的广泛离线实验表明,我们的Adamoe明显优于所有考虑的所有增量学习框架。
Click-through rate (CTR) prediction is a crucial task in web search, recommender systems, and online advertisement displaying. In practical application, CTR models often serve with high-speed user-generated data streams, whose underlying distribution rapidly changing over time. The concept drift problem inevitably exists in those streaming data, which can lead to performance degradation due to the timeliness issue. To ensure model freshness, incremental learning has been widely adopted in real-world production systems. However, it is hard for the incremental update to achieve the balance of the CTR models between the adaptability to capture the fast-changing trends and generalization ability to retain common knowledge. In this paper, we propose adaptive mixture of experts (AdaMoE), a new framework to alleviate the concept drift problem by statistical weighting policy in the data stream of CTR prediction. The extensive offline experiments on both benchmark and a real-world industrial dataset, as well as an online A/B testing show that our AdaMoE significantly outperforms all incremental learning frameworks considered.