论文标题
团队以分布式电力控制的专家的深层混合
Team Deep Mixture of Experts for Distributed Power Control
论文作者
论文摘要
在无线网络的背景下,最近显示,可以共同培训多个DNN,以提供所需的协作行为,能够应对广泛的感知不确定性。特别是,已经确定DNN可用于得出相对于影响本地信息的本地信息(例如,在无线网络中的CSI)(例如,发射器)(例如发射机)做出决定的局部信息(例如CSI)的策略。尽管有希望,但实施这种方法的主要挑战是,信息噪声统计数据可能因代理而有所不同,更重要的是,这种统计数据可能在培训时无法获得或随着时间的推移可能随着时间的推移而发展,从而使繁重的重新训练必要。这种情况使人们希望设计一种“通用”机器学习模型,该模型可以对所有人进行一次培训,以便在任何未来的反馈噪声环境中进行分散的合作。考虑到这个目标,我们提出了一种启发的架构,该体系结构是从专家(MOE)模型的著名混合物中启发的,该模型以前用于在各种情况下(例如计算机视觉和语音识别)中用于非线性回归和分类任务。我们将分散的功率控制问题视为展示拟议模型的有效性并将其与其他功率控制算法进行比较的一个例子。我们展示了所谓的Team-Dmoe模型有效跟踪时变统计方案的能力。
In the context of wireless networking, it was recently shown that multiple DNNs can be jointly trained to offer a desired collaborative behaviour capable of coping with a broad range of sensing uncertainties. In particular, it was established that DNNs can be used to derive policies that are robust with respect to the information noise statistic affecting the local information (e.g. CSI in a wireless network) used by each agent (e.g. transmitter) to make its decision. While promising, a major challenge in the implementation of such method is that information noise statistics may differ from agent to agent and, more importantly, that such statistics may not be available at the time of training or may evolve over time, making burdensome retraining necessary. This situation makes it desirable to devise a "universal" machine learning model, which can be trained once for all so as to allow for decentralized cooperation in any future feedback noise environment. With this goal in mind, we propose an architecture inspired from the well-known Mixture of Experts (MoE) model, which was previously used for non-linear regression and classification tasks in various contexts, such as computer vision and speech recognition. We consider the decentralized power control problem as an example to showcase the validity of the proposed model and to compare it against other power control algorithms. We show the ability of the so called Team-DMoE model to efficiently track time-varying statistical scenarios.