论文标题

内核可靠的假设检验

Kernel Robust Hypothesis Testing

论文作者

Sun, Zhongchang, Zou, Shaofeng

论文摘要

研究了可靠的假设检验的问题,在零值和替代假设下,假定数据生成分布在某些不确定性集中,目标是设计一个在不确定性集的最差分布下表现良好的测试。在本文中,使用内核法以数据驱动的方式构建了不确定性集,即它们分别围绕着来自NULL和替代假设的训练样本的经验分布;并通过在再现内核希尔伯特空间中的内核平均分布嵌入之间的距离(即最大平均差异(MMD))受到限制。研究了贝叶斯环境和Neyman-Pearson环境。对于贝叶斯设置,目标是最大程度地减少最坏情况误差概率,首先在字母为有限时获得最佳测试。当字母内无限时,提出了可进行的近似值来量化最差的平均误差概率,并进一步将内核平滑法进一步应用于设计测试,以概括为未见样品。还提出了直接强大的内核测试,并被证明是指数一致的。对于Neyman-Pearson的设置,目标是最大程度地减少未检测的最坏情况的可能性,但对最坏情况的错误警报的概率受到限制,提出了有效的稳健内核测试,并证明是渐近的最佳选择。提供数值结果以证明所提出的可靠测试的性能。

The problem of robust hypothesis testing is studied, where under the null and the alternative hypotheses, the data-generating distributions are assumed to be in some uncertainty sets, and the goal is to design a test that performs well under the worst-case distributions over the uncertainty sets. In this paper, uncertainty sets are constructed in a data-driven manner using kernel method, i.e., they are centered around empirical distributions of training samples from the null and alternative hypotheses, respectively; and are constrained via the distance between kernel mean embeddings of distributions in the reproducing kernel Hilbert space, i.e., maximum mean discrepancy (MMD). The Bayesian setting and the Neyman-Pearson setting are investigated. For the Bayesian setting where the goal is to minimize the worst-case error probability, an optimal test is firstly obtained when the alphabet is finite. When the alphabet is infinite, a tractable approximation is proposed to quantify the worst-case average error probability, and a kernel smoothing method is further applied to design test that generalizes to unseen samples. A direct robust kernel test is also proposed and proved to be exponentially consistent. For the Neyman-Pearson setting, where the goal is to minimize the worst-case probability of miss detection subject to a constraint on the worst-case probability of false alarm, an efficient robust kernel test is proposed and is shown to be asymptotically optimal. Numerical results are provided to demonstrate the performance of the proposed robust tests.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源