论文标题
BART的艺术:高维度非均匀平稳性的最小值优化性
The art of BART: Minimax optimality over nonhomogeneous smoothness in high dimension
论文作者
论文摘要
许多用于功能估计的渐近最小值程序通常依赖于某种任意和限制性的假设,例如各向同性或空间同质性。这项工作增强了对贝叶斯添加剂回归树在实质性放松的平稳性假设下的理论理解。当回归函数具有各向异性平滑度时,我们提供了贝叶斯森林的渐近最优性和后收缩的综合研究,可能会随着功能域而变化。回归函数也可能是不连续的。我们介绍了一类新的稀疏{\ em分段异质性各向异性}hölder函数,并在$ l_2 $ -loss下的高维情况下得出其最小值的估计下限。然后,我们发现,在高维场景中稀疏的估计中,贝叶斯树先进的贝叶斯树先选择了稀疏的子集选择,适应未知的异质平滑度,不连续性和稀疏性。这些结果表明,贝叶斯森林非常适合更一般的估计问题,这些问题将使其他默认的机器学习工具(例如高斯工艺)次优。我们的数值研究表明,贝叶斯森林通常比其他竞争对手(例如随机森林和深层神经网络)表现出色,这些森林据信可以很好地适合不连续或复杂的光滑功能。除了非参数回归之外,我们还使用本研究中开发的技术检查了贝叶斯森林的后部收缩进行密度估计和二元分类。
Many asymptotically minimax procedures for function estimation often rely on somewhat arbitrary and restrictive assumptions such as isotropy or spatial homogeneity. This work enhances the theoretical understanding of Bayesian additive regression trees under substantially relaxed smoothness assumptions. We provide a comprehensive study of asymptotic optimality and posterior contraction of Bayesian forests when the regression function has anisotropic smoothness that possibly varies over the function domain. The regression function can also be possibly discontinuous. We introduce a new class of sparse {\em piecewise heterogeneous anisotropic} Hölder functions and derive their minimax lower bound of estimation in high-dimensional scenarios under the $L_2$-loss. We then find that the Bayesian tree priors, coupled with a Dirichlet subset selection prior for sparse estimation in high-dimensional scenarios, adapt to unknown heterogeneous smoothness, discontinuity, and sparsity. These results show that Bayesian forests are uniquely suited for more general estimation problems that would render other default machine learning tools, such as Gaussian processes, suboptimal. Our numerical study shows that Bayesian forests often outperform other competitors such as random forests and deep neural networks, which are believed to work well for discontinuous or complicated smooth functions. Beyond nonparametric regression, we also examined posterior contraction of Bayesian forests for density estimation and binary classification using the technique developed in this study.