Paper Title

Scalable Multivariate Histograms

Authors

Raazesh Sainudiin, Warwick Tucker, Tilo Wiklund

Abstract

We give a distributed variant of an adaptive histogram estimation procedure previously developed by the first author. The procedure is based on regular pavings and is known to have numerous appealing statistical and arithmetical properties. The distributed version makes it possible to process data sets significantly larger than was previously possible. We provide a prototype implementation under a permissive license.
