论文标题

组织学习的同伴小组:与实用约束的聚类

Peer groups for organisational learning: clustering with practical constraints

论文作者

Kennedy, Daniel William, Cameron, Jessica, Wu, Paul Pao-Yen, Mengersen, Kerrie

论文摘要

同行组在许多领域用于组织学习,政策实施和基准测试。聚类提供了一种用于构建有意义的同伴组的统计,数据驱动的方法,但是同伴组必须与诸如大小和稳定性考虑的业务约束兼容。此外,统计同伴组是由许多不同变量构建的,并且很难理解,尤其是对于非统计的受众。我们开发了将业务限制应用于聚类解决方案的方法,并允许决策者选择统计拟合优惠和合规性与业务限制之间的平衡。利用了几种工具来识别同伴组中的复杂区别特征,并开发了许多可视化来解释非统计受众的高维群集。在一项案例研究中,需要小组规模很小($ \ leq 100 $成员),我们将约束聚类应用于随后的两年中嘈杂的高维数据集,以确保群集在几年之间足够稳定。我们的方法不仅满足了测试数据的聚类约束,而且在随后几年之间保持拟合优度和稳定性之间几乎单调的负相关。我们在案例研究的背景下证明了如何将特征在统计知识具有丰富且有限的不同利益相关者中清晰地传达给群集之间的特征。

Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible with business constraints such as size and stability considerations. Additionally, statistical peer groups are constructed from many different variables, and can be difficult to understand, especially for non-statistical audiences. We developed methodology to apply business constraints to clustering solutions and allow the decision-maker to choose the balance between statistical goodness-of-fit and conformity to business constraints. Several tools were utilised to identify complex distinguishing features in peer groups, and a number of visualisations are developed to explain high-dimensional clusters for non-statistical audiences. In a case study where peer group size was required to be small ($\leq 100$ members), we applied constrained clustering to a noisy high-dimensional data-set over two subsequent years, ensuring that the clusters were sufficiently stable between years. Our approach not only satisfied clustering constraints on the test data, but maintained an almost monotonic negative relationship between goodness-of-fit and stability between subsequent years. We demonstrated in the context of the case study how distinguishing features between clusters can be communicated clearly to different stakeholders with substantial and limited statistical knowledge.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源