论文标题
多元极端的稀疏结构
Sparse Structures for Multivariate Extremes
论文作者
论文摘要
极值统计提供了罕见事件的较小发生概率的准确估计。尽管单变量极端的理论和统计工具是完善的,但对于高维和复杂数据集的方法仍然很少。直到最近才建立了与机器学习,图形模型和高维统计的其他领域的稀疏性和连接的适当概念。本文回顾了与罕见事件中稀疏模式的检测和建模有关的新研究领域。我们首先描述了多变量随机矢量的最大观察结果之间可能出现的不同形式的极端依赖形式。然后,我们讨论当前的研究主题,包括聚类,主成分分析和极端图形建模。还解决了可能同时极端的变量组的识别。这些方法用洪水风险评估的应用进行说明。
Extreme value statistics provides accurate estimates for the small occurrence probabilities of rare events. While theory and statistical tools for univariate extremes are well-developed, methods for high-dimensional and complex data sets are still scarce. Appropriate notions of sparsity and connections to other fields such as machine learning, graphical models and high-dimensional statistics have only recently been established. This article reviews the new domain of research concerned with the detection and modeling of sparse patterns in rare events. We first describe the different forms of extremal dependence that can arise between the largest observations of a multivariate random vector. We then discuss the current research topics including clustering, principal component analysis and graphical modeling for extremes. Identification of groups of variables which can be concomitantly extreme is also addressed. The methods are illustrated with an application to flood risk assessment.