论文标题
复杂和空间相关的功能数据的最新发展
Recent Developments in Complex and Spatially Correlated Functional Data
论文作者
论文摘要
随着大规模收集高维和高频数据,新的统计模型的发展正在推动。功能数据分析通过假设数据是连续的函数,例如实现连续过程(曲线)或连续的随机场(表面),并且每个曲线或表面被认为是单个观察力,则提供了处理大规模和复杂数据所需的统计方法。在这里,当数据复杂且空间相关时,我们提供了功能数据分析的概述。我们提供相应功能随机变量的第一和第二矩的定义和估计器。我们提出了两种主要方法:第一个假设数据是对功能随机场的实现,即每个观察值是具有空间分量的曲线。我们称它们为“空间功能数据”。第二种方法假设数据是随着时间的推移观察到的连续确定性字段。在这种情况下,一个观察是表面或歧管,我们称它们为“表面时间序列”。对于两种方法,我们描述可用于统计分析的软件。我们还使用高分辨率风速模拟数据集提出了一个数据图,作为两种方法的示例。功能数据方法提供了新的数据分析范式,其中连续过程或随机字段被视为单个实体。我们认为这种方法在大数据的背景下非常有价值。
As high-dimensional and high-frequency data are being collected on a large scale, the development of new statistical models is being pushed forward. Functional data analysis provides the required statistical methods to deal with large-scale and complex data by assuming that data are continuous functions, e.g., a realization of a continuous process (curves) or continuous random fields (surfaces), and that each curve or surface is considered as a single observation. Here, we provide an overview of functional data analysis when data are complex and spatially correlated. We provide definitions and estimators of the first and second moments of the corresponding functional random variable. We present two main approaches: The first assumes that data are realizations of a functional random field, i.e., each observation is a curve with a spatial component. We call them 'spatial functional data'. The second approach assumes that data are continuous deterministic fields observed over time. In this case, one observation is a surface or manifold, and we call them 'surface time series'. For the two approaches, we describe software available for the statistical analysis. We also present a data illustration, using a high-resolution wind speed simulated dataset, as an example of the two approaches. The functional data approach offers a new paradigm of data analysis, where the continuous processes or random fields are considered as a single entity. We consider this approach to be very valuable in the context of big data.