论文标题
高维时间序列数据集的建筑优化和功能学习
Architectural Optimization and Feature Learning for High-Dimensional Time Series Datasets
论文作者
论文摘要
随着我们感知增强的能力,我们正在从数据贫穷的问题过渡,其中核心问题是缺乏相关数据,即数据丰富的问题,其中核心问题是在观察海中识别一些相关特征。通过在重力波天体物理学中应用的激励,我们研究了从检测器及其环境中丰富的测量值收集的引力波检测器中瞬时噪声伪影的问题。我们认为,功能学习 - 在数据中优化了哪些相关功能,对于实现高准确性至关重要。与以前使用固定的手工制作功能的最新现状相比,我们介绍了将错误率降低超过60%的模型。功能学习之所以有用,不仅是因为它提高了预测任务的性能;结果提供了有关与感兴趣现象相关的模式的宝贵信息,否则这些现象将是无法发现的。在我们的应用中,发现与瞬态噪声相关的功能提供了有关其起源的诊断信息,并提出了缓解策略。在高维环境中学习具有挑战性。通过对各种体系结构的实验,我们确定了成功模型中的两个关键因素:稀疏性,用于在高维观测中选择相关变量;和深度,这赋予了处理复杂相互作用和相对于时间变化的鲁棒性的灵活性。我们通过对实际检测器数据进行系统的实验来说明它们的意义。我们的结果提供了对机器学习社区中常见假设的实验性佐证,并直接适用于提高我们感知引力波的能力,以及具有类似高维,嘈杂或部分无关数据的许多其他问题设置。
As our ability to sense increases, we are experiencing a transition from data-poor problems, in which the central issue is a lack of relevant data, to data-rich problems, in which the central issue is to identify a few relevant features in a sea of observations. Motivated by applications in gravitational-wave astrophysics, we study the problem of predicting the presence of transient noise artifacts in a gravitational wave detector from a rich collection of measurements from the detector and its environment. We argue that feature learning--in which relevant features are optimized from data--is critical to achieving high accuracy. We introduce models that reduce the error rate by over 60% compared to the previous state of the art, which used fixed, hand-crafted features. Feature learning is useful not only because it improves performance on prediction tasks; the results provide valuable information about patterns associated with phenomena of interest that would otherwise be undiscoverable. In our application, features found to be associated with transient noise provide diagnostic information about its origin and suggest mitigation strategies. Learning in high-dimensional settings is challenging. Through experiments with a variety of architectures, we identify two key factors in successful models: sparsity, for selecting relevant variables within the high-dimensional observations; and depth, which confers flexibility for handling complex interactions and robustness with respect to temporal variations. We illustrate their significance through systematic experiments on real detector data. Our results provide experimental corroboration of common assumptions in the machine-learning community and have direct applicability to improving our ability to sense gravitational waves, as well as to many other problem settings with similarly high-dimensional, noisy, or partly irrelevant data.