论文标题
使用贝叶斯概念学习的数值序列预测
Numerical Sequence Prediction using Bayesian Concept Learning
论文作者
论文摘要
当人们学习数学模式或序列时,他们能够识别这些模式的概念(或规则)。学习了基本概念后,人类还能够将这些概念推广到其他数字,甚至可以识别以前看不见的这些规则的组合。当前最新的RNN架构(例如LSTMS)在预测连续数据的连续元素方面表现良好,但需要大量的培训示例。即使有了广泛的数据,这些模型也很难概括概念。从我们的行为研究中,我们还发现,人类能够无视噪声并确定产生损坏序列的基本规则。因此,我们提出了一个贝叶斯模型,该模型捕获了这些类似人类的学习能力,以在给定序列中预测下一个数字,比传统的LSTM更好。
When people learn mathematical patterns or sequences, they are able to identify the concepts (or rules) underlying those patterns. Having learned the underlying concepts, humans are also able to generalize those concepts to other numbers, so far as to even identify previously unseen combinations of those rules. Current state-of-the art RNN architectures like LSTMs perform well in predicting successive elements of sequential data, but require vast amounts of training examples. Even with extensive data, these models struggle to generalize concepts. From our behavioral study, we also found that humans are able to disregard noise and identify the underlying rules generating the corrupted sequences. We therefore propose a Bayesian model that captures these human-like learning capabilities to predict next number in a given sequence, better than traditional LSTMs.