论文标题

使用机器学习和深度学习模型的基于时间序列分析的股票价格预测

A Time Series Analysis-Based Stock Price Prediction Using Machine Learning and Deep Learning Models

论文作者

Mehtab, Sidra, Sen, Jaydip

论文摘要

对于研究人员来说,预测未来股票价格一直是一项具有挑战性的任务。尽管有效的市场假设(EMH)的拥护者认为,无法设计任何可以准确预测股价运动的预测框架,但文献中有开创性的工作清楚地证明,在股票价格的时间序列中看似随机的运动模式可以高准确地预测。此类预测模型的设计需要选择适当的变量,变量的正确转换方法以及对模型参数的调整。在这项工作中,我们提出了一个非常健壮,准确的股票价格预测框架,该框架由统计,机器学习和深度学习模型组成。我们使用一家众所周知的公司,在印度国家证券交易所(NSE)中列出了一家著名的公司的每日股票价格数据。每天将颗粒数据汇总为三个插槽,汇总数据用于构建和培训预测模型。我们认为,使用统计,机器学习和深度学习方法组合的模型构建方法的聚集方法可以非常有效地从股价数据中的挥发性和随机运动模式中学习。我们基于统计和机器学习方法构建了八个分类和八个回归模型。除这些模型外,还建立了使用长期和长期内存(LSTM)网络的深度学习回归模型。关于这些模型的性能,已经提出了广泛的结果,并对结果进行了严格的分析。

Prediction of future movement of stock prices has always been a challenging task for the researchers. While the advocates of the efficient market hypothesis (EMH) believe that it is impossible to design any predictive framework that can accurately predict the movement of stock prices, there are seminal work in the literature that have clearly demonstrated that the seemingly random movement patterns in the time series of a stock price can be predicted with a high level of accuracy. Design of such predictive models requires choice of appropriate variables, right transformation methods of the variables, and tuning of the parameters of the models. In this work, we present a very robust and accurate framework of stock price prediction that consists of an agglomeration of statistical, machine learning and deep learning models. We use the daily stock price data, collected at five minutes interval of time, of a very well known company that is listed in the National Stock Exchange (NSE) of India. The granular data is aggregated into three slots in a day, and the aggregated data is used for building and training the forecasting models. We contend that the agglomerative approach of model building that uses a combination of statistical, machine learning, and deep learning approaches, can very effectively learn from the volatile and random movement patterns in a stock price data. We build eight classification and eight regression models based on statistical and machine learning approaches. In addition to these models, a deep learning regression model using a long-and-short-term memory (LSTM) network is also built. Extensive results have been presented on the performance of these models, and the results are critically analyzed.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源