上海交通大学学报(英文版) ›› 2017, Vol. 22 ›› Issue (2): 173-179.doi: 10.1007/s12204-017-1818-4

• • 上一篇    下一篇

Stock Market Forecasting with Financial Micro-Blog Based on Sentiment and Time Series Analysis

WANG Yinglin (王英林)   

  1. (Department of Computer Science and Technology, Shanghai University of Finance and Economics, Shanghai 200433, China)
  • 出版日期:2017-03-31 发布日期:2017-04-04
  • 通讯作者: WANG Yinglin (王英林) E-mail:wang.yinglin@shufe.edu.cn

Stock Market Forecasting with Financial Micro-Blog Based on Sentiment and Time Series Analysis

WANG Yinglin (王英林)   

  1. (Department of Computer Science and Technology, Shanghai University of Finance and Economics, Shanghai 200433, China)
  • Online:2017-03-31 Published:2017-04-04
  • Contact: WANG Yinglin (王英林) E-mail:wang.yinglin@shufe.edu.cn

摘要: During the past few decades, time series analysis has become one popular method for solving stock forecasting problem. However, depending only on stock index series makes the performance of the forecast not good enough, because many external factors which may be involved are not taken into consideration. As a way to deal with it, sentiment analysis on online textual data of stock market can generate a lot of valuable information as a complement which can be named as external indicators. In this paper, a new method which combines the time series of external indicators and the time series of stock index is provided. A special text processing algorithm is proposed to obtain a weighted sentiment time series. In the experiment, we obtain financial micro-blogs from some famous portal websites in China. After that, each micro-blog is segmented and preprocessed, and then the sentiment value is calculated for each day. Finally, an NARX time series model combined with the weighted sentiment series is created to forecast the future value of Shanghai Stock Exchange Composite Index (SSECI). The experiment shows that the new model makes an improvement in terms of the accuracy.

关键词: time series, micro-blog, sentiment analysis, parsing, neural network

Abstract: During the past few decades, time series analysis has become one popular method for solving stock forecasting problem. However, depending only on stock index series makes the performance of the forecast not good enough, because many external factors which may be involved are not taken into consideration. As a way to deal with it, sentiment analysis on online textual data of stock market can generate a lot of valuable information as a complement which can be named as external indicators. In this paper, a new method which combines the time series of external indicators and the time series of stock index is provided. A special text processing algorithm is proposed to obtain a weighted sentiment time series. In the experiment, we obtain financial micro-blogs from some famous portal websites in China. After that, each micro-blog is segmented and preprocessed, and then the sentiment value is calculated for each day. Finally, an NARX time series model combined with the weighted sentiment series is created to forecast the future value of Shanghai Stock Exchange Composite Index (SSECI). The experiment shows that the new model makes an improvement in terms of the accuracy.

Key words: time series, micro-blog, sentiment analysis, parsing, neural network

中图分类号: