Press releases represent a valuable resource for financial trading and have long been exploited by researchers for the development of automatic stock price predictors. We hereby propose an NLP-based approach to generate industry-specific lexicons from news documents, with the goal of dynamically capturing, on a daily basis, the correlation between words used in these documents and stock price fluctuations. Furthermore, we design a binary classification algorithm that leverages on our lexicons to predict the magnitude of future price changes, for individual companies. Then, we validate our approach through an experimental study conducted on three different industries of the Standard & Poor’s 500 index, by processing press news published by globally renowned sources, and collected within the Dow Jones DNA dataset. Classification results let us quantify the mutual dependence between words and prices, and help us estimate the predictive power of our lexicons.

Dynamic Industry-Specific Lexicon Generation for Stock Market Forecast

Salvatore Carta;Luca Piras;Alessandro Sebastian Podda;Diego Reforgiato Recupero
2020-01-01

Abstract

Press releases represent a valuable resource for financial trading and have long been exploited by researchers for the development of automatic stock price predictors. We hereby propose an NLP-based approach to generate industry-specific lexicons from news documents, with the goal of dynamically capturing, on a daily basis, the correlation between words used in these documents and stock price fluctuations. Furthermore, we design a binary classification algorithm that leverages on our lexicons to predict the magnitude of future price changes, for individual companies. Then, we validate our approach through an experimental study conducted on three different industries of the Standard & Poor’s 500 index, by processing press news published by globally renowned sources, and collected within the Dow Jones DNA dataset. Classification results let us quantify the mutual dependence between words and prices, and help us estimate the predictive power of our lexicons.
File in questo prodotto:
File Dimensione Formato  
Dynamic_Industry_specific_Lexicon_Generation_for_Stock_Market_Forecast.pdf

Solo gestori archivio

Tipologia: versione pre-print
Dimensione 3.46 MB
Formato Adobe PDF
3.46 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/334813
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? ND
social impact