In recent years, machine learning algorithms have been successfully employed to leverage the potential of identifying hidden patterns of financial market behavior and, consequently, have become the norm in financial applications. This thesis proposes a statistical arbitrage trading strategy with two key elements: an ensemble of regression algorithms for asset return prediction, followed by a dynamic asset selection. More specifically, the extreme heterogeneity of the ensemble is achieved by ensuring model diversity using state-of-the-art machine learning algorithms, data diversity by using diverse input features, and method diversity}by using individual models for each asset, as well as models that learn cross-sectional across multiple assets. Moreover, the ensemble is constructed using a novel model selection approach. Then, the predicted results are fed into a quality assurance mechanism that prunes assets that have poor forecasting performance in recent history. However, relying blindly on machine learning algorithms for decision-making can have negative consequences, especially in critical areas such as finance. At the same time, it is well acknowledged that converting data into actionable insights can be a complex task. As a particular example, the practitioners in the financial domain, find it difficult to manage large quantities of data linked to an impressive number of stocks. Given these motivations, this dissertation introduces machine learning approaches based on eXplainable Artificial Intelligence techniques that are integrated into a financial forecasting and trading pipeline. Specifically, there are presented three strategies for excluding irrelevant features for the prediction task, with the goal being to increase the prediction performance not only at the stock level but also globally, at the stock-set level. For the proposed trading strategies, the analysis that was carried out reveals that the use of the feature selection approaches improve the portfolio performance. This is achieved by using only the predictive signals which are less noisy than the original content of the entire feature set while preserving enough information. To demonstrate the utility of the proposed approaches, their performance is evaluated in real-world scenarios, i.e., historical data of stocks composing the S&P500 index or custom stock-set combining various other indexes. The thesis contributes to the scientific literature in terms of results and the novel manner of exploiting machine learning in an algorithmic trading context.

Artificial Intelligence-Driven Systems for Financial Forecasting

STANCIU, MARIA MADALINA
2022-04-22

Abstract

In recent years, machine learning algorithms have been successfully employed to leverage the potential of identifying hidden patterns of financial market behavior and, consequently, have become the norm in financial applications. This thesis proposes a statistical arbitrage trading strategy with two key elements: an ensemble of regression algorithms for asset return prediction, followed by a dynamic asset selection. More specifically, the extreme heterogeneity of the ensemble is achieved by ensuring model diversity using state-of-the-art machine learning algorithms, data diversity by using diverse input features, and method diversity}by using individual models for each asset, as well as models that learn cross-sectional across multiple assets. Moreover, the ensemble is constructed using a novel model selection approach. Then, the predicted results are fed into a quality assurance mechanism that prunes assets that have poor forecasting performance in recent history. However, relying blindly on machine learning algorithms for decision-making can have negative consequences, especially in critical areas such as finance. At the same time, it is well acknowledged that converting data into actionable insights can be a complex task. As a particular example, the practitioners in the financial domain, find it difficult to manage large quantities of data linked to an impressive number of stocks. Given these motivations, this dissertation introduces machine learning approaches based on eXplainable Artificial Intelligence techniques that are integrated into a financial forecasting and trading pipeline. Specifically, there are presented three strategies for excluding irrelevant features for the prediction task, with the goal being to increase the prediction performance not only at the stock level but also globally, at the stock-set level. For the proposed trading strategies, the analysis that was carried out reveals that the use of the feature selection approaches improve the portfolio performance. This is achieved by using only the predictive signals which are less noisy than the original content of the entire feature set while preserving enough information. To demonstrate the utility of the proposed approaches, their performance is evaluated in real-world scenarios, i.e., historical data of stocks composing the S&P500 index or custom stock-set combining various other indexes. The thesis contributes to the scientific literature in terms of results and the novel manner of exploiting machine learning in an algorithmic trading context.
22-apr-2022
File in questo prodotto:
File Dimensione Formato  
PhD_Thesis_Cover_ay_2020_21_APR-combined.pdf

accesso aperto

Descrizione: Artificial Intelligence Driven Systems for Financial Forecasting
Tipologia: Tesi di dottorato
Dimensione 4.53 MB
Formato Adobe PDF
4.53 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/333450
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact