Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or inter...
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | English |
| Published: |
Department of Computer Science
2024
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867613228776816640 |
|---|---|
| access_status_str | Open Access |
| author | Boakes, Jamie |
| author2 | Moodley, Deshendran |
| author_browse | Boakes, Jamie Moodley, Deshendran |
| author_facet | Moodley, Deshendran Boakes, Jamie |
| author_sort | Boakes, Jamie |
| collection | Thesis |
| description | The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE. |
| format | Thesis |
| id | oai:open.uct.ac.za:11427/40761 |
| institution | University of Cape Town (South Africa) |
| language | eng |
| last_indexed | 2026-06-10T12:32:47.627Z |
| license_str | Not specified — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository |
| publishDate | 2024 |
| publishDateRange | 2024 |
| publishDateSort | 2024 |
| publisher | Department of Computer Science |
| publisherStr | Department of Computer Science |
| record_format | dspace |
| source_str | UCTD — University of Cape Town Open Access Repository |
| spelling | oai:open.uct.ac.za:11427/40761 An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE Boakes, Jamie Moodley, Deshendran Information Technology The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE. 2024-12-02T10:46:34Z 2024-12-02T10:46:34Z 2024 2024-11-28T10:53:10Z Thesis / Dissertation Masters MSc http://hdl.handle.net/11427/40761 eng application/pdf Department of Computer Science Faculty of Science University of Cape Town |
| spellingShingle | Information Technology Boakes, Jamie An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE |
| thesis_degree_str | Master's |
| title | An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE |
| title_full | An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE |
| title_fullStr | An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE |
| title_full_unstemmed | An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE |
| title_short | An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE |
| title_sort | analysis of the performance and interpretability of machine learning classification algorithms to predict long term share returns on the jse |
| topic | Information Technology |
| url | http://hdl.handle.net/11427/40761 |
| work_keys_str_mv | AT boakesjamie ananalysisoftheperformanceandinterpretabilityofmachinelearningclassificationalgorithmstopredictlongtermsharereturnsonthejse AT boakesjamie analysisoftheperformanceandinterpretabilityofmachinelearningclassificationalgorithmstopredictlongtermsharereturnsonthejse |