Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
Despite recent developments financial inclusion remains a large issue for the World's unbanked population. Financial institutions - both larger corporations and micro-finance companies - have begun to provide solutions for financial inclusion. The solutions are delivered using a combination of machi...
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | English |
| Published: |
Department of Statistical Sciences
2020
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867613160877326336 |
|---|---|
| access_status_str | Open Access |
| author | Stone, Devon |
| author2 | Britz, Stefan |
| author_browse | Britz, Stefan Stone, Devon |
| author_facet | Britz, Stefan Stone, Devon |
| author_sort | Stone, Devon |
| collection | Thesis |
| description | Despite recent developments financial inclusion remains a large issue for the World's unbanked population. Financial institutions - both larger corporations and micro-finance companies - have begun to provide solutions for financial inclusion. The solutions are delivered using a combination of machine learning and alternative data. This minor dissertation focuses on investigating whether alternative features generated from Short Messaging Service (SMS) data and Android application data contained on borrowers' devices can be used to improve the performance of loan default prediction models. The improvement gained by using alternative features is measured by comparing loan default prediction models trained using only traditional credit scoring data to models developed using a combination of traditional and alternative features. Furthermore, the paper investigates which of 4 machine learning techniques is best suited for loan default prediction. The 4 techniques investigated are logistic regression, random forests, extreme gradient boosting, and neural networks. Finally the paper identifies whether or not accurate loan default prediction models can be trained using only the alternative features developed throughout this minor dissertation. The results of the research show that alternative features improve the performance of loan default prediction across 5 performance indicators, namely overall prediction accuracy, repaid prediction accuracy, default prediction accuracy, F1 score, and AUC. Furthermore, extreme gradient boosting is identified as the most appropriate technique for loan default prediction. Finally, the research identifies that models trained using the alternative features developed throughout this project can accurately predict loan that have been repaid, the models do not accurately predict loans that have not been repaid. |
| format | Thesis |
| id | oai:open.uct.ac.za:11427/32377 |
| institution | University of Cape Town (South Africa) |
| language | eng |
| last_indexed | 2026-06-10T12:31:43.046Z |
| license_str | Not specified — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository |
| publishDate | 2020 |
| publishDateRange | 2020 |
| publishDateSort | 2020 |
| publisher | Department of Statistical Sciences |
| publisherStr | Department of Statistical Sciences |
| record_format | dspace |
| source_str | UCTD — University of Cape Town Open Access Repository |
| spelling | oai:open.uct.ac.za:11427/32377 An exploration of alternative features in micro-finance loan default prediction models Stone, Devon Britz, Stefan statistics Despite recent developments financial inclusion remains a large issue for the World's unbanked population. Financial institutions - both larger corporations and micro-finance companies - have begun to provide solutions for financial inclusion. The solutions are delivered using a combination of machine learning and alternative data. This minor dissertation focuses on investigating whether alternative features generated from Short Messaging Service (SMS) data and Android application data contained on borrowers' devices can be used to improve the performance of loan default prediction models. The improvement gained by using alternative features is measured by comparing loan default prediction models trained using only traditional credit scoring data to models developed using a combination of traditional and alternative features. Furthermore, the paper investigates which of 4 machine learning techniques is best suited for loan default prediction. The 4 techniques investigated are logistic regression, random forests, extreme gradient boosting, and neural networks. Finally the paper identifies whether or not accurate loan default prediction models can be trained using only the alternative features developed throughout this minor dissertation. The results of the research show that alternative features improve the performance of loan default prediction across 5 performance indicators, namely overall prediction accuracy, repaid prediction accuracy, default prediction accuracy, F1 score, and AUC. Furthermore, extreme gradient boosting is identified as the most appropriate technique for loan default prediction. Finally, the research identifies that models trained using the alternative features developed throughout this project can accurately predict loan that have been repaid, the models do not accurately predict loans that have not been repaid. 2020-11-11T11:13:06Z 2020-11-11T11:13:06Z 2020 2020-11-11T11:12:36Z Master Thesis Masters MSc http://hdl.handle.net/11427/32377 eng application/pdf Department of Statistical Sciences Faculty of Science |
| spellingShingle | statistics Stone, Devon An exploration of alternative features in micro-finance loan default prediction models |
| thesis_degree_str | Master's |
| title | An exploration of alternative features in micro-finance loan default prediction models |
| title_full | An exploration of alternative features in micro-finance loan default prediction models |
| title_fullStr | An exploration of alternative features in micro-finance loan default prediction models |
| title_full_unstemmed | An exploration of alternative features in micro-finance loan default prediction models |
| title_short | An exploration of alternative features in micro-finance loan default prediction models |
| title_sort | exploration of alternative features in micro finance loan default prediction models |
| topic | statistics |
| url | http://hdl.handle.net/11427/32377 |
| work_keys_str_mv | AT stonedevon anexplorationofalternativefeaturesinmicrofinanceloandefaultpredictionmodels AT stonedevon explorationofalternativefeaturesinmicrofinanceloandefaultpredictionmodels |