Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Vision-Based automatic translation for South African Sign Language (SASL)

There are more than four million South Africans who are deaf and hard of hearing (DHH). However, most people with hearing abilities neither understand sign language nor know how to sign. This creates a communication barrier between the deaf and hard of hearing and people with hearing, to the disadva...

Full description

Saved in:
Bibliographic Details
Main Author: Setshekgamollo, Mokgadi
Other Authors: Verrinder, Robyn
Format: Thesis
Language:English
English
Published: Department of Electrical Engineering 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867614512696262656
access_status_str Open Access
author Setshekgamollo, Mokgadi
author2 Verrinder, Robyn
author_browse Setshekgamollo, Mokgadi
Verrinder, Robyn
author_facet Verrinder, Robyn
Setshekgamollo, Mokgadi
author_sort Setshekgamollo, Mokgadi
collection Thesis
description There are more than four million South Africans who are deaf and hard of hearing (DHH). However, most people with hearing abilities neither understand sign language nor know how to sign. This creates a communication barrier between the deaf and hard of hearing and people with hearing, to the disadvantage of the DHH. In 2018, South African Sign Language (SASL) became an official subject in South African schools and in 2023 it became South Africa's 12th official language. However, these implementations do not impose it on institutions and service providers. Although some provisions are made to cater to the needs of DHH people in the form of sign language interpreters, such interpreters are not always readily available. Sign language interpreters are also costly, charging in excess of R500.00 per hour. In this research, we developed the first vision-based Neural Sign Language Translation model for SASL as a first step towards bridging the communication gap. To this end, we recorded a sizeable parallel SASL and English corpus with the help of six sign language interpreters, three of which are native signers and constitute around 90% of the dataset. The dataset comprises 5047 sentences in the domain of government and politics recorded in a studio setting with a uniform green background. At an average of 3.83 seconds per segment, this equates to around five hours of sign language data. We conducted comprehensive experiments using various visual feature extraction architectures as well as translation architectures. We found that recurrent translation models outperform transformer models. We also investigated the impact of pretraining our feature extractor on a Continuous Sign Language Recognition task and fine-tuning on the SASL dataset and found that this is effective for improving feature extraction. Our best models achieved a BLEU-4 score of 1.35 on the SASL test set, a comparable performance to the How2Sign dataset with a best BLEU-4 score of 1.73 but much lower than on the RWTH-PHOENIX-Weather 2014T dataset which produced a BLEU-4 score of 13.23 without gloss supervision. Our experiments also showed that annotating fingerspelled words as individual letters improves the performance of the model. Our model might benefit from the collection of more data and the addition of gloss annotation. Our results on the SASL dataset are very poor and still very far from practical, indicating that more resources and experiments are required before we can remove the language barrier between the hearing and the Deaf. This would be most effectively achieved by working in collaboration with the Deaf Community to produce high quality datasets, annotations, and models.
format Thesis
id oai:open.uct.ac.za:11427/41355
institution University of Cape Town (South Africa)
language English
eng
last_indexed 2026-06-10T12:53:13.706Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2025
publishDateRange 2025
publishDateSort 2025
publisher Department of Electrical Engineering
publisherStr Department of Electrical Engineering
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/41355 Vision-Based automatic translation for South African Sign Language (SASL) Setshekgamollo, Mokgadi Verrinder, Robyn Tsoeu, Mohohlo Engineering There are more than four million South Africans who are deaf and hard of hearing (DHH). However, most people with hearing abilities neither understand sign language nor know how to sign. This creates a communication barrier between the deaf and hard of hearing and people with hearing, to the disadvantage of the DHH. In 2018, South African Sign Language (SASL) became an official subject in South African schools and in 2023 it became South Africa's 12th official language. However, these implementations do not impose it on institutions and service providers. Although some provisions are made to cater to the needs of DHH people in the form of sign language interpreters, such interpreters are not always readily available. Sign language interpreters are also costly, charging in excess of R500.00 per hour. In this research, we developed the first vision-based Neural Sign Language Translation model for SASL as a first step towards bridging the communication gap. To this end, we recorded a sizeable parallel SASL and English corpus with the help of six sign language interpreters, three of which are native signers and constitute around 90% of the dataset. The dataset comprises 5047 sentences in the domain of government and politics recorded in a studio setting with a uniform green background. At an average of 3.83 seconds per segment, this equates to around five hours of sign language data. We conducted comprehensive experiments using various visual feature extraction architectures as well as translation architectures. We found that recurrent translation models outperform transformer models. We also investigated the impact of pretraining our feature extractor on a Continuous Sign Language Recognition task and fine-tuning on the SASL dataset and found that this is effective for improving feature extraction. Our best models achieved a BLEU-4 score of 1.35 on the SASL test set, a comparable performance to the How2Sign dataset with a best BLEU-4 score of 1.73 but much lower than on the RWTH-PHOENIX-Weather 2014T dataset which produced a BLEU-4 score of 13.23 without gloss supervision. Our experiments also showed that annotating fingerspelled words as individual letters improves the performance of the model. Our model might benefit from the collection of more data and the addition of gloss annotation. Our results on the SASL dataset are very poor and still very far from practical, indicating that more resources and experiments are required before we can remove the language barrier between the hearing and the Deaf. This would be most effectively achieved by working in collaboration with the Deaf Community to produce high quality datasets, annotations, and models. 2025-04-03T12:56:15Z 2025-04-03T12:56:15Z 2024 2025-04-03T12:47:39Z Thesis / Dissertation Masters MSc http://hdl.handle.net/11427/41355 en eng application/pdf Department of Electrical Engineering Faculty of Engineering and the Built Environment
spellingShingle Engineering
Setshekgamollo, Mokgadi
Vision-Based automatic translation for South African Sign Language (SASL)
thesis_degree_str Master's
title Vision-Based automatic translation for South African Sign Language (SASL)
title_full Vision-Based automatic translation for South African Sign Language (SASL)
title_fullStr Vision-Based automatic translation for South African Sign Language (SASL)
title_full_unstemmed Vision-Based automatic translation for South African Sign Language (SASL)
title_short Vision-Based automatic translation for South African Sign Language (SASL)
title_sort vision based automatic translation for south african sign language sasl
topic Engineering
url http://hdl.handle.net/11427/41355
work_keys_str_mv AT setshekgamollomokgadi visionbasedautomatictranslationforsouthafricansignlanguagesasl