Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource]

Includes bibliographical references.

Saved in:
Bibliographic Details
Main Author: Idowu, Modupeola Florence
Other Authors: Mbogho, Audrey J W
Format: Thesis
Language:English
Published: Department of Computer Science 2014
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613286348881920
access_status_str Open Access
author Idowu, Modupeola Florence
author2 Mbogho, Audrey J W
author_browse Idowu, Modupeola Florence
Mbogho, Audrey J W
author_facet Mbogho, Audrey J W
Idowu, Modupeola Florence
author_sort Idowu, Modupeola Florence
collection Thesis
description Includes bibliographical references.
format Thesis
id oai:open.uct.ac.za:11427/9042
institution University of Cape Town (South Africa)
language eng
last_indexed 2026-06-10T12:33:43.673Z
license_str Not specified — see source repository
provenance_str_mv Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate 2014
publishDateRange 2014
publishDateSort 2014
publisher Department of Computer Science
publisherStr Department of Computer Science
record_format dspace
source_str UCTD — University of Cape Town Open Access Repository
spelling oai:open.uct.ac.za:11427/9042 Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource] Idowu, Modupeola Florence Mbogho, Audrey J W Includes bibliographical references. An automatic speech recognition (ASR) system is a software application which recognizes human speech, processes it as input, and displays a text version of the speech as output or uses the input as commands for another application’s usage. ASR can either be speaker-dependent or speakerindependent. A speaker-dependent ASR system requires every user to perform training before its usage, while speaker-independent ASR requires no prior training before usage. The technology of ASR is based on identification and comparison of sound patterns; these sound patterns are combinations of the smallest units of sound called phonemes. The phonemes constitute fragments of uttered sounds in speech and their combination gives meaningful sound patterns in languages. There exists a set of phonemes for every language group, and associated with each group is the method of pronunciation called the accent. A language group could be identified by the accent in their speech; accent is the set of pronunciation rules of a language group. Accent reflects the cultural divide of a multi cultural society with a common language such as English. Some commercially available ASR systems are designed based on the accents of the following language groups: English, French, German, Italian, Dutch, and Spanish. These language groups are European with none having any similarities with African languages and accents, (except Afrikaans and English, which, though spoken in Africa, originated from Proto-Indo-European languages). This study involved the evaluation of commercially available English ASR systems, establishing their usability and usefulness among different language groups in South Africa which use English as a common language. Of particular interest was the effect of African accents on the performance of the ASR systems. ASR technology is widely used and researched in the developed world with reported recognition accuracy of up to 99%. However, English spoken with African accents may have adverse effect on the recognition accuracy. Despite the fact that most existing ASR systems are not designed for English spoken with South Africans’ accents, one can easily purchase them over the shelf in South Africa. The systems used in this study are: 1. Nuance Dragon NaturallySpeaking, Version10.0 (NDNS). 2. Windows Speech Recognition, Windows Vista version (WSR). The result of this study indicated that accent has influence on the ASR recognition accuracy. It also indicated that users’ satisfaction was greatly affected by the recognition accuracy obtained. The results also indicated poor performance in environments where speech cannot be loud, for example, in the library. 2014-11-03T08:32:22Z 2014-11-03T08:32:22Z 2011 Master Thesis Masters MSc http://hdl.handle.net/11427/9042 eng application/pdf Department of Computer Science Faculty of Science University of Cape Town
spellingShingle Idowu, Modupeola Florence
Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource]
thesis_degree_str Master's
title Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource]
title_full Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource]
title_fullStr Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource]
title_full_unstemmed Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource]
title_short Evaluation of the usability and usefulness of automatic speech recognition among users in South Africa [electronic resource]
title_sort evaluation of the usability and usefulness of automatic speech recognition among users in south africa electronic resource
url http://hdl.handle.net/11427/9042
work_keys_str_mv AT idowumodupeolaflorence evaluationoftheusabilityandusefulnessofautomaticspeechrecognitionamongusersinsouthafricaelectronicresource