Enhancing Speech Recognition of Welsh for Older Adults Using Data Augmentation Techniques

Zhu, W (2024) Enhancing Speech Recognition of Welsh for Older Adults Using Data Augmentation Techniques. Master thesis, Voice Technology (VT).

MA-S5551390-W-Zhu.pdf

Abstract

Automatic Speech Recognition (ASR) is widely used in various applications, enhancing clarity in educational, daily, and cross-cultural interactions. While promising for older adults, ASR systems often struggle with their speech due to physiological and cognitive changes. This study addresses this challenge by fine-tuning ASR models with older adults’ speech data and employing data augmentation techniques. Focusing on Welsh, a low-resource language, the research demonstrates that fine-tuning the XLSR model reduced the word error rate (WER) from 62.19% to 57.64%. Speed perturbation with a factor of 0.9 improved results further, reducing WER to 54.30%. These results underscore the potential for enhancing ASR performance for older adults through tailored augmentation methods, contributing to more inclusive speech technology for low-resource languages.
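
For readers unfamiliar with the two quantities named in the abstract, the following is a minimal Python sketch, not the thesis's implementation. It assumes WER is computed as word-level edit distance divided by the number of reference words, and that speed perturbation resamples the waveform so that a factor of 0.9 yields slower, longer audio. The function names (wer, speed_perturb, relative_reduction) and the linear-interpolation resampler are illustrative assumptions; the abstract does not state which toolkit or resampling method was used.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i in range(1, len(ref) + 1):
        curr = [i] + [0] * len(hyp)
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            curr[j] = min(prev[j] + 1,         # deletion
                          curr[j - 1] + 1,     # insertion
                          prev[j - 1] + cost)  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def speed_perturb(samples: list[float], factor: float) -> list[float]:
    """Resample a waveform by linear interpolation (assumed method).

    factor < 1 slows the audio down (more samples), factor > 1 speeds it up.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    n_out = int(round(len(samples) / factor))
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * factor                 # position in the source signal
        lo = min(int(pos), last)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


def relative_reduction(before: float, after: float) -> float:
    """Relative WER reduction, as a fraction of the starting WER."""
    return (before - after) / before


if __name__ == "__main__":
    # WER figures reported in the abstract (percent)
    print(f"fine-tuning only:   {relative_reduction(62.19, 57.64):.1%}")
    print(f"with 0.9x speed:    {relative_reduction(62.19, 54.30):.1%}")
    print(len(speed_perturb([0.0] * 90, 0.9)))  # slowed audio has more samples
```

The linear interpolation here is chosen only to keep the sketch dependency-free; a production pipeline would more likely use a band-limited resampler from an audio library.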

Item Type: Thesis (Master)
Name supervisor: Do, T.P.
Date Deposited: 19 Sep 2025 13:38
Last Modified: 19 Sep 2025 13:38
URI: https://campus-fryslan.studenttheses.ub.rug.nl/id/eprint/508
