Browse by Degree programme
B
Baumann, Judith (2022) Improving the naturalness of an end-to-end text-to-speech system with information structure. Master thesis, Voice Technology (VT).
Bian, Qianqian (2025) Character Identity and Emotion-Aware TTS for Otome Games. Master thesis, Voice Technology (VT).
Bălan, Dragoș Alexandru (2023) Improving the State-of-the-Art Frisian ASR by fine-tuning Large-Scale Cross-Lingual Pre-Trained Models. Master thesis, Voice Technology (VT).
C
Cai, Youyang (2024) Multimodal sarcasm recognition based on different feature fusion methods. Master thesis, Voice Technology (VT).
Chen, Hang (2025) Layer-wise Cross-Lingual Depression Detection from Speech: A HuBERT-Based Study on English and Mandarin. Master thesis, Voice Technology (VT).
Chen, Shuyi (2025) Enhancing Surprise Perception in TTS through Keyword-Level Prosody Control. Master thesis, Voice Technology (VT).
Chen, Yitong (2025) Speech Emotion Recognition via Multimodal CNN-LSTM Architectures. Master thesis, Voice Technology (VT).
D
Deng, Yaling (2024) Improving the Performance of Code-Switching Recognition Using Whisper. Master thesis, Voice Technology (VT).
Dimitrova, Iva (2023) Speaker Profiling of Phonated and Whispered Speech. Master thesis, Voice Technology (VT).
Ding, Shenghuan (2024) Comparative Study of Low Resource Language Manchu Speech Synthesis: Transfer Learning from Spanish vs. Mandarin Chinese. Master thesis, Voice Technology (VT).
Dong, Jiashu (2025) Singing Voice Synthesis in Your Language: Cross-Lingual Transfer with Limited Data Using Diffusion Models. Master thesis, Voice Technology (VT).
F
Faste, Sarah (2022) WAV2VEC 2.0 FOR IRISH ASR: A MULTILINGUAL APPROACH TO UNDER-RESOURCED LANGUAGES. Master thesis, Voice Technology (VT).
Feenstra, Lian (2023) Aladdin, Alla dien, Allendien, please evaluate the performance of Whisper on Dutch dysarthric speech. Master thesis, Voice Technology (VT).
G
Galarneau, Jocomin (2024) Synthesizing Anger: Enhancing Emotional Speech from Text in Novel Dialogues. Master thesis, Voice Technology (VT).
Galjaard, Ellemijn (2023) A Self-Supervised Approach to Speech Enhancement in Noisy Climbing Gym Environments. Master thesis, Voice Technology (VT).
H
He, Zhizhi (2025) Cross-Cultural Perception of Emotional Text-to-Speech: A Pilot Study on Mandarin. Master thesis, Voice Technology (VT).
Herygers, Aaricia (2022) Spraakherkenning, wa is da? — Bias in Flemish Speech Recognition. Master thesis, Voice Technology (VT).
Hongell, Brandi (2024) Assessing the relationship between stimulus duration and Mean Opinion Score for speech synthesis evaluation. Master thesis, Voice Technology (VT).
Huang, Qiyan (2025) Towards Fine-Grained Emotional Modulation in FastSpeech 2 with Hierarchical Emotion Distributions. Master thesis, Voice Technology (VT).
I
Ivnova, Victoria (2023) Synthesising Proto-Indo-European using Phonological Features for Zero-Shot Synthesis. Master thesis, Voice Technology (VT).
J
Jiang, Weihao (2024) Synthesis of sarcastic speech: Research on adjusting pitch and energy at keyword level using FastSpeech2. Master thesis, Voice Technology (VT).
Jingsi, Huang (2024) Dutch Speech Restoration Research based on Miipher. Master thesis, Voice Technology (VT).
K
Kang, Ruoxin (2025) Streaming Speech Recognition for Smart Glasses: A Fine-tuning Approach Based on Pre-trained FastConformer. Master thesis, Voice Technology (VT).
Kokowski, Jan (2025) F0-Based Masking Policies for Self-Supervised Whispered Speech Recognition. Master thesis, Voice Technology (VT).
L
LI, Xinchi (2025) From Zero-Shot to Fine-Tuned: Linguistic Error Analysis in Frisian ASR with Whisper. Master thesis, Voice Technology (VT).
Lai, Weixi (2024) Parameter-Efficient Fine-Tuning for Sarcasm Detection in Speech Using the Self-Supervised Pre-Trained Model WavLM. Master thesis, Voice Technology (VT).
Laméris, Cárolos (2024) Topological Featurization of Speech Data for Speech Recognition. Master thesis, Voice Technology (VT).
Lankheet, Amber (2025) A Cross-Lingual Approach to Dutch Dysarthric Speech Recognition. Master thesis, Voice Technology (VT).
Lei, Yi (2024) Optimizing Text-to-Speech: Investigating Training Data Volume for Human-Level Synthesis with Fastspeech2. Master thesis, Voice Technology (VT).
Lei, Yining (2024) Chinese-speaking English learners' Vowel Pronunciation Error Detection. Master thesis, Voice Technology (VT).
Leijenhorst, Elja (2023) Fine-tuning ASR to specific noise environments: noise robustness in a climbing gym. Master thesis, Voice Technology (VT).
Leivaditi, Spyretta (2023) The Role of Speech Elicitation Methods and Disease Factors in Dysarthric ASR System Development. Master thesis, Voice Technology (VT).
Li, Chenyu (2024) Exploring the Potential of Accent Conversion Techniques to Enhance Fairness in Language Assessment. Master thesis, Voice Technology (VT).
Li, Qing, Q (2024) Fine-tuning Cantonese based on Wav2vec 2.0 XLRS model that pretrained on Mandarin Chinese to improve ASR performance. Master thesis, Voice Technology (VT).
Li, ZiYi (2025) Transfer Learning for Sichuan Dialect Automatic Speech Recognition Based on pretrained Wav2vec 2.0 Model. Master thesis, Voice Technology (VT).
Liang, Hao-Wei (2025) Cross-lingual Voice Conversion and Its Prosodic Impact on Perceived Naturalness. Master thesis, Voice Technology (VT).
Lin, Chenyi (2024) Manipulating Acoustic Correlates for Vocal Persona Transition: From Neutral to Friendly. Master thesis, Voice Technology (VT).
Lin, Xiaoling (2024) Identifying ASMR-Style Audio: Development of a Predictive Classification Model. Master thesis, Voice Technology (VT).
Liu, Xueying (2024) Parameter-Efficient Fine-Tuning on Multilingual ASR Whisper Model for Frisian. Master thesis, Voice Technology (VT).
Luks, BH (2022) End-to-End ASR with Binarized Neural Networks. Master thesis, Voice Technology (VT).
M
Marchenko, Igor (2024) Phone Masking Augmentation for Automatic Recognition of Whispered Speech. Master thesis, Voice Technology (VT).
Matsushima, Tatsunari (2022) Dutch Dysarthric Speech Recognition: Applying Self-Supervised Learning to Overcome the Data Scarcity Issue. Master thesis, Voice Technology (VT).
Mei, Zhengkun (2023) Chinese Multi-Model Sarcasm Detection Based on Contrastive Attention Residual Late Fusion. Master thesis, Voice Technology (VT).
Meng, Wenjun (2024) Beyond Adult Speech: Exploring SepFormer’s Performance in Child Speech Separation. Master thesis, Voice Technology (VT).
Monen, Janay (2022) Automatic Detection and Severity Estimation for Oral Cancer Speech. Master thesis, Voice Technology (VT).
N
Naazeri, Hiva (2025) Minimal Acoustic Markers for Age Prediction in Human Voice: A Machine Learning Approach. Master thesis, Voice Technology (VT).
Narang, Mohammadhossein (2025) Can Multimodal Transformers Beat LLMs? A Cross-Attention Approach to Sarcasm Detection in Social Media Videos. Master thesis, Voice Technology (VT).
Nguyen, Le Minh (2022) Improving Luxembourgish Speech Recognition with Cross-Lingual Speech Representations. Master thesis, Voice Technology (VT).
O
Ouyang, Yanpei (2024) Assessing Knowledge-Distillation Based Compression of Whisper Model for Frisian ASR. Master thesis, Voice Technology (VT).
Q
Qiu, Yan (2025) A Trial Toward Real-Time Vision-to-Speech Systems: An Exploratory Study of BLIP and FastSpeech 2 for Assistive Applications and Latency-Precision Trade-Offs. Master thesis, Voice Technology (VT).
Qu, Layla (2024) Data Augmentation and VAE-GAN for Few-Shot Singing Voice Cloning. Master thesis, Voice Technology (VT).
S
Shekoufandeh, Golshid (2023) Evaluation of wav2vec 2.0 Speech Recognition for the Elderly Frisian Population. Master thesis, Voice Technology (VT).
Shen, Gaofei (2022) Does Where Words Come From Matter? Leveraging Self-supervised Models for Multilingual ASR and LID. Master thesis, Voice Technology (VT).
Shi, Erin (2024) Multimodal Sarcasm Detection Using BERT, TimesFormer, and Wav2Vec 2.0 with MUStARD++. Master thesis, Voice Technology (VT).
Shi, Jingwen (2024) Enhanced Multimodal Emotion Recognition using GRU and Self-Attention Mechanisms: Techniques and Applications. Master thesis, Voice Technology (VT).
Shin, Soogyeong (2024) Enhanced Disease Classification in Respiratory Sounds: A Transfer Learning Approach Utilizing ICBHI and Coswara Datasets. Master thesis, Voice Technology (VT).
Siu, Stella (2025) Speaking Volumes: How Acoustic Features Reveal Speaker Height. Master thesis, Voice Technology (VT).
Sixing, Mi (2025) Speaker Identification in Mandarin Conference Speech via Transfer Learning with wav2vec 2.0. Master thesis, Voice Technology (VT).
Spijkerman, Marjolein (2022) Using voice conversion and time-stretching to enhance the quality of dysarthric speech for automatic speech recognition. Master thesis, Voice Technology (VT).
Su, Cantao (2024) Enhancing English Dysarthric Speech Recognition with Age-Matched Healthy Speech: A Fine-Tuning Approach Using wav2vec 2.0. Master thesis, Voice Technology (VT).
Sun, Shiran (2025) A Comparative Evaluation of Closed- and Open-Vocabulary ASR Systems for the Recognition of Dutch Healthcare Terms. Master thesis, Voice Technology (VT).
T
Tepei, Maria (2024) Addressing ASR Bias Against Foreign-Accented Dutch: A Synthetic Data Approach. Master thesis, Voice Technology (VT).
V
Vanni, Alice (2024) Age-controllable speech synthesis: A pilot study on English. Master thesis, Voice Technology (VT).
van Heerwaarden, Floor M (2023) Hubert wins the lottery PARP(-P) pruning the HuBERT model for downstream tasks. Master thesis, Voice Technology (VT).
W
Wang, Yinqiu (2024) Code-switching speech synthesis for Mandarin-English using FastSpeech2: A unified IPA-based approach. Master thesis, Voice Technology (VT).
Wang, Yinzi (2025) A Lightweight Multimodal Framework for Context-Aware Punchline Detection. Master thesis, Voice Technology (VT).
Wang, Yiqiu (2023) THE EVALUATION OF THE FEMININE LEVEL OF SPEECH. Master thesis, Voice Technology (VT).
Weggeman, Sjors (2023) The relevance of using authentic laughter data in natural laughter synthesis: A case study on LaughNet. Master thesis, Voice Technology (VT).
Wei, Yilan (2024) An Innovative Method for Multi-Effect Speech Synthesis through Training File Modification. Master thesis, Voice Technology (VT).
Wildenburg, Kirsten (2022) Automatic speech recognition and error analyses of Dutch oral cancer speech. Master thesis, Voice Technology (VT).
Willis, Leslie (2023) Exploring Automatic Speech Recognition for Podcast Audio: Fine-Tuning HuBERT on the Spotify Podcast Dataset. Master thesis, Voice Technology (VT).
Y
Yu, Hantao (2025) Fine-Tuning Whisper for Dutch-Speaking Autistic Children: Adapting ASR to Atypical Speech in Low-Resource Settings. Master thesis, Voice Technology (VT).
Yue, Jingxuan (2024) Identifying Acoustic Features that Enhance TTS Voice Intelligibility and Naturalness in Noisy Environments. Master thesis, Voice Technology (VT).
Z
Zhang, Meiling (2025) An Exploration of Cross-Lingual Model Transfer in Multimodal Sarcasm Detection. Master thesis, Voice Technology (VT).
Zhang, Tiantian (2025) Exploratory Analysis of Correlation between Earnings Call Acoustic Features and Credit Ratings: A FinBERT Validation Approach. Master thesis, Voice Technology (VT).
Zhang, Ting (2024) CMGAN-Based Speech Enhancement for Automotive Environments: Targeted Noise Reduction. Master thesis, Voice Technology (VT).
Zhang, Ziyun (2025) Personalized Speech Enhancement Using Time-Domain Convolutional Networks. Master thesis, Voice Technology (VT).
Zhao, Guanrong (2022) Road to Deep Learning-driven Chinese Traditional Verbal Art Synthesis. Master thesis, Voice Technology (VT).
Zheng, Siqi (2024) End-to-End Speech Emotion Recognition based on CNN-Transformer. Master thesis, Voice Technology (VT).
Zhou, Wangyiyao (2024) From Tolkien’s Novel to Synthetic Speech: Developing TTS Systems for Quenya. Master thesis, Voice Technology (VT).
Zhu, Dongwen (2024) Enhancing Automatic Speech Recognition in Vehicular Environments: A Noise-Specific Fine-Tuning Approach. Master thesis, Voice Technology (VT).
Zhu, Qiye (2025) Zero-Shot Voice Cloning with Minimal Data: Impact of Reference Duration on Long-Form Speech Synthesis. Master thesis, Voice Technology (VT).
Zwart, Tessa (2023) Sarcastic speech synthesis in Dutch using voice-transformation. Master thesis, Voice Technology (VT).
Ö
Özyilmaz, Ömer Tarik (2024) The Effects of Fine-Tuning on the ASR Performance of Dialectal Arabic. Master thesis, Voice Technology (VT).