Improving Speech Recognition by Enhancing Accent Discrimination

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

With globalization, English has become an increasingly important global lingua franca. However, the diversity of English accents-shaped by native languages, regions, and cultures-presents significant challenges for speech recognition systems. This paper proposes a method to enhance the recognition of accented English by improving accent discrimination. We introduce a technique that integrates accent classification into the speech recognition model, allowing for better identification of various accents. Our results demonstrate that this approach reduces error rates and provides a detailed analysis of accent features across the model's layers, thereby improving the recognition of diverse accents.

Original languageEnglish
Title of host publication2024 27th Conference on the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2024 - Proceedings
EditorsMing-Hsiang Su, Jui-Feng Yeh, Yuan-Fu Liao, Chi-Chun Lee, Yu Taso
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331506032
DOIs
Publication statusPublished - 2024
Event27th Conference on the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2024 - Hsinchu, Taiwan
Duration: 2024 Oct 172024 Oct 19

Publication series

Name2024 27th Conference on the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2024 - Proceedings

Conference

Conference27th Conference on the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2024
Country/TerritoryTaiwan
CityHsinchu
Period2024/10/172024/10/19

Keywords

  • Accent
  • Data Visualization
  • Model Probing
  • Multi-Task Learning
  • Speech Recognition

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Human-Computer Interaction
  • Information Systems
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Library and Information Sciences
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'Improving Speech Recognition by Enhancing Accent Discrimination'. Together they form a unique fingerprint.

Cite this