Taiwanese Across Taiwan Corpus and Its Applications

Yuan Fu Liao, Jane S. Tsay, Peter Kang, Hui Lu Khoo, Le Kun Tan, Li Chen Chang, Un Gian Iunn, Huang Lan Su, Tsun Guan Thiann, Hak Khiam Tiun, Su Lian Liao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Taiwanese across Taiwan (TAT) corpus is the first large-scale and publicly released Taiwanese speech corpus which represents the modern Taiwanese around Taiwan. This paper briefly reviews the TAT corpus and a corresponding parallel Chinese, Hàn-Lô-Tâi-bûn, Tai-Luo and Peh-ōe-jī lexicon and demonstrate some of their potential applications including ASR, TTS and voice conversion. The corresponding pretrained ASR and TTS models, sample model usage codes and training scripts will also be released. More information could be found on the Formosa Speech in the Wild website: https: //sites. google. com/nycu.edu.tw/fsw.

Original languageEnglish
Title of host publication2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350398564
DOIs
Publication statusPublished - 2022
Event25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Hanoi, Viet Nam
Duration: 2022 Nov 242022 Nov 26

Publication series

Name2022 25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022 - Proceedings

Conference

Conference25th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2022
Country/TerritoryViet Nam
CityHanoi
Period2022/11/242022/11/26

Keywords

  • Taiwanese speech corpus
  • automatic speech recognition
  • speech synthesis
  • voice conversion

ASJC Scopus subject areas

  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Information Systems
  • Linguistics and Language
  • Information Systems and Management
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Taiwanese Across Taiwan Corpus and Its Applications'. Together they form a unique fingerprint.

Cite this