In this paper, we propose an A*-admissible key-phrase spotting framework, which needs little domain knowledge and is capable of extracting salient key-phrase fragments from an input utterance in real-time. There are two key features in our approach. Firstly, the acoustic models and the search framework are specially designed such that very high degree vocabulary flexibility can be achieved for any desired application tasks. Secondly, the search framework uses an efficient two-pass A* search to generate N-best key-phrase candidates and then several sub-syllable level verification functions are properly weighted and used to further improve the recognition accuracy. Experimental results show that the A*-admissible key-phrase spotting with sub-word level utterance method outperforms the baseline methods used in common approaches.
|出版狀態||已發佈 - 1998|
|事件||5th International Conference on Spoken Language Processing, ICSLP 1998 - Sydney, 澳大利亚|
持續時間: 1998 11月 30 → 1998 12月 4
|會議||5th International Conference on Spoken Language Processing, ICSLP 1998|
|期間||1998/11/30 → 1998/12/04|
ASJC Scopus subject areas