Fast Bootstrapping of LVCSR Systems with Multilingual Phoneme Sets

Tanja Schultz and Alex Waibel

Conference Paper, Proceedings of 5th European Conference on Speech Communication and Technology (EUROSPEECH '97), Vol. 1, pp. 371 - 373, September, 1997

View Publication

Abstract

In this paper we described an efficient method to bootstrap continuously spoken, large vocabulary speech recognition systems by multilingual phoneme sets. To evaluate this techniques we collected the multilingual database GlobalPhone which currently consists of 9 different languages. A multilingual recognizer (MULTI) based on the four languages German, English, Japanese and Spanish was developed to serve as a source system. Likewise this system is very useful for language identification and achieves 100% language identification rate. Based on the MULTI system we evaluated our bootstrap technique on such completely different languages as Chinese, Croatian, and Turkish.

BibTeX

@conference{Schultz-1997-16433,
author = {Tanja Schultz and Alex Waibel},
title = {Fast Bootstrapping of LVCSR Systems with Multilingual Phoneme Sets},
booktitle = {Proceedings of 5th European Conference on Speech Communication and Technology (EUROSPEECH '97)},
year = {1997},
month = {September},
volume = {1},
pages = {371 - 373},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.