Fast Bootstrapping of LVCSR Systems with Multilingual Phoneme Sets - Robotics Institute Carnegie Mellon University

Fast Bootstrapping of LVCSR Systems with Multilingual Phoneme Sets

Tanja Schultz and Alex Waibel
Conference Paper, Proceedings of 5th European Conference on Speech Communication and Technology (EUROSPEECH '97), Vol. 1, pp. 371 - 373, September, 1997

Abstract

In this paper we described an efficient method to bootstrap continuously spoken, large vocabulary speech recognition systems by multilingual phoneme sets. To evaluate this techniques we collected the multilingual database GlobalPhone which currently consists of 9 different languages. A multilingual recognizer (MULTI) based on the four languages German, English, Japanese and Spanish was developed to serve as a source system. Likewise this system is very useful for language identification and achieves 100% language identification rate. Based on the MULTI system we evaluated our bootstrap technique on such completely different languages as Chinese, Croatian, and Turkish.

BibTeX

@conference{Schultz-1997-16433,
author = {Tanja Schultz and Alex Waibel},
title = {Fast Bootstrapping of LVCSR Systems with Multilingual Phoneme Sets},
booktitle = {Proceedings of 5th European Conference on Speech Communication and Technology (EUROSPEECH '97)},
year = {1997},
month = {September},
volume = {1},
pages = {371 - 373},
}