Towards Universal Speech Recognition - Robotics Institute Carnegie Mellon University

Towards Universal Speech Recognition

Zhirong Wang, Umut Topkara, T. Schultz, and Alex Waibel
Conference Paper, Proceedings of 4th IEEE International Conference on Multimodal Interfaces (ICMI '02), pp. 247 - 252, October, 2002

Abstract

The increasing interest in multilingual applications like speech-to-speech translation systems is accompanied by the need for speech recognition front-ends in many languages that can also handle multiple input languages at the same time. In this paper we describe a universal speech recognition system that fulfills such needs. It is trained by sharing speech and text data across languages and thus reduces the number of parameters and overhead significantly at the cost of only slight accuracy loss. The final recognizer eases the burden of maintaining several monolingual engines, makes dedicated language identification obsolete and allows for code-switching within an utterance. To achieve these goals we developed new methods for constructing multilingual acoustic models and multilingual n-gram language models.

BibTeX

@conference{Wang-2002-8572,
author = {Zhirong Wang and Umut Topkara and T. Schultz and Alex Waibel},
title = {Towards Universal Speech Recognition},
booktitle = {Proceedings of 4th IEEE International Conference on Multimodal Interfaces (ICMI '02)},
year = {2002},
month = {October},
pages = {247 - 252},
}