Towards Universal Speech Recognition

Zhirong Wang, Umut Topkara, T. Schultz, and Alex Waibel

Conference Paper, Proceedings of 4th IEEE International Conference on Multimodal Interfaces (ICMI '02), pp. 247-252, October 2002

Abstract

The increasing interest in multilingual applications like speech-to-speech translation systems is accompanied by the need for speech recognition front-ends in many languages that can also handle multiple input languages at the same time. In this paper we describe a universal speech recognition system that fulfills such needs. It is trained by sharing speech and text data across languages and thus reduces the number of parameters and overhead significantly at the cost of only slight accuracy loss. The final recognizer eases the burden of maintaining several monolingual engines, makes dedicated language identification obsolete and allows for code-switching within an utterance. To achieve these goals we developed new methods for constructing multilingual acoustic models and multilingual n-gram language models.

BibTeX

@conference{Wang-2002-8572,
author = {Zhirong Wang and Umut Topkara and T. Schultz and Alex Waibel},
title = {Towards Universal Speech Recognition},
booktitle = {Proceedings of 4th IEEE International Conference on Multimodal Interfaces (ICMI '02)},
year = {2002},
month = {October},
pages = {247--252},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.