Integrating Different Learning Approaches into a Multilingual Spoken Translation System

P. Geutner, B. Suhm, T. Kemp, A. Lavie, A. E. McNair, I. Rogina, T. Sloboda, Wayne Ward, M. Woszczcyna, and Alex Waibel

Conference Paper, Proceedings of 14th International Joint Conference on Artificial Intelligence (IJCAI '95), pp. 117 - 131, August, 1995

View Publication

Abstract

Building multilingual spoken language translation systems requires knowledge about both acoustic models and language models of each language to be translated. Our multilingual translation system JANUS-2 is able to translate English and German spoken input into either English, German, Spanish, Japanese or Korean output. Getting optimal acoustic and language models as well as developing adequate dictionaries for all these languages requires a lot of hand-tuning and is time-consuming and labor intensive. In this paper we will present learning techniques that improve acoustic models by automatically adapting codebook sizes, a learning algorithm that increases and adapts phonetic dictionaries for the recognition process and also a statistically based language model with some linguistic knowledge that increases recognition performance. To ensure a robust translation system, semantic rather than syntactic analysis is done. Concept based speech translation and a connectionist parser that learns to parse into feature structures are introduced. Furthermore, different repair mechanisms to recover from recognition errors will be described.

Notes
Our German recognition engine, developed at the University of Karlsruhe, is part of the VERBMOBIL project and VERBMOBIL systems developed under BMBF funding. The Spanish speech translation module has been developed at Carnegie Mellon University under project ENTHUSIAST funded by the US Government. Other components are under development in collaboration with partners of the C-STAR Consortium.

BibTeX

@conference{Geutner-1995-13949,
author = {P. Geutner and B. Suhm and T. Kemp and A. Lavie and and A. E. McNair and I. Rogina and T. Sloboda and Wayne Ward and M. Woszczcyna and Alex Waibel},
title = {Integrating Different Learning Approaches into a Multilingual Spoken Translation System},
booktitle = {Proceedings of 14th International Joint Conference on Artificial Intelligence (IJCAI '95)},
year = {1995},
month = {August},
pages = {117 - 131},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.