Integrating Different Learning Approaches into a Multilingual Spoken Translation System

P. Geutner, B. Suhm, T. Kemp, A. Lavie, A. E. McNair, I. Rogina, T. Sloboda, Wayne Ward, M. Woszczyna, and Alex Waibel
Conference Paper, Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI '95), pp. 117-131, August 1995

Abstract

Building multilingual spoken language translation systems requires knowledge of both the acoustic models and the language models of each language to be translated. Our multilingual translation system, JANUS-2, translates English and German spoken input into English, German, Spanish, Japanese, or Korean output. Obtaining optimal acoustic and language models and developing adequate dictionaries for all of these languages requires extensive hand-tuning, which is time-consuming and labor-intensive. In this paper we present learning techniques that improve acoustic models by automatically adapting codebook sizes, a learning algorithm that extends and adapts phonetic dictionaries for the recognition process, and a statistically based language model, augmented with some linguistic knowledge, that increases recognition performance. To ensure a robust translation system, semantic rather than syntactic analysis is performed. Concept-based speech translation and a connectionist parser that learns to parse into feature structures are introduced. Furthermore, different repair mechanisms for recovering from recognition errors are described.

Notes
Our German recognition engine, developed at the University of Karlsruhe, is part of the VERBMOBIL project and of the VERBMOBIL systems developed under BMBF funding. The Spanish speech translation module was developed at Carnegie Mellon University under project ENTHUSIAST, funded by the US Government. Other components are under development in collaboration with partners of the C-STAR Consortium.

BibTeX

@conference{Geutner-1995-13949,
author = {P. Geutner and B. Suhm and T. Kemp and A. Lavie and A. E. McNair and I. Rogina and T. Sloboda and Wayne Ward and M. Woszczyna and Alex Waibel},
title = {Integrating Different Learning Approaches into a Multilingual Spoken Translation System},
booktitle = {Proceedings of 14th International Joint Conference on Artificial Intelligence (IJCAI '95)},
year = {1995},
month = {August},
pages = {117--131},
}