Translation of Conversational Speech with JANUS-II - Robotics Institute Carnegie Mellon University

Translation of Conversational Speech with JANUS-II

Alon Lavie, Alex Waibel, Lori Levin, Donna Gates, Marsal Gavalda, Torsten Zeppenfeld, Puming Zhan, and Oren Glickman
Conference Paper, Proceedings of 4th International Conference on Spoken Language Processing (ICSLP '96), Vol. 4, pp. 2375 - 2378, October, 1996

Abstract

In this paper we investigate the possibility of translating continuous spoken conversations in a cross-talk environment. This is a task known to be difficult for human translators due to several factors. It is characterized by rapid and even overlapping turn-taking, a high degree of co-articulation, and fragmentary language. We describe experiments using both push-to-talk as well as cross-talk recording conditions. Our results indicate that conversational speech recognition and translation is possible, even in a free crosstalk environment. To date, our system has achieved performances of over 80% acceptable translations on transcribed input, and over 70% acceptable translations on speech input recognized with a 70-80% word accuracy. The system's performance on spontaneous conversations recorded in a cross-talk environment is shown to be as good and even slightly superior to the simpler and easier push-to-talk scenario.

BibTeX

@conference{Lavie-1996-14224,
author = {Alon Lavie and Alex Waibel and Lori Levin and Donna Gates and Marsal Gavalda and Torsten Zeppenfeld and Puming Zhan and Oren Glickman},
title = {Translation of Conversational Speech with JANUS-II},
booktitle = {Proceedings of 4th International Conference on Spoken Language Processing (ICSLP '96)},
year = {1996},
month = {October},
volume = {4},
pages = {2375 - 2378},
}