Experiments with LVCSR Based Language Identification

Tanja Schultz, Ivica Rogina, and Alex Waibel

Conference Paper, Proceedings of Speech Research Symposium (SRS '95), pp. 89-94, June 1995

Abstract

Automatic language identification (LID) is an important problem in building multilingual speech recognition and understanding systems. We have developed a front-end LID module based on large-vocabulary continuous speech recognition (LVCSR) that identifies English, German, and Spanish for use in spontaneous speech-to-speech translation. We studied the contribution of different levels of knowledge to identifying a language, i.e., phonetic, phonotactic, lexical, and syntactic-semantic knowledge. A comparison of LID systems using different levels of these knowledge sources is presented. We showed that incorporating lexical and linguistic knowledge reduces the language identification error by up to 50%.

BibTeX

@conference{Schultz-1995-13908,
author = {Tanja Schultz and Ivica Rogina and Alex Waibel},
title = {Experiments with LVCSR Based Language Identification},
booktitle = {Proceedings of Speech Research Symposium (SRS '95)},
year = {1995},
month = {June},
pages = {89--94},
}
