Speaker-independent Connected Letter Recognition with a Multi-state Time Delay Neural Network
Conference Paper, Proceedings of 3rd European Conference on Speech Communication and Technology (EUROSPEECH '93), pp. 1481 - 1484, September, 1993
Abstract
We present a Multi-State Time Delay Neural Network (MS-TDNN) for speaker-independent, connected letter recognition. Our MS-TDNN achieves 98.5/92.0% word accuracy on speaker dependent/independent English letter tasks. In this paper we will summarize several techniques to improve (a) continuous recognition performance, such as sentence level training, and (b) phonetic modeling, such as network architectures with ``internal speaker models'', allowing for ``tuning-in'' to new speakers. We also present results on our large and still growing new German Letter data base, containing over 40.000 letters continuously spelled by 55 speakers.
BibTeX
@conference{Hild-1993-13567,author = {Hermann Hild and Alex Waibel},
title = {Speaker-independent Connected Letter Recognition with a Multi-state Time Delay Neural Network},
booktitle = {Proceedings of 3rd European Conference on Speech Communication and Technology (EUROSPEECH '93)},
year = {1993},
month = {September},
pages = {1481 - 1484},
}
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.