Flexible Transcription Alignment - Robotics Institute Carnegie Mellon University

Flexible Transcription Alignment

M. Finke and Alex Waibel
Workshop Paper, IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU '97), pp. 34 - 40, December, 1997

Abstract

Presents a set of techniques that we employed in our Janus Recognition Toolkit (JRTk) Switchboard and CallHome recognizer in order to deal with imperfections in the transcriptions: inconsistent transcription of pronunciations and contractions, as well as errors in utterance segmentations. These techniques consist of a dynamic, speaking-mode-dependent pronunciation model and a flexible utterance alignment procedure which is based on speaker-adapted models (label boosting). The idea is (a) to automatically retranscribe the training corpus based on these models and procedures, (b) to train a recognizer based on these flexible transcription graphs, and (c) to decode with a dynamic speaking-mode-dependent dictionary. The framework is successfully applied to increase the performance of our state-of-the-art JRTk Switchboard recognizer significantly.

BibTeX

@workshop{Finke-1997-16427,
author = {M. Finke and Alex Waibel},
title = {Flexible Transcription Alignment},
booktitle = {Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU '97)},
year = {1997},
month = {December},
pages = {34 - 40},
}