How effective is unsupervised data collection for children's speech recognition? - Robotics Institute Carnegie Mellon University

How effective is unsupervised data collection for children’s speech recognition?

Gregory Aist, Peggy Chan, X. D. Huang, L. Jiang, Rebecca Kennedy, DeWitt Talmadge Latimer, Jack Mostow, and Calvin Yeung
Conference Paper, Proceedings of 5th International Conference on Spoken Language Processing (ICSLP '98), December, 1998

Abstract

Children present a unique challenge to automatic speech recognition. Today's state-of-the-art speech recognition systems still have problems handling children's speech because acoustic models are trained on data collected from adult speech. In this paper we describe an inexpensive way to mend this problem. We collected children's speech when they interact with an automated reading tutor. These data are subsequently transcribed by a speech recognition system and automatically filtered. We studied how to use these automatically collected data to improve children's speech recognition system's performance. Experiments indicate that automatically collected data can reduce the error rate significantly on children's speech.

BibTeX

@conference{Aist-1998-14827,
author = {Gregory Aist and Peggy Chan and X. D. Huang and L. Jiang and Rebecca Kennedy and DeWitt Talmadge Latimer and Jack Mostow and Calvin Yeung},
title = {How effective is unsupervised data collection for children's speech recognition?},
booktitle = {Proceedings of 5th International Conference on Spoken Language Processing (ICSLP '98)},
year = {1998},
month = {December},
}