Recognizing Emotion in Speech
Conference Paper, Proceedings of 4th International Conference on Spoken Language Processing (ICSLP '96), Vol. 3, pp. 1970 - 1973, October, 1996
Abstract
This paper explores several statistical pattern recognition techniques to classify utterances according to their emotional content. We have recorded a corpus containing emotional speech with over a 1000 utterances from different speakers. We present a new method of extracting prosodic features from speech, based on a smoothing spline approximation of the pitch contour. To make maximal use of the limited amount of training data available, we introduce a novel pattern recognition technique: majority voting of subspace specialists. Using this technique, we obtain classification performance that is close to human performance on the task.
BibTeX
@conference{Dellaert-1996-14214,author = {Frank Dellaert and and Alex Waibel},
title = {Recognizing Emotion in Speech},
booktitle = {Proceedings of 4th International Conference on Spoken Language Processing (ICSLP '96)},
year = {1996},
month = {October},
volume = {3},
pages = {1970 - 1973},
}
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.