Name: Improving Prosody through Analysis by Synthesis
Start: 2016-12-09T09:30:00-05:00
End: 2016-12-09T00:00:00-05:00

Kevin A. Lenzo Carnegie Mellon University

Friday, December 9
9:30 am to 12:00 am

Improving Prosody through Analysis by Synthesis

Event Location: GHC 6501

Abstract: An iterative model-based method is proposed for improving linguistic structure, segmentation, and prosodic annotations that correspond to the delivery of each utterance as regularized across the data. For each iteration, the training utterances are resynthized according to the existing symbolic annotation. Values of various features and subgraph structures are “twiddled:” each is perturbed based on the features and constraints of the model. Twiddled utterances are evaluated using an objective function appropriate to the type of perturbation and compared with the unmodified, resynthesized utterance. The instance with least error is assigned as the current annotation, and the entire process is repeated. At each iteration, the model is re-estimated, and the distributions and annotations regularize across the corpus. As a result, the annotations have more accurate and effective distributions, which leads to improved control and expressiveness given the features of the model.

Committee:Alan W. Black, Chair

Jack Mostow

Alex Rudnicky

Julia Hirschberg, Columbia University

PhD Thesis Defense

December

Event Navigation

PhD Thesis Defense

December

Share This Event!

Event Navigation