Modeling the F0 curve for Speech Synthesis: Difference between revisions
From HLT@INESC-ID
No edit summary |
No edit summary |
||
Line 15: | Line 15: | ||
== Speaker == | == Speaker == | ||
* Gopala K Anumanchipalli, Carnegie Mellon University, USA and INESC-ID Lisboa | * Gopala K Anumanchipalli, Carnegie Mellon University, USA and INESC-ID Lisboa, IST | ||
== Abstract == | == Abstract == |
Revision as of 10:40, 1 September 2010
Gopala Krishna Anumanchipalli |
![]() |
Addresses: www mail |
Date
- 15:00, Friday, September 3rd, 2010
- Room 336
Speaker
- Gopala K Anumanchipalli, Carnegie Mellon University, USA and INESC-ID Lisboa, IST
Abstract
In this talk I will review some approaches used for modeling the Fundamental Frequency, the F0 contour. I will detail the F0 modeling strategy currently used in Clustergen, CMU's statistical parametric Synthesis framework. I will describe our recent work attempting to improve the baseline modeling strategy by incorporation of longer range (Syllable and Phrase) features into the F0 model. We use the TILT model of Intonation for this work, and I will briefly describe the tilt framework and mention the alternative frameworks used in Intonation modeling.
Note: This seminar will be held in English.