Speech Science Forum: Dr Rob Clark
Title: Making Text-to-Speech more engaging.
Room: 118, Chandler House
This talk discusses how to make TTS more engaging by better dealing with the prosodic variation found in TTS training data.
There are many potential ways of saying the same piece of text, and simple models tend to generate the average way of saying something rather than the most appropriate way for a given context.
We consider the implications of choosing one way over the other and we also consider how to evaluate how appropriate what we generate is.
Dr Rob Clark
Google