
Prosodic Data-Driven Modelling of Narrative Style in FESTIVAL TTS

Tesser F • Cosi P • Tisato G. • DRIOLI, Carlo
2004
  • conference object

Abstract
A general data-driven procedure for creating new prosodic modules for the Italian FESTIVAL Text-To-Speech (TTS) synthesizer is described. These modules are based on “Classification and Regression Trees” (CART) theory. The prosodic factors considered are duration, pitch, and loudness. Loudness control has been implemented as an extension to the MBROLA diphone concatenative synthesizer. The prosodic models were trained on two speech corpora with different speaking styles, and the effectiveness of the CART-based prosody was assessed with a set of evaluation tests.
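For readers unfamiliar with CART, the following is a minimal sketch of a regression tree that predicts a segment duration from a few binary context features. It is not the authors' implementation: the feature names (`is_vowel`, `is_stressed`, `is_phrase_final`), the toy durations, and all function names are hypothetical, and the abstract does not describe the actual feature set or training tooling used in the paper.

```python
from statistics import mean

def sse(values):
    """Sum of squared errors around the mean (0 for an empty list)."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)

def best_split(rows, targets):
    """Return the (feature, threshold) pair that most reduces SSE, or None."""
    best, best_cost = None, sse(targets)
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, targets) if r[f] <= t]
            right = [y for r, y in zip(rows, targets) if r[f] > t]
            if not left or not right:
                continue
            cost = sse(left) + sse(right)
            if cost < best_cost - 1e-12:
                best_cost, best = cost, (f, t)
    return best

def build_tree(rows, targets, min_size=2):
    """Grow a regression tree; leaves hold the mean target of their samples."""
    split = best_split(rows, targets) if len(rows) >= min_size else None
    if split is None:
        return mean(targets)
    f, t = split
    left = [(r, y) for r, y in zip(rows, targets) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, targets) if r[f] > t]
    return (
        f,
        t,
        build_tree([r for r, _ in left], [y for _, y in left], min_size),
        build_tree([r for r, _ in right], [y for _, y in right], min_size),
    )

def predict(tree, row):
    """Walk from the root to a leaf and return the leaf value."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

# Hypothetical toy data: (is_vowel, is_stressed, is_phrase_final) -> duration in ms.
rows = [(1, 1, 0), (1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 1, 1), (1, 0, 1)]
durations = [110, 80, 60, 75, 150, 120]

tree = build_tree(rows, durations)
print(predict(tree, (1, 1, 1)))  # stressed, phrase-final vowel -> 150
```

The same tree-growing idea applies to the other prosodic targets named in the abstract (pitch and loudness) by swapping the target values; a real system would use a much richer feature set and a stopping criterion that avoids overfitting.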
Archive
http://hdl.handle.net/11390/696283
http://www.istc.cnr.it/doc/75a_2005060715358t_ft-SSW2004.pdf
Rights
metadata only access
Subjects
  • Speech synthesis

Date acquired
Apr 19, 2024