search query: @keyword HMM-based / total: 1
reference: 1 / 1
« previous | next »
Author: | Romero Blanco, Arturo |
Title: | Spanish Emotional Speech Synthesis |
Publication type: | Final Project work |
Publication year: | 2014 |
Pages: | viii + 38 s. + liitt. 12 Language: eng |
Department/School: | Sähkötekniikan korkeakoulu |
Degree programme: | Tietoliikennetekniikan koulutusohjelma |
Main subject: | Signaalinkäsittely (S3013) |
Supervisor: | Alku, Paavo |
Instructor: | Raitio, Tuomo |
Electronic version URL: | http://urn.fi/URN:NBN:fi:aalto-201405131804 |
OEVS: | Electronic archive copy is available via Aalto Thesis Database.
Instructions Reading digital theses in the closed network of the Aalto University Harald Herlin Learning CentreIn the closed network of Learning Centre you can read digital and digitized theses not available in the open network. The Learning Centre contact details and opening hours: https://learningcentre.aalto.fi/en/harald-herlin-learning-centre/ You can read theses on the Learning Centre customer computers, which are available on all floors.
Logging on to the customer computers
Opening a thesis
Reading the thesis
Printing the thesis
|
Location: | P1 Ark Aalto 1054 | Archive |
Keywords: | emotional speech synthesis synthetic speech vocoder HMM-based GlottHMM STRAIGHT |
Abstract (eng): | In this project a text-to-speech (TTS) HMM-based speech system (HTS) has been used to create emotional synthetic speech in Spanish. Nowadays the synthetic voices have high quality, but this is not enough, they must be able to capture the natural expressiveness of the human speech. Giving this expressiveness to the synthetic voices will lead to a much more natural voice, that is the goal of these systems. To achieve this, both male and female voices will be used and two different techniques will be applied: dependent models and average voice models with adaptation. In this TTS system diffeerent vocoders can be used. For this project GlottHMM has been used and then three perceptual test have been carried out to compare it with STRAIGHT vocoder. The results of the perceptual tests shows that STRAIGHT is very robust and that GlottHMM is not yet at its level regarding the emotional speech synthesis. |
ED: | 2014-05-18 |
INSSI record number: 49013
+ add basket
« previous | next »
INSSI