Language-independent acoustic cloning of HTS voices: a preliminary study

TitleLanguage-independent acoustic cloning of HTS voices: a preliminary study
Publication TypeConference Proceedings
Year of Publication2016
AuthorsMagariños, C, Erro, D, Rodríguez Banga, E
Conference NameICASSP
Pagination5615-5619
Date Published03/2016
Conference LocationShanghai, China
AbstractThis paper presents a new method for cross-lingual speaker adaptation in the framework of HMM-based speech synthesis. Taking two HTS voice models as input, one for the desired language and another for the aimed speaker identity, it yields a third model that produces speech in the target language while sounding like the target speaker. The method operates at segmental level (spectral information and average fundamental frequency) and does not require any phonetic or linguistic information. Perceptual evaluation experiments show that, when the input models are good enough, the resulting synthetic voice is perceived as similar to the target speaker with no important quality degradation.
URLhttp://ieeexplore.ieee.org/document/7472752/?arnumber=7472752
DOI10.1109/ICASSP.2016.7472752
ProjectMultimedia and Multilingual Human-Centered Content Discovery
Citation Key587