Perturbation Measurements on the Degree of Naturalness of Synthesized Vowels

Perturbation Measurements on the Degree of Naturalness of Synthesized Vowels

Author Yamasaki, Rosiane Google Scholar
Montagnoli, Arlindo Google Scholar
Murano, Emi Z. Google Scholar
Gebrim, Eloisa Google Scholar
Hachiya, Adriana Google Scholar
Lopes da Silva, Jorge Vicente Google Scholar
Behlau, Mara Autor UNIFESP Google Scholar
Tsuji, Domingos Google Scholar
Abstract Objective. To determine the impact of jitter and shimmer on the degree of naturalness perception of synthesized vowels produced by acoustical simulation with glottal pulses (GP) and with solid model of the vocal tract (SMVT). Study Design. Prospective study. Methods. Synthesized vowels were produced in three steps: 1. Eighty GP were developed (20 with jitter, 20 with shimmer, 20 with jitter+shimmer, 20 without perturbation)

2. A SMVT was produced based on magnetic resonance imaging (MRI) from a woman during phonation-/epsilon/ and using rapid prototyping technology

3. Acoustic simulations were performed to obtain eighty synthesized vowels-/epsilon/. Two experiments were performed. First Experiment: three judges rated 120 vowels (20 humans+80 synthesized+20% repetition) as "human" or "synthesized". Second Experiment: twenty PowerPoint slide sequences were created. Each slide had 4 synthesized vowels produced with the four perturbation condition. Evaluators were asked to rate the vowels from the most natural to the most artificial. Results. First Experiment: all the human vowels were classified as human

27 out of eighty synthesized vowels were rated as human, 15 of those were produced with jitter+shimmer, 10 with jitter, 2 without perturbation and none with shimmer. Second Experiment: Vowels produced with jitter+shimmer were considered as the most natural. Vowels with shimmer and without perturbation were considered as the most artificial. Conclusions. The association of jitter and shimmer increased the degree of naturalness of synthesized vowels. Acoustic simulations performed with GP and using SMVT demonstrated a possible method to test the effect of the perturbation measurements on synthesized voices.
Keywords Synthesized voices
Acoustical measurements
perceptual evaluation
Naturalness perception
Vocal tract model
xmlui.dri2xhtml.METS-1.0.item-coverage New York
Language English
Sponsor Fundacao de Amparo a Pesquisa do Estado de Sao Paulo
Grant number FAPESP: 2012/17390-3
Date 2017
Published in Journal Of Voice. New York, v. 31, n. 3, p. -, 2017.
ISSN 0892-1997 (Sherpa/Romeo, impact factor)
Publisher Mosby-Elsevier
Extent -
Access rights Closed access
Type Article
Web of Science ID WOS:000403543600053

Show full item record


File Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)




My Account