Quality of Synthetic Speech, Softcover reprint of the original 1st ed. 2017
Perceptual Dimensions, Influencing Factors, and Instrumental Assessment

T-Labs Series in Telecommunication Services Series

Author:

Language: English

126.59 €

In Print (Delivery period: 15 days).

Add to cartAdd to cart
Quality of Synthetic Speech
Publication date:
Support: Print on demand

Approximative price 126.59 €

In Print (Delivery period: 15 days).

Add to cartAdd to cart
Quality of Synthetic Speech
Publication date:
Support: Print on demand

This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.

Introduction.- Speech Synthesis.- Auditory and Instrumental Quality Evaluation Metrics.- Perceptual Quality Dimensions.- Influencing Factors on Perceptual Quality.- Instrumental Quality Assessment.- Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System.- Conclusions.

Developes a set of five universal perceptual quality dimensions for TTS signals

Introduces a test protocol for the assessment of the five dimensions in a listening test

Investigates factors influencing the five perceptual quality dimensions

Presents different approaches towards instrumental quality assessment of synthetic speech.

Examines the integration of an instrumental quality assessment model into a TTS system for quality improvement

Includes supplementary material: sn.pub/extras