Alwaisi, Shaimaa and Németh, Géza (2024) Advancements in Expressive Speech Synthesis: A Review. INFOCOMMUNICATIONS JOURNAL : A PUBLICATION OF THE SCIENTIFIC ASSOCIATION FOR INFOCOMMUNICATIONS (HTE), 16 (1). pp. 35-46. ISSN 2061-2079
|
Text
InfocomJournal_2024_1_5.pdf Download (1MB) | Preview |
Abstract
In recent years, we have witnessed a fast and wide spread acceptance of speech synthesis technology in, leading to the transition toward a society characterized by a strong desire to incorporate these applications in their daily lives. We provide a comprehensive survey on the recent advancements in the field of expressive Text-To-Speech systems. Among different methods to represent expressivity, this paper focuses the development of ex pressive TTS systems, emphasizing the methodologies employed to enhance the quality and expressiveness of synthetic speech, such as style transfer and improving speaker variability. After that, we point out some of the subjective and objective metrics that are used to evaluate the quality of synthesized speech. Finally, we point out the realm of child speech synthesis, a domain that has been neglected for some time. This underscores that the field of research in children's speech synthesis is still wide open for exploration and development. Overall, this paper presents a comprehensive overview of historical and contemporary trends and future directions in speech synthesis research.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Speech style, Expressivity, Emotional speech, Expressive TTS, Prosody modification, Multilingual and multispeaker TTS |
Subjects: | Q Science / természettudomány > QA Mathematics / matematika > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány |
SWORD Depositor: | MTMT SWORD |
Depositing User: | MTMT SWORD |
Date Deposited: | 07 May 2024 14:13 |
Last Modified: | 07 May 2024 14:13 |
URI: | https://real.mtak.hu/id/eprint/194128 |
Actions (login required)
![]() |
Edit Item |