Mandeel, Ali Raheem and Aggar, Ammar Abdullah and Al-Radhi, Mohammed Salah and Csapó, Tamás Gábor (2023) Implementing a Text-to-Speech synthesis model on a Raspberry Pi for Industrial Applications. In: Proceedings of the 1st Workshop on Intelligent Infocommunication Networks, Systems and Services (WI2NS2). Budapesti Műszaki és Gazdaságtudományi Egyetem, Villamosmérnöki és Informatikai Kar, Budapest, pp. 77-81. ISBN 9789634219026
|
Text
Mandeel-et-al-wins2023.pdf Download (452kB) | Preview |
Abstract
Text-to-Speech (TTS) technology produces human-like speech from input text. It has recently acquired prominence by applying deep neural networks. Nowadays, endto-end TTS models produce highly natural synthesized speech but require extremely high computational resources. Deploying such high-quality TTS models in a real-time environment has been a challenging problem due to the limited resources of embedding systems and cell phones. This paper demonstrated the implementation of an end-to-end TTS model (FastSpeech 2) in an embedded device (Raspberry Pi4 B+). The objective experimental results showed that the TTS model is compatible with the Raspberry Pi (RPi) with high-quality synthesized speech and acceptable performance in terms of processing speed. Our proposed model could be used in many real-life applications if used together with a mechanism for caching, such as railway announcements and industrial purposes.
Item Type: | Book Section |
---|---|
Uncontrolled Keywords: | Real-time system, speech synthesis, FastSpeech |
Subjects: | P Language and Literature / nyelvészet és irodalom > P0 Philology. Linguistics / filológia, nyelvészet Q Science / természettudomány > QA Mathematics / matematika > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány T Technology / alkalmazott, műszaki tudományok > T2 Technology (General) / műszaki tudományok általában |
SWORD Depositor: | MTMT SWORD |
Depositing User: | MTMT SWORD |
Date Deposited: | 19 Sep 2023 08:28 |
Last Modified: | 19 Sep 2023 08:28 |
URI: | http://real.mtak.hu/id/eprint/173948 |
Actions (login required)
![]() |
Edit Item |