Kereső
Bejelentkezés
Kapcsolat
Improving Naturalness of Neural-based TTS System Trained with Arabic Limited Data |
| Tartalom: | http://hdl.handle.net/10890/40714 |
|---|---|
| Archívum: | Műegyetem Digitális Archívum |
| Gyűjtemény: |
1. Tudományos közlemények, publikációk
Konferenciák gyűjteményei 1st Workshop on Intelligent Infocommunication Networks, Systems and Services, 2023 Workshop on Intelligent Infocommunication Networks, Systems and Services |
| Cím: |
Improving Naturalness of Neural-based TTS System Trained with Arabic Limited Data
|
| Létrehozó: |
Sawalha, Layan
Alradhi, Mohammad
|
| Dátum: |
2023-03-13T16:07:09Z
2023-03-13T16:07:09Z
2023
|
| Tartalmi leírás: |
In this paper, we investigated different approaches, a neural network speech synthesis system and a non-autoregressive text-to-speech (TTS) model. In the neural network speech synthesis, we showed how a baseline system based on Merlin is used for TTS synthesis to produce the most human-like voice; typically, it is only implemented with a front-end text processor and a WORLD vocoder. Here, we first adapted Continuous and Ahocoder vocoders; and then we investigated the effectiveness of each vocoder’s techniques to produce the highest quality speech. In the non-autoregressive TTS model, we implemented the state-of-the-results Fastspeech2 system, which provided high-quality speech synthesis in a timely manner without controllability and robustness problems. Here, we focused on integrating a different language but with limited data while maintaining its high-quality produced sounds. Through objective and subjective evaluations, we verify that our method can outperform the baseline system with full data.
|
| Nyelv: |
angol
|
| Típus: |
Konferenciaközlemény
|
| Formátum: |
application/pdf
|
| Azonosító: |