Kereső
Bejelentkezés
Kapcsolat
Regeneration of Ultrasound Tongue Images using Tongue Position Values towards Articulatory-to-Acoustic Mapping |
Tartalom: | http://hdl.handle.net/10890/58912 |
---|---|
Archívum: | Műegyetem Digitális Archívum |
Gyűjtemény: |
1. Tudományos közlemények, publikációk
Konferenciák gyűjteményei Workshop on Intelligent Infocommunication Networks, Systems and Services 3rd Workshop on Intelligent Infocommunication Networks, Systems and Services, 2025 |
Cím: |
Regeneration of Ultrasound Tongue Images using Tongue Position Values towards Articulatory-to-Acoustic Mapping
|
Létrehozó: |
Ibrahimov, Ibrahim
Gosztolya, Gábor
Zainkó, Csaba
|
Dátum: |
2025-02-20T13:51:34Z
2025-02-20T13:51:34Z
2025
|
Tartalmi leírás: |
Silent Speech Interfaces (SSI) aim to help people communicate by using articulatory data, supporting those who cannot speak aloud or need to communicate in noisy or silent environments. Ultrasound Tongue Imaging (UTI) is a widely used, safe, and cost-effective method for studying tongue movements during speech. UTI provides real-time views of tongue shapes and dynamics, making it useful for Articulation-to-Speech (ATS) synthesis. However, challenges such as probe misalignment, incomplete tongue images, and differences between speakers can affect data quality. Recent advancements, such as neural vocoders like WaveGlow and adaptations of Tacotron2, have improved ATS synthesis, producing more natural and accurate speech. UTI has also been used to enhance text-to-speech (TTS) systems, showing potential for applications like speech training. To address limitations, studies have explored techniques like dynamic time warping (DTW) and data augmentation, aiming to mitigate alignment issues and enrich training datasets. This study investigates the optimal representation of tongue shape using UTI, employing DeepLabCut for pose estimation and tongue image regeneration from tongue position values towards articulatory-to-acoustic mapping (AAM). The findings aim to refine UTI-based speech synthesis systems and expand their applicability.
|
Nyelv: |
angol
|
Típus: |
Könyvfejezet
|
Formátum: |
application/pdf
|
Azonosító: |