반응형
IPACue-TTS: Integrating Prosody and Articulatory Cues in Conditional Flow Matching for Multilingual Zero-Shot TTSNative-sounding cross-lingual, code-mixed Text-to-Speech model이 필요함IPACue-TTSPronunciation, prosodic accuracy를 향상하기 위해 articulatory phoneme refinement를 incorporateFlow-based framework를 통해 fine-grained acoustic, prosodic feature를 explicitly modeling논문 (ICASSP 2026) : Paper Link1. Intro..
Paper/TTS
2026. 5. 14. 14:03
반응형
