FC-TTS: Style and Timbre Control in Zero-Shot Text-to-Speech with Disentangled Speech Representations

Zero-shot Text-to-Speech still has limitations in independent, precise control.

FC-TTS
Introduces a 2-stage spectrogram generation pipeline and a VQ-VAE-based style encoder.
Additionally introduces a conditioning-aware consistency loss to improve attribute separation and the reliability of dual-reference control.

Paper (ACL 2026): Paper Link

1. Introduction..
Paper/TTS
2026. 6. 8. 10:57
