DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis
Diffusion models are computationally intensive because of their iterative denoising process.
DMOSpeech
Uses a distilled diffusion-based model to achieve faster inference than its teacher.
Supports end-to-end optimization of Connectionist Temporal Classification and Speaker Verification losses.
Paper (ICML 2025): Paper Link
1. Introduction
SpeechX, MaskGC..
Paper/TTS
2025. 12. 22. 13:44
