[Paper Review] ALO-VC: Any-to-Any Low-Latency One-Shot Voice Conversion
ALO-VC: Any-to-Any Low-Latency One-Shot Voice Conversion
Non-parallel, low-latency, one-shot voice conversion based on phonetic posteriorgrams enables fast synthesis.
ALO-VC
It is built by combining a pre-trained speaker encoder, a pitch predictor, and positional encoding.
ALO-VC-R uses a pre-trained d-vector speaker encoder, while ALO-VC-E uses ECAPA-TDNN to improve performance.
Paper (INTERSPEECH 2023): Paper Link
1. Introduction
Voice Conversion (VC): linguistic..
Paper/Conversion
2024. 8. 12. 10:17