반응형

CASC-XVC: Zero-Shot Cross-Lingual Voice Conversion with Content Accordant and Speaker Contrastive LossesCross-Lingual Voice Conversion은 language mismatch와 train-test inconsistency로 인해 한계가 있음CASC-XVCContent accordant loss와 Speaker contrastive loss를 incorporate 하고 content disentanglement를 위해 shared self-supervised learning representation과 information perturbation을 도입서로 다른 language의 utterance pair를..
Paper/Conversion
2025. 5. 19. 17:35
반응형