반응형
Beyond Hard Sharing: Efficient Multi-Task Speech-to-Text Modeling with Supervised Mixture of ExpertsHard parameter sharing은 task interference로 인해 model performance가 저하됨S-MoE각 task를 designated expert에 route 하는 special guiding token을 활용해 gating function을 eliminate해당 S-MoE를 Speech-to-Text model에 적용하여 mixed-bandwidth input을 처리논문 (INTERSPEECH 2025) : Paper Link1. IntroductionSpeech-to-Text (STT) mode..
Paper/ASR
2025. 8. 30. 07:41
반응형
