
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

- A speech signal contains multi-faceted information such as speaker identity, paralinguistics, and spoken content
- WavLM
  - Jointly learns masked speech prediction and denoising during pre-training
  - Introduces a gated relative position bias into the Transformer structure to capture the sequence ordering of the input speech (see the sketch after this list)
- Paper (JSTSP 2022): Paper Link
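To make the gated relative position bias concrete, here is a minimal, hypothetical PyTorch sketch, not the paper's exact formulation: a learnable bias indexed by the clipped relative distance (i − j) is scaled by a sigmoid gate computed from each query, so how strongly position information contributes depends on the speech content. The module name, single-gate design, and `max_distance` bucketing are illustrative assumptions.

```python
import torch
import torch.nn as nn


class GatedRelativePositionBias(nn.Module):
    """Content-dependent relative position bias (simplified sketch).

    Follows the spirit of WavLM's gated relative position bias, not its
    exact equations: a learnable bias table indexed by relative distance
    is modulated by a query-dependent sigmoid gate.
    """

    def __init__(self, head_dim: int, num_heads: int, max_distance: int = 128):
        super().__init__()
        self.max_distance = max_distance
        # One learnable bias per (relative distance, head).
        self.bias_table = nn.Parameter(torch.zeros(2 * max_distance + 1, num_heads))
        # One learnable gate vector per head, dotted with the query.
        self.gate_u = nn.Parameter(torch.zeros(num_heads, head_dim))

    def forward(self, q: torch.Tensor) -> torch.Tensor:
        # q: (batch, num_heads, seq_len, head_dim)
        seq_len = q.size(2)
        pos = torch.arange(seq_len, device=q.device)
        # Relative distance (i - j), clipped to the table range,
        # then shifted to [0, 2 * max_distance] for indexing.
        rel = (pos[:, None] - pos[None, :]).clamp(-self.max_distance, self.max_distance)
        bias = self.bias_table[rel + self.max_distance]   # (seq_len, seq_len, num_heads)
        bias = bias.permute(2, 0, 1)                      # (num_heads, seq_len, seq_len)
        # Content-dependent gate in (0, 1) per query position and head.
        gate = torch.sigmoid(torch.einsum("bhtd,hd->bht", q, self.gate_u))
        # Gated bias, to be added to the attention logits before softmax.
        return gate.unsqueeze(-1) * bias.unsqueeze(0)     # (batch, heads, seq, seq)


# Usage inside scaled dot-product attention (hypothetical shapes):
b, h, t, d = 2, 8, 100, 64
q, k = torch.randn(b, h, t, d), torch.randn(b, h, t, d)
bias = GatedRelativePositionBias(head_dim=d, num_heads=h)(q)
attn = ((q @ k.transpose(-2, -1)) / d ** 0.5 + bias).softmax(dim=-1)
```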
1. Introduction