'whisperx' 태그의 글 목록

[Paper 리뷰] WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

WhisperX: Time-Accurate Speech Transcription of Long-Form AudioWeakly-supervised speech recognition model은 각 utterance에 해당하는 predicted timestamp가 inaccurate 하고 word-level timestamp를 out-of-the-box로 사용할 수 없음특히 sequential natrue로 인해 long audio의 buffered transcription을 통한 batched inference가 어려움WhisperXWord-level timestamp를 가진 time-accurate speech recognition modelVoice Activity Detection과 forced ph..

Paper/ASR 2025. 3. 18. 21:54

이전 1 다음

이전 다음

최근에 올라온 글

최근에 달린 댓글

« 2025/04 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

Total

Today

Yesterday

Let IT Begin

티스토리툴바