'VoXtream' 태그의 글 목록

[Paper 리뷰] VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency

VoXtream: Full-Stream Text-to-Speech with Extremely Low LatencyReal-time zero-shot streaming text-to-speech model이 필요함VoXtreamLimited look-ahead를 사용하여 incoming phoneme을 audio token으로 directly mapping구조적으로는 incremental phoneme transformer, temporal transformer, depth transformer를 활용논문 (ICASSP 2026) : Paper Link1. IntroductionLow-latency streaming Text-to-Speech (TTS)를 위해서는 first-packet latency를 m..

Paper/TTS 2026. 3. 23. 10:23

이전 1 다음

이전 다음

최근에 올라온 글

최근에 달린 댓글

« 2026/03 »
일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30	31

Total

Today

Yesterday

Let IT Begin

티스토리툴바