Posts tagged 'Vocoder'

[Paper Review] GLA-Grad++: An Improved Griffin-Lim Guided Diffusion Model for Speech Synthesis

Diffusion vocoders are limited in computational cost and in robustness to mismatched distributions. GLA-Grad++ integrates Griffin-Lim with the reverse process to reduce the inconsistency between the generated signal and the mel-spectrogram, and additionally applies a correction to improve phase-awareness. Paper (ICASSP 2026): Paper Link. 1. Introduction: WaveGrad, DiffWave, and other such diffusion-based vo..

Paper/Vocoder 2026. 5. 4. 11:00

[Paper Review] Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation

Flow matching-based audio generation models can be improved further. Flow2GAN reformulates flow matching as end-point estimation and applies spectral energy-based loss scaling to emphasize perceptually salient quieter regions. Additionally, lightweight Generative Adversarial Network fine-tuning and multi-branch net..

Paper/Vocoder 2026. 4. 27. 12:55

[Paper Review] ComVo: Toward Complex-Valued Neural Networks for Waveform Generation

iSTFT-based vocoders struggle to capture the inherent structure of the complex spectrogram. ComVo uses native complex arithmetic in both the generator and the discriminator to provide structured feedback on complex-valued representations. It introduces phase quantization to discretize phase values and regularize the training process. Additionally, through block-matrix computation, training efficienc..

Paper/Vocoder 2026. 4. 7. 13:03

[Paper Review] DegVoC: Revisiting Neural Vocoder from a Degradation Perspective

Existing neural vocoders face a performance-cost trade-off. DegVoC treats the mel-spectrogram as the outcome of a signal degradation process applied to the target spectrum. It uses the degradation prior to retrieve the initial spectral structure through a simple linear transformation, and introduces a deep prior solver that accounts for the heterogeneous distribution in the time-frequency domain. Paper (AAAI 2026): Paper Link. 1. Intro..

Paper/Vocoder 2026. 3. 30. 13:05

[Paper Review] WaveNeXt2: ConvNeXt-based Fast Neural Vocoders with Residual Denoising and Sub-Modeling for GAN and Diffusion Models

Most ConvNeXt-based vocoders use only the Generative Adversarial Network framework. WaveNeXt2 introduces residual denoising and sub-modeling to progressively refine the waveform, and builds a ConvNeXt-based architecture compatible with both Generative Adversarial Network and diffusion models. Paper (ICASSP 2026): Paper Link. 1. Introdu..

Paper/Vocoder 2026. 3. 16. 10:52

[Paper Review] Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration Towards High-Quality Speech Generation from SSL Features

High-quality waveform generation can be performed from data-driven features such as those from Self-Supervised Learning. WaveTrainerFit introduces a trainable prior so that the inference process starts from noise close to the target speech, and imposes a constraint on the trainable prior through reference-aware gain adjustment. Paper (ICAS..

Paper/Vocoder 2026. 3. 4. 13:15
