Parallel waveform generation with DiffusionGAN
![](https://www.deeplearningdaily.com/wp-content/uploads/2022/03/parallel-waveform-generation-with-diffusiongan_621feb0640438-375x210.jpeg)
Parallel waveform generation with DiffusionGAN
- DiffWave: A Versatile Diffusion Model for Audio Synthesis, Zhifeng Kong et al., 2020. [arXiv:2009.09761]
- Tackling the Generative Learning Trilemma with Denoising Diffusion GANs, Xiao et al., 2021. [2112.07804]
Requirements
Tested in python 3.7.9 conda environment.
Usage
Download LJSpeech dataset from official:keithito.
To train model, run train.py
python -m utils.dump
--data-dir /datasets/ljspeech
--output-dir /datasets/ljspeech/vocdump
--num-proc 8
python train.py
--data-dir /datasets/ljspeech/vocdump
--from-dump
GitHub