A simple improvement to VQVAE that generates images twice the size of the baseline
It makes it possible to generate 512×512 images with ruDALL-E.
POC checkpoint: https://drive.google.com/file/d/1GjGXs1l0mOiFxKJwutjTQyCHEaF-wrIL/view?usp=sharing