DALLE-reproduction
This repository is for sharing pre-trained OpenAI DALLE models and generating images from given text.
All models are trained with lucidrains/DALLE-pytorch + VQGAN (Taming Transformers), with different training code and a different BPE model.
The notebook includes:
- Text to image generation
- Pre-trained CLIP reranking
- Generation of the rest of an image from a given cropped image
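As an illustration of the CLIP reranking step, the sketch below ranks candidate images by the cosine similarity between a text embedding and each image embedding. It is a minimal sketch, not this repository's notebook code: the function names `cosine_similarity` and `rerank` are hypothetical, and the embeddings are assumed to come from a pre-trained CLIP model (plain Python lists stand in for them here).

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as unrelated.
        return 0.0
    return dot / (norm_a * norm_b)


def rerank(text_embedding, image_embeddings, top_k=None):
    """Return (image_index, score) pairs sorted from best to worst match.

    Hypothetical helper: in practice the embeddings would be produced by a
    pre-trained CLIP text encoder and image encoder (an assumption here).
    """
    scored = [
        (idx, cosine_similarity(text_embedding, emb))
        for idx, emb in enumerate(image_embeddings)
    ]
    # Highest similarity first; the sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored if top_k is None else scored[:top_k]


# Toy example with 2-dimensional stand-in embeddings.
text = [1.0, 0.0]
images = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
for index, score in rerank(text, images, top_k=2):
    print(index, round(score, 3))
```

Generating several candidate images for one prompt and keeping only the top-ranked ones is the usual reason for reranking; `top_k` controls how many candidates are kept.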
Usage
- Install requirements
$ pip install -r requirements
- Install DeepSpeed
- Follow the instructions here to install DeepSpeed
Models
- Download the models below and save them in the pretrained folder
- See the link in the Details column for model specifics
Dataset | Download | Password | Optimizer | Size | Details |
---|---|---|---|---|---|
CUB200 | link | v9ge | Adam | 1.1GB | link |
CUB200 | link |