OpenAI DALLE model and generating images from given texts

DALLE-reproduction

This repository is for sharing pre-trained OpenAI DALLE model and generating images from given texts.

All models are trained by lucidrains/DALLE-pytorch + VQGAN (Taming transformer) with different training code and BPE model.

The notebook includes

  1. Text to image generation
  2. Pre-trained CLIP reranking

cub_reranking

coco_reranking

3. Generate rest of image based on the given cropped image

cub_cropped

coco_cropped

Usage

  1. Install requirements
$ pip install -r requirements
  1. Install DeepSpeed
  • Follow the instruction here and install DeepSpeed

Models

  • Download models below and save them in pretrained folder
  • Check the link in Details for the model specifics

 

 

 

To finish reading, please visit source site

Dataset Download Password Optimizer Size Details
CUB200 link v9ge Adam 1.1GB link
CUB200 link