A Distributed Classification Training Framework with PyTorch

Distribuuuu

The pure and clear PyTorch Distributed Training Framework.

Distribuuuu is a Distributed Classification Training Framework powered by native PyTorch.

Please check tutorial for detailed Distributed Training tutorials:

  • Single Node Single GPU Card Training [snsc.py]
  • Single Node Multi-GPU Crads Training (with DataParallel) [snmc_dp.py]
  • Multiple Nodes Multi-GPU Cards Training (with DistributedDataParallel)
  • ImageNet training example [imagenet.py]

For the complete training framework, please see distribuuuu.

Requirements and Usage

Dependency

  • Install PyTorch>= 1.5 (has been tested on 1.5, 1.7.1 and 1.8)
  • Install other dependencies: pip install -r requirements.txt

Dataset

Download the ImageNet dataset and move validation images to labeled subfolders, using the script valprep.sh.

Expected datasets structure for ILSVRC

ILSVRC
|_ train
| |_ n01440764
| |_ ...
| |_ n15075141
|_ val

 

 

 

To finish reading, please visit source site