Optimizing Protein Structure Prediction Model Training and Inference on GPU Clusters
Optimizing Protein Structure Prediction Model Training and Inference on GPU Clusters
FastFold provides a high-performance implementation of Evoformer with the following characteristics.
- Excellent kernel performance on GPU platform
- Supporting Dynamic Axial Parallelism(DAP)
- Break the memory limit of single GPU and reduce the overall training time
- DAP can significantly speed up inference and make ultra-long sequence inference possible
- Ease of use
- Huge performance gains with a few lines changes
- You don’t need to care about how the parallel part is implemented
Installation
You will need Python 3.8 or later and NVIDIA CUDA 11.1 or