A Python library intended to liberate data scientists and machine learning engineers
lazycluster is a Python library intended to liberate data scientists and machine learning engineers by abstracting away cluster management and configuration so that they are able to focus on their actual tasks. Especially, the easy and convenient cluster setup with Python for various distributed machine learning frameworks is emphasized.
Highlights
- High-Level API for starting clusters:
- DASK
- Hyperopt
- More lazyclusters (e.g. Ray, PyTorch, Tensorflow, Horovod, Spark) to come …
- Lower-level API for:
- Managing Runtimes or RuntimeGroups to:
- A-/synchronously execute RuntimeTasks by leveraging the power of ssh
- Expose services (e.g. a DB) from or to a
Runtime
or in a wholeRuntimeGroup
- Command line interface (CLI)
- List all available
Runtimes
- Add a
Runtime
configuration - Delete a
Runtime
configuration
Concept Definition: Runtime
A
Runtime
is the logical representation