Train with GPUs
Train ZenML pipelines on GPUs and scale out with 🤗 Accelerate.
1 Request extra resources for a step
from zenml import step
from zenml.config import ResourceSettings
@step(settings={
"resources": ResourceSettings(cpu_count=8, gpu_count=2, memory="16GB")
})
def training_step(...):
... # heavy training logic2 Build a CUDA‑enabled container image
Optional – clear the CUDA cache
3 Multi‑GPU / multi‑node training with 🤗 Accelerate
Prepare the container
4 Troubleshooting & Tips
Problem
Quick fix
Last updated
Was this helpful?