Deploying ZenML
Why do we need to deploy ZenML?
Last updated
Why do we need to deploy ZenML?
Last updated
Moving your ZenML Server to a production environment offers several benefits over staying local:
Scalability: Production environments are designed to handle large-scale workloads, allowing your models to process more data and deliver faster results.
Reliability: Production-grade infrastructure ensures high availability and fault tolerance, minimizing downtime and ensuring consistent performance.
Collaboration: A shared production environment enables seamless collaboration between team members, making it easier to iterate on models and share insights.
Despite these advantages, transitioning to production can be challenging due to the complexities involved in setting up the needed infrastructure.
A ZenML deployment consists of multiple infrastructure components:
FastAPI server backed with a SQLite or MySQL database
An open-source companion ReactJS dashboard
You can read more in-depth about the system architecture of ZenML here. This documentation page will focus on the components required to deploy ZenML OSS.
When you first get started with ZenML, you have the following architecture on your machine.
The SQLite database that you can see in this diagram is used to store information about pipelines, pipeline runs, stacks, and other configurations. This default setup allows you to get started and try out the core features but you won't be able to use cloud-based components like serverless orchestrators and so on.
Users can run the zenml login --local
command to spin up a local ZenML OSS server to serve the dashboard. For the local OSS server option, the zenml login --local
command implicitly connects the client to the server. The diagram for this looks as follows:
In order to move into production, the ZenML server needs to be deployed somewhere centrally so that the different cloud stack components can read from and write to the server. Additionally, this also allows all your team members to connect to it and share stacks and pipelines.
You connect to your deployed ZenML server using the zenml login
command and then you have the full benefits and power of ZenML. You can use all the cloud-based components, your metadata will be stored and synchronized across all the users of the server and you can leverage features like centralized logs storage and pipeline artifact visualization.
Deploying the ZenML Server is a crucial step towards transitioning to a production-grade environment for your machine learning projects. By setting up a deployed ZenML Server instance, you gain access to powerful features, allowing you to use stacks with remote components, centrally track progress, collaborate effectively, and achieve reproducible results.
Currently, there are two main options to access a deployed ZenML server:
Managed deployment: With ZenML Pro offering you can utilize a control plane to create ZenML servers, also known as tenants. These tenants are managed and maintained by ZenML's dedicated team, alleviating the burden of server management from your end. Importantly, your data remains securely within your stack, and ZenML's role is primarily to handle tracking of metadata and server maintenance.
Self-hosted Deployment: Alternatively, you have the ability to deploy ZenML on your own self-hosted environment. This can be achieved through various methods, including using Docker, Helm, or HuggingFace Spaces. We also offer our Pro version for self-hosted deployments, so you can use our full paid feature-set while staying fully in control with an air-gapped solution on your infrastructure.
Both options offer distinct advantages, allowing you to choose the deployment approach that best aligns with your organization's needs and infrastructure preferences. Whichever path you select, ZenML facilitates a seamless and efficient way to take advantage of the ZenML Server and enhance your machine learning workflows for production-level success.
Documentation for the various deployment strategies can be found in the following pages below (in our 'how-to' guides):
Deploying ZenML using ZenML Pro
Deploying ZenML using ZenML Pro.
Deploy with Docker
Deploying ZenML in a Docker container.
Deploy with Helm
Deploying ZenML in a Kubernetes cluster with Helm.
Deploy with HuggingFace Spaces
Deploying ZenML to Hugging Face Spaces.