Modal
Executing individual steps in Modal.
Modal is a platform for running cloud infrastructure. It offers specialized compute instances to run your code and has a fast execution time, especially around building Docker images and provisioning hardware. ZenML's Modal step operator allows you to submit individual steps to be run on Modal compute instances.
When to use it
You should use the Modal step operator if:
You need fast execution time for steps that require computing resources (CPU, GPU, memory).
You want to easily specify the exact hardware requirements (e.g., GPU type, CPU count, memory) for each step.
You have access to Modal.
How to deploy it
To use the Modal step operator:
Sign up for a Modal account if you haven't already.
Install the Modal CLI by running
pip install modal
(orzenml integration install modal
) and authenticate by runningmodal setup
in your terminal.
How to use it
To use the Modal step operator, we need:
The ZenML
modal
integration installed. If you haven't done so, runDocker installed and running.
A cloud artifact store as part of your stack. This is needed so that both your orchestration environment and Modal can read and write step artifacts. Any cloud artifact store supported by ZenML will work with Modal.
A cloud container registry as part of your stack. Any cloud container registry supported by ZenML will work with Modal.
We can then register the step operator:
Once you added the step operator to your active stack, you can use it to execute individual steps of your pipeline by specifying it in the @step
decorator as follows:
ZenML will build a Docker image which includes your code and use it to run your steps in Modal. Check out this page if you want to learn more about how ZenML builds these images and how you can customize them.
Additional configuration
You can specify the hardware requirements for each step using the ResourceSettings
class as described in our documentation on resource settings:
Note that the cpu
parameter in ResourceSettings
currently only accepts a single integer value. This specifies a soft minimum limit - Modal will guarantee at least this many physical cores, but the actual usage could be higher. The CPU cores/hour will also determine the minimum price paid for the compute resources.
For example, with the configuration above (2 CPUs and 32GB memory), the minimum cost would be approximately $1.03 per hour ((0.135 * 2) + (0.024 * 32) = $1.03).
This will run my_modal_step
on a Modal instance with 1 A100 GPU, 2 CPUs, and 32GB of CPU memory.
Check out the Modal docs for the full list of supported GPU types and the SDK docs for more details on the available settings.
The settings do allow you to specify the region and cloud provider, but these settings are only available for Modal Enterprise and Team plan customers. Moreover, certain combinations of settings are not available. It is suggested to err on the side of looser settings rather than more restrictive ones to avoid pipeline execution failures. In the case of failures, however, Modal provides detailed error messages that can help identify what is incompatible. See more in the Modal docs on region selection for more details.
Last updated