Logging

Learn how to control and customize logging behavior in ZenML pipelines.

By default, ZenML uses a logging handler to capture two types of logs:

Pipeline run logs: Logs collected from your ZenML client while triggering and waiting for a pipeline to run. These logs cover everything that happens client-side: building and pushing container images, triggering the pipeline, waiting for it to start, and waiting for it to finish. These logs are now stored in the artifact store, making them accessible even after the client session ends.
Step logs: Logs collected from the execution of individual steps. These logs only cover what happens during the execution of a single step and originate mostly from the user-provided step code and the libraries it calls.

For step logs, you can use the standard Python logging module or print statements; ZenML's logging handler captures these logs and stores them.

import logging

from zenml import step

@step
def my_step() -> None:
    logging.warning("`Hello`")  # You can use the regular `logging` module.
    print("World.")  # You can utilize `print` statements as well.

All these logs are stored within the respective artifact store of your stack. You can visualize the pipeline run logs and step logs in the dashboard as follows:

Local ZenML server (zenml login --local): Both local and remote artifact stores may be accessible
Deployed ZenML server: Local artifact store logs won't be accessible; remote artifact store logs require service connector configuration (see remote storage guide)

In order for logs to be visible in the dashboard with a deployed ZenML server, you must configure both a remote artifact store and the appropriate service connector to access it. Without this configuration, your logs won't be accessible through the dashboard.

Logging Configuration

Environment Variables and Remote Execution

For all logging configurations below, note:

Setting environment variables on your local machine only affects local pipeline runs
For remote pipeline runs, you must set these variables in the pipeline's execution environment using Docker settings:

from zenml import pipeline
from zenml.config import DockerSettings

docker_settings = DockerSettings(environment={"ENVIRONMENT_VARIABLE": "value"})

# Either add it to the decorator
@pipeline(settings={"docker": docker_settings})
def my_pipeline() -> None:
    my_step()

# Or configure the pipeline's options
my_pipeline = my_pipeline.with_options(
    settings={"docker": docker_settings}
)

Enabling or Disabling Logs Storage

You can control log storage for both pipeline runs and steps:

Step Logs

To disable storing step logs in your artifact store:

Using the enable_step_logs parameter with the step decorator:

from zenml import step

@step(enable_step_logs=False)  # disables logging for this step
def my_step() -> None:
    ...

Setting the ZENML_DISABLE_STEP_LOGS_STORAGE=true environment variable in the execution environment:

from zenml import pipeline
from zenml.config import DockerSettings

docker_settings = DockerSettings(environment={"ZENML_DISABLE_STEP_LOGS_STORAGE": "true"})

# Either add it to the decorator
@pipeline(settings={"docker": docker_settings})
def my_pipeline() -> None:
    my_step()

# Or configure the pipeline's options
my_pipeline = my_pipeline.with_options(
    settings={"docker": docker_settings}
)

This environment variable takes precedence over the parameter mentioned above.

Pipeline Run Logs

To disable storing client-side pipeline run logs in your artifact store:

Using the enable_pipeline_logs parameter with the pipeline decorator:

from zenml import pipeline

@pipeline(enable_pipeline_logs=False)  # disables client-side logging for this pipeline
def my_pipeline():
    ...

Using the runtime configuration:

# Disable pipeline logs at runtime (keep the configured pipeline that is returned)
my_pipeline = my_pipeline.with_options(enable_pipeline_logs=False)

Setting the ZENML_DISABLE_PIPELINE_LOGS_STORAGE=true environment variable:

from zenml import pipeline
from zenml.config import DockerSettings

docker_settings = DockerSettings(environment={"ZENML_DISABLE_PIPELINE_LOGS_STORAGE": "true"})

# Either add it to the decorator
@pipeline(settings={"docker": docker_settings})
def my_pipeline() -> None:
    my_step()

# Or configure the pipeline's options
my_pipeline = my_pipeline.with_options(
    settings={"docker": docker_settings}
)

The environment variable takes precedence over parameters set in the decorator or runtime configuration.
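The precedence rule can be pictured with a small sketch. This is not ZenML's implementation: the helper name `logs_storage_enabled` is hypothetical, and treating only the case-insensitive string "true" as enabled is an assumption made for illustration.

```python
import os
from typing import Mapping, Optional


def logs_storage_enabled(
    enable_param: Optional[bool],
    env_var_name: str,
    environ: Mapping[str, str] = os.environ,
) -> bool:
    """Hypothetical helper: decide whether logs should be stored."""
    # Assumption: the variable counts as set only when its value is "true".
    if environ.get(env_var_name, "").strip().lower() == "true":
        # The environment variable overrides whatever the parameter says.
        return False
    # Otherwise the decorator or runtime parameter decides (default: enabled).
    return True if enable_param is None else enable_param


# The parameter asks for logs, but the environment variable disables them.
env = {"ZENML_DISABLE_STEP_LOGS_STORAGE": "true"}
print(logs_storage_enabled(True, "ZENML_DISABLE_STEP_LOGS_STORAGE", env))
```

With the variable unset, the same call simply returns the value of the parameter.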

Setting Logging Verbosity

Change the default logging level (INFO) with:

export ZENML_LOGGING_VERBOSITY=INFO

Options: INFO, WARN, ERROR, CRITICAL, DEBUG
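To see how a verbosity threshold filters records, here is a minimal sketch using only the standard library. It assumes the option names above correspond to the standard Python logging levels of the same name; the helper `is_emitted` is hypothetical.

```python
import logging

# The documented option names, resolved against the standard logging module.
OPTIONS = ["DEBUG", "INFO", "WARN", "ERROR", "CRITICAL"]
LEVELS = {name: getattr(logging, name) for name in OPTIONS}


def is_emitted(record_level: str, verbosity: str) -> bool:
    """Return True if a record at record_level passes the verbosity threshold."""
    return LEVELS[record_level] >= LEVELS[verbosity]


print(LEVELS)
print(is_emitted("INFO", "WARN"))   # INFO is below the WARN threshold
print(is_emitted("ERROR", "WARN"))  # ERROR is above it
```

Lower thresholds such as DEBUG let more records through, so they are best reserved for troubleshooting.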

For remote pipeline runs:

from zenml import pipeline
from zenml.config import DockerSettings

docker_settings = DockerSettings(environment={"ZENML_LOGGING_VERBOSITY": "DEBUG"})

# Either add it to the decorator
@pipeline(settings={"docker": docker_settings})
def my_pipeline() -> None:
    my_step()

# Or configure the pipeline's options
my_pipeline = my_pipeline.with_options(
    settings={"docker": docker_settings}
)

Setting Logging Format

Change the default logging format with:

export ZENML_LOGGING_FORMAT='%(asctime)s %(message)s'

The format must use the %-string formatting style. For the available attributes, see the LogRecord attributes in the Python logging documentation.
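A format string can be previewed locally before exporting it. The sketch below uses only the standard library `logging.Formatter` with the same format value as above; the logger name and message are made up for illustration, and it does not show how ZenML applies the variable internally.

```python
import logging

LOG_FORMAT = "%(asctime)s %(message)s"  # same value as in the export above

# logging.Formatter uses the %-style by default.
formatter = logging.Formatter(LOG_FORMAT)

# Build a record by hand so the preview does not depend on any handler setup.
record = logging.LogRecord(
    name="example",
    level=logging.INFO,
    pathname="example.py",
    lineno=1,
    msg="Loading data from source...",
    args=None,
    exc_info=None,
)

line = formatter.format(record)
print(line)  # a timestamp followed by the message
```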

Disabling Rich Traceback Output

ZenML uses rich for enhanced traceback display. Disable it with:

export ZENML_ENABLE_RICH_TRACEBACK=false

Disabling Colorful Logging

Disable colorful logging with:

export ZENML_LOGGING_COLORS_DISABLED=true

Disabling Step Names in Logs

By default, ZenML adds step name prefixes to console logs:

[data_loader] Loading data from source...
[data_loader] Data loaded successfully.
[model_trainer] Training model with parameters...

These prefixes only appear in console output, not in stored logs. Disable them with:

export ZENML_DISABLE_STEP_NAMES_IN_LOGS=true
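For remote runs, several of the variables above can be collected into one dictionary and passed to the Docker settings. The helper `build_logging_env` below is hypothetical and only assembles the variable names documented on this page; it is a sketch, not part of ZenML.

```python
from typing import Dict, Optional


def build_logging_env(
    verbosity: str = "INFO",
    log_format: Optional[str] = None,
    disable_colors: bool = False,
    disable_step_names: bool = False,
    disable_rich_traceback: bool = False,
) -> Dict[str, str]:
    """Hypothetical helper: collect logging variables into one dict."""
    env = {"ZENML_LOGGING_VERBOSITY": verbosity}
    if log_format is not None:
        env["ZENML_LOGGING_FORMAT"] = log_format
    if disable_colors:
        env["ZENML_LOGGING_COLORS_DISABLED"] = "true"
    if disable_step_names:
        env["ZENML_DISABLE_STEP_NAMES_IN_LOGS"] = "true"
    if disable_rich_traceback:
        # This variable enables the feature, so disabling means "false".
        env["ZENML_ENABLE_RICH_TRACEBACK"] = "false"
    return env


logging_env = build_logging_env(verbosity="DEBUG", disable_colors=True)
print(logging_env)
# The dict can then be passed as DockerSettings(environment=logging_env).
```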

Limitations

on Steps and pipelines

When running steps and pipelines, ZenML only captures logs emitted from the thread that executes the corresponding function. If your step code spawns additional threads or runs async code, logs from those execution contexts may not be captured.

For instance, only the log emitted directly in the step function is captured:

import logging
import threading

from zenml import step

logger = logging.getLogger(__name__)


@step
def async_step() -> None:
    def _process() -> None:
        logger.info("This log is NOT captured")

    logger.info("This log is captured")
    thread = threading.Thread(target=_process)
    thread.start()
    thread.join()

As a workaround, you can run the thread's target function inside a copy of the current contextvars context, so that ZenML can associate the log records with the running step:

import contextvars
import logging
import threading

from zenml import step

logger = logging.getLogger(__name__)


@step
def async_step() -> None:
    def _process() -> None:
        logger.info("This log is now captured")

    ctx = contextvars.copy_context()
    thread = threading.Thread(target=lambda: ctx.run(_process))
    thread.start()
    thread.join()
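The reason the copied context matters can be shown without ZenML. In the sketch below, `current_step` is a hypothetical stand-in for whatever per-step state ZenML keeps in context variables. On standard CPython builds, a new thread starts with an empty context, so the value is only visible when the target runs inside the copied context.

```python
import contextvars
import threading

# Hypothetical stand-in for per-step state tracked through context variables.
current_step = contextvars.ContextVar("current_step", default=None)


def observe(results: dict, key: str) -> None:
    # Record what this execution context sees.
    results[key] = current_step.get()


current_step.set("async_step")
results = {}

# A plain thread does not see the value set in the main thread.
plain = threading.Thread(target=observe, args=(results, "plain"))

# Running the target inside a copied context carries the value over.
ctx = contextvars.copy_context()
copied = threading.Thread(target=ctx.run, args=(observe, results, "copied"))

for thread in (plain, copied):
    thread.start()
    thread.join()

print(results)
```

The same `ctx.run` pattern applies to thread pools: copy the context once per submitted task, because a single context object cannot be entered by two threads at the same time.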

on the Dashboard

When viewing logs in the dashboard, ZenML currently loads logs in bulk and pagination/filtering happens on the client side. To keep the response size and server memory usage bounded (especially when logs are stored in remote artifact stores), the dashboard is limited to 500 pages (100 log entries per page, i.e. 50,000 entries total) by default.

You can adjust this limit by setting ZENML_LOGS_MAX_ENTRIES_PER_REQUEST in the environment when you are deploying your ZenML workspace.

Downloading logs from the dashboard will also only include up to this limit.
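As a quick illustration of the arithmetic, the sketch below computes how many pages are reachable for a log of a given size. The function `visible_pages` is hypothetical, and it assumes that entries beyond the cap are simply not loaded.

```python
import math

ENTRIES_PER_PAGE = 100  # page size stated above
MAX_ENTRIES = 50_000    # default cap stated above (500 pages)


def visible_pages(total_entries: int, max_entries: int = MAX_ENTRIES) -> int:
    """Number of dashboard pages reachable for a log with total_entries entries."""
    shown = min(total_entries, max_entries)
    return math.ceil(shown / ENTRIES_PER_PAGE)


print(visible_pages(250))      # a small log fits well below the cap
print(visible_pages(120_000))  # a large log is cut off at the cap
```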

We’re actively working on improving log loading to remove the need for this cap. We'll update the documentation as this evolves with future releases.

Best Practices for Logging

- Use appropriate log levels:
  - DEBUG: Detailed diagnostic information
  - INFO: Confirmation that things work as expected
  - WARNING: Something unexpected happened
  - ERROR: A more serious problem occurred
  - CRITICAL: A serious error that may prevent continued execution
- Include contextual information in logs
- Log at decision points to track execution flow
- Avoid logging sensitive information
- Use structured logging when appropriate
- Configure appropriate verbosity for different environments
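A minimal sketch of contextual, structured logging with the standard library follows. The `JsonFormatter` class, the logger name, and the `context` field are all hypothetical choices for illustration; how ZenML stores or renders such records is not covered here.

```python
import io
import json
import logging


class JsonFormatter(logging.Formatter):
    """Hypothetical formatter that renders each record as one JSON object."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "level": record.levelname,
            "message": record.getMessage(),
            # Values passed through extra= become attributes on the record.
            "context": getattr(record, "context", {}),
        }
        return json.dumps(payload, sort_keys=True)


stream = io.StringIO()  # stands in for the console in this sketch
handler = logging.StreamHandler(stream)
handler.setFormatter(JsonFormatter())

logger = logging.getLogger("best_practices_example")
logger.setLevel(logging.INFO)
logger.addHandler(handler)
logger.propagate = False

# Contextual information travels with the record instead of the message text.
logger.info("Loaded training data", extra={"context": {"rows": 1200}})
logger.debug("Row-level details")  # below INFO, so it is filtered out

print(stream.getvalue())
```

Keeping context in a separate field makes it easier to filter records later and to leave sensitive values out deliberately.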
