Logging and visualizing experiments with neptune.ai
The Neptune Experiment Tracker is an Experiment Tracker flavor provided with the Neptune-ZenML integration that uses neptune.ai to log and visualize information from your pipeline steps (e.g. models, parameters, metrics).
When would you want to use it?
Neptune is a popular tool that you would normally use in the iterative ML experimentation phase to track and visualize experiment results or as a model registry for your production-ready models. Neptune can also track and visualize the results produced by your automated pipeline runs, as you make the transition towards a more production-oriented workflow.
You should use the Neptune Experiment Tracker:
if you have already been using neptune.ai to track experiment results for your project and would like to continue doing so as you are incorporating MLOps workflows and best practices in your project through ZenML.
if you are looking for a more visually interactive way of navigating the results produced from your ZenML pipeline runs (e.g. models, metrics, datasets)
if you would like to connect ZenML to neptune.ai to share the artifacts and metrics logged by your pipelines with your team, organization, or external stakeholders
You should consider one of the other Experiment Tracker flavors if you have never worked with neptune.ai before and would rather use another experiment tracking tool that you are more familiar with.
How do you deploy it?
The Neptune Experiment Tracker flavor is provided by the Neptune-ZenML integration. You need to install it on your local machine to be able to register the Neptune Experiment Tracker and add it to your stack:
zenmlintegrationinstallneptune-y
The Neptune Experiment Tracker needs to be configured with the credentials required to connect to Neptune using an API token.
Authentication Methods
You need to configure the following credentials for authentication to Neptune:
api_token: API key token of your Neptune account. You can create a free Neptune account here. If left blank, Neptune will attempt to retrieve the token from your environment variables.
project: The name of the project where you're sending the new run, in the form "workspace-name/project-name". If the project is not specified, Neptune will attempt to retrieve it from your environment variables.
This method requires you to configure a ZenML secret to store the Neptune tracking service credentials securely.
You can create the secret using the zenml secret create command:
Once the secret is created, you can use it to configure the neptune Experiment Tracker:
# Reference the project and api-token in our experiment tracker componentzenmlexperiment-trackerregisterneptune_experiment_tracker \--flavor=neptune \--project=<project_name> \--api_token={{neptune_secret.api_token}}...# Register and set a stack with the new experiment trackerzenmlstackregisterneptune_stack-eneptune_experiment_tracker...--set
Read more about ZenML Secrets in the ZenML documentation.
This option configures the credentials for neptune.ai directly as stack component attributes.
This is not recommended for production settings as the credentials won't be stored securely and will be clearly visible in the stack configuration.
# Register the Neptune experiment trackerzenmlexperiment-trackerregisterneptune_experiment_tracker--flavor=neptune \ --project=<project_name> --api_token=<token># Register and set a stack with the new experiment trackerzenmlstackregisterneptune_stack-eneptune_experiment_tracker...--set
For more, up-to-date information on the Neptune Experiment Tracker implementation and its configuration, you can have a look at the SDK docs .
How do you use it?
To log information from a ZenML pipeline step using the Neptune Experiment Tracker component in the active stack, you need to enable an experiment tracker using the @step decorator. Then fetch the Neptune run object and use logging capabilities as you would normally do. For example:
from zenml.integrations.neptune.experiment_trackers.run_state import ( get_neptune_run)from neptune.utils import stringify_unsupportedfrom zenml import get_step_contextfrom sklearn.model_selection import train_test_splitfrom sklearn.svm import SVCfrom sklearn.datasets import load_irisfrom zenml import pipeline, stepfrom zenml.client import Clientfrom zenml.integrations.neptune.experiment_trackers import NeptuneExperimentTracker# Get the experiment tracker from the active stackexperiment_tracker: NeptuneExperimentTracker =Client().active_stack.experiment_tracker@step(experiment_tracker="neptune_experiment_tracker")deftrain_model() -> SVC: iris =load_iris() X_train, _, y_train, _ =train_test_split( iris.data, iris.target, test_size=0.2, random_state=42 ) params ={"kernel":"rbf","C":1.0,} model =SVC(**params) model.fit(X_train, y_train)# Log the model to Neptune neptune_run =get_neptune_run() neptune_run["parameters"]= paramsreturn model
Instead of hardcoding an experiment tracker name, you can also use the Client to dynamically use the experiment tracker of your active stack:
from zenml.client import Clientexperiment_tracker =Client().active_stack.experiment_tracker@step(experiment_tracker=experiment_tracker.name)deftf_trainer(...): ...
Logging ZenML pipeline and step metadata to the Neptune run
You can use the get_step_context method to log some ZenML metadata in your Neptune run:
Check out the SDK docs for a full list of available attributes
Neptune UI
Neptune comes with a web-based UI that you can use to find further details about your tracked experiments. You can find the URL of the Neptune run linked to a specific ZenML run printed on the console whenever a Neptune run is initialized. You can also find it in the dashboard in the metadata tab of any step that has used the tracker:
Each pipeline run will be logged as a separate experiment run in Neptune, which you can inspect in the Neptune UI.
Clicking on one run will reveal further metadata logged within the step:
Full Code Example
This section shows an end to end run with the ZenML Neptune integration.
Code Example of this Section
from zenml.integrations.neptune.experiment_trackers.run_state import ( get_neptune_run)from neptune.utils import stringify_unsupportedfrom zenml import get_step_contextfrom sklearn.model_selection import train_test_splitfrom sklearn.datasets import load_irisfrom sklearn.svm import SVCfrom sklearn.metrics import accuracy_scorefrom zenml import pipeline, stepfrom zenml.client import Clientfrom zenml.integrations.neptune.experiment_trackers import NeptuneExperimentTrackerimport neptune.integrations.sklearn as npt_utils# Get the experiment tracker from the active stackexperiment_tracker: NeptuneExperimentTracker =Client().active_stack.experiment_tracker@step(experiment_tracker=experiment_tracker.name)deftrain_model() -> SVC: iris =load_iris() X_train, _, y_train, _ =train_test_split( iris.data, iris.target, test_size=0.2, random_state=42 ) params ={"kernel":"rbf","C":1.0,} model =SVC(**params) model.fit(X_train, y_train)# Log parameters and model to Neptune neptune_run =get_neptune_run() neptune_run["parameters"]= params neptune_run["estimator/pickled-model"]= npt_utils.get_pickled_model(model)return model@step(experiment_tracker=experiment_tracker.name)defevaluate_model(model: SVC): iris =load_iris() _, X_test, _, y_test =train_test_split( iris.data, iris.target, test_size=0.2, random_state=42 ) y_pred = model.predict(X_test) accuracy =accuracy_score(y_test, y_pred) neptune_run =get_neptune_run() context =get_step_context()# Log metadata using Neptune neptune_run["zenml_metadata/pipeline_metadata"]=stringify_unsupported( context.pipeline_run.get_metadata().model_dump() ) neptune_run[f"zenml_metadata/{context.step_name}"]=stringify_unsupported( context.step_run.get_metadata().model_dump() )# Log accuracy metric to Neptune neptune_run["metrics/accuracy"]= accuracyreturn accuracy@pipelinedefml_pipeline(): model =train_model() accuracy =evaluate_model(model)if__name__=="__main__":from zenml.integrations.neptune.flavors import NeptuneExperimentTrackerSettings neptune_settings =NeptuneExperimentTrackerSettings( tags={"regression", "sklearn"} ) ml_pipeline.with_options(settings={"experiment_tracker": neptune_settings})()
Further reading
Check Neptune's docs for further information on how to use this integration and Neptune in general.