
# Examples

Every example here is real, runnable code that shows the Kitaru loop in practice: **run** an agent so every model and tool call lands as a durable checkpoint, **replay** a real run from a checkpoint with one input changed (a different model, a different prompt), and **diff** the result against a faithful baseline. The repo groups them by purpose:

* **Agent Harness Platform** — a stage-by-stage tour through building a durable agent harness platform on Kitaru + PydanticAI. Read this first if you're new.
* **Other end-to-end examples** — production-shaped, self-contained scenarios that exercise multiple primitives at once. No required reading order.
* **Feature-focused examples** — small examples that demo one Kitaru primitive in isolation. Start with `features/replay/` to see the differentiator in isolation.

Every example is a standalone project. Clone the repo, install dependencies with `uv`, initialize the project once, and then run the example you want:

```bash
git clone https://github.com/zenml-io/kitaru.git
cd kitaru
uv sync --extra local
uv run kitaru init
uv run python examples/features/basic_flow/first_working_flow.py
```

{% hint style="info" %}
Run `uv run kitaru init` once in the repo checkout before your first example. It creates the project marker Kitaru uses when replaying or resolving flow source from a saved execution.
{% endhint %}

Most examples can be run from the repository root with `uv run python path/to/script.py`. Some end-to-end examples (including the Agent Harness Platform tour) tell you to `cd` into their directory first because they read a local `.env` file or have a multi-step README.

{% hint style="info" %}
Adapting an existing PydanticAI, OpenAI Agents, LangGraph, Claude Agent SDK, Gemini Interactions, or Google ADK project? See [Agent Skills](/kitaru/agent-native/claude-code-skill.md) for migration skills that guide a coding agent through the adapter-specific path.
{% endhint %}

## Connection context

Examples use whatever Kitaru connection context is already active.

* If you are just trying Kitaru locally, run `uv run kitaru login` and use the examples as-is.
* If you already have a deployed Kitaru server and want the examples to use it, connect first and verify the active context before running the example.

```bash
uv run kitaru login https://my-server.example.com
uv run kitaru status
```
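
If you script your example runs, the same sequence can be wrapped in a small helper. The sketch below is illustrative and is not shipped with the repo: `example_commands` and `run_example` are hypothetical names, and the only CLI invocations it builds are the ones shown on this page (`kitaru login`, `kitaru status`, and `uv run python <script>`).

```python
import subprocess
from typing import Optional


def example_commands(script: str, server_url: Optional[str] = None) -> list[list[str]]:
    """Return the CLI invocations for one example script, in order."""
    login = ["uv", "run", "kitaru", "login"]
    if server_url:
        login.append(server_url)  # deployed server; omitted for local use
    return [
        login,
        ["uv", "run", "kitaru", "status"],  # verify the active context
        ["uv", "run", "python", script],
    ]


def run_example(script: str, server_url: Optional[str] = None) -> None:
    """Run each command in turn and stop at the first failure."""
    for argv in example_commands(script, server_url):
        subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Print the plan only; call run_example(...) to actually execute it.
    for argv in example_commands("examples/features/basic_flow/first_working_flow.py"):
        print(" ".join(argv))
```

Building the argument lists separately from executing them keeps the plan easy to inspect before anything touches your connection context.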

## Start here — Agent Harness Platform

A platform engineer's starter kit for building an organization's internal agent harness platform on top of Kitaru + PydanticAI. **Bring Docker and one model-provider API key, then run `bash setup.sh && uv run python stage_N_*.py`.** The stages build progressively from a 30-line durable agent to a sandboxed, credential-isolated agent with human-in-the-loop (HITL) support. Each stage adds exactly one tool or one architectural primitive, the library grows monotonically, and the per-stage `Profile` gates which capabilities each agent actually exercises.

Use it as the first thing you read end-to-end, and as the thing you fork for your team.

→ [Agents guide](https://docs.zenml.io/user-guides/agents-guide) — read the stage-by-stage docs, then [grab the code on GitHub](https://github.com/zenml-io/kitaru/tree/develop/examples/end_to_end/agent_harness_platform).

```bash
git clone https://github.com/zenml-io/kitaru.git
cd kitaru/examples/end_to_end/agent_harness_platform
uv sync
uv run kitaru init
export OPENAI_API_KEY=sk-...
uv run python stage_1_basic_agent.py
```

## Other end-to-end examples

Production-shaped examples that exercise multiple primitives in one runnable scenario. Each is self-contained and focused on one harness or scenario — no progressive tour, no required reading order. Pick the one closest to your domain.

| Example             | Demonstrates                                                                                                                                                                                                                                                       | Path                                                                                                                                  |
| ------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------- |
| Compliance review   | Multi-stage Claude audit using the [Claude Agent SDK](https://github.com/anthropics/claude-agent-sdk-python). Each agent turn is a checkpoint; later stages add domain decomposition with partial replay and conversational `kitaru.wait()` resume across crashes. | [`examples/end_to_end/compliance_review/`](https://github.com/zenml-io/kitaru/tree/develop/examples/end_to_end/compliance_review)     |
| OpenAI research bot | Multi-agent OpenAI research bot using `KitaruRunner(checkpoint_strategy="runner_call")` — planner/writer runner checkpoints with submitted search fan-out. Publishes `research_plan`, `search_summaries`, and `final_report` artifacts.                            | [`examples/end_to_end/openai_research_bot/`](https://github.com/zenml-io/kitaru/tree/develop/examples/end_to_end/openai_research_bot) |
| Coding agent        | Interactive coding agent built directly on provider SDKs (no PydanticAI, no LangChain). Demos parallel tool execution, durable HITL via `kitaru.wait()`, custom materializers, and descriptive checkpoint names supplied by the LLM.                               | [`examples/end_to_end/coding_agent/`](https://github.com/zenml-io/kitaru/tree/develop/examples/end_to_end/coding_agent)               |
| News scout          | PydanticAI agent that scores news across your interest list — `checkpoint_strategy="calls"` makes every search/fetch/score call replayable. Interests come from CLI flags or a built-in default list. No Docker required.                                          | [`examples/end_to_end/news_scout/`](https://github.com/zenml-io/kitaru/tree/develop/examples/end_to_end/news_scout)                   |

## Feature-focused examples

Small examples that demo one primitive in isolation. Pick by the thing you want to see.

### Core workflow basics

| Example                                                 | Demonstrates                                                                                                              | Related docs                                                                              |
| ------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------- |
| `features/basic_flow/first_working_flow.py`             | Smallest `@flow` + `@checkpoint` example                                                                                  | [Quickstart](/kitaru/getting-started/quickstart.md)                                       |
| `features/basic_flow/flow_with_logging.py`              | `kitaru.log()` metadata at flow and checkpoint scope                                                                      | [Logging](/kitaru/core-concepts/logging.md)                                               |
| `features/checkpoint_streaming/checkpoint_streaming.py` | `kitaru.progress()` and `kitaru.events.publish()` from checkpoint bodies                                                  | [Checkpoint Live Events](/kitaru/guides/checkpoint-streaming.md)                          |
| `features/basic_flow/flow_with_artifacts.py`            | `kitaru.save()` and `kitaru.load()` across executions                                                                     | [Artifacts](/kitaru/guides/artifacts.md)                                                  |
| `features/basic_flow/flow_with_checkpoint_runtime.py`   | `@checkpoint(runtime="isolated")` for work that should run outside the runner process                                     | [Checkpoints](/kitaru/core-concepts/checkpoints.md)                                       |
| `features/basic_flow/flow_with_configuration.py`        | `kitaru.configure()` defaults, overrides, and frozen specs                                                                | [Configuration](/kitaru/guides/configuration.md)                                          |
| `features/sandbox/active_stack_sandbox_command.py`      | A tracked `@flow` + `@checkpoint` that calls `kitaru.run_sandbox_command(...)` using the active stack's sandbox component | [Stacks](/kitaru/agent-runtime-stacks/stacks.md#use-the-active-stack-sandbox-from-python) |

### Replay, lifecycle, and recovery

| Example                                                        | Demonstrates                                                                                                             | Related docs                                                   |
| -------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------ | -------------------------------------------------------------- |
| `features/execution_management/client_execution_management.py` | `KitaruClient` for listing runs, reading details, and loading data                                                       | [Execution Management](/kitaru/guides/execution-management.md) |
| `features/execution_management/wait_and_resume.py`             | `kitaru.wait()` with inline prompt or CLI input/resume                                                                   | [Wait, Input, and Resume](/kitaru/guides/wait-and-resume.md)   |
| `features/replay/replay_with_overrides.py`                     | Re-execute a real run from a checkpoint with one input overridden (model, prompt), then diff against a faithful baseline | [Replay and Overrides](/kitaru/guides/replay-and-overrides.md) |
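
To make the replay idea concrete before opening the real example, here is a toy model of "run, replay from a checkpoint with one input changed, then diff" in plain Python. It deliberately does not use Kitaru's API: `run`, `diff`, `fetch`, and `summarize` are hypothetical names, and the "checkpoints" are just a dictionary of recorded step outputs. See `features/replay/replay_with_overrides.py` for how Kitaru does this.

```python
from typing import Callable, Optional

calls: list[str] = []  # records which steps really executed


def fetch(state: dict) -> str:
    calls.append("fetch")
    return f"docs about {state['topic']}"


def summarize(state: dict) -> str:
    calls.append("summarize")
    return f"[{state['model']}] summary of {state['fetch']}"


STEPS: list[tuple[str, Callable[[dict], str]]] = [
    ("fetch", fetch),
    ("summarize", summarize),
]


def run(inputs: dict, baseline: Optional[dict] = None,
        replay_from: Optional[str] = None) -> dict:
    """Execute STEPS in order and record each output as a checkpoint.

    Steps before `replay_from` reuse the baseline's recorded outputs.
    """
    if replay_from is not None and replay_from not in [n for n, _ in STEPS]:
        raise ValueError(f"unknown checkpoint: {replay_from}")
    reuse = baseline is not None and replay_from is not None
    state, record = dict(inputs), {}
    for name, fn in STEPS:
        if name == replay_from:
            reuse = False  # from here on, execute for real
        output = baseline[name] if reuse else fn(state)
        record[name] = output
        state[name] = output
    return record


def diff(a: dict, b: dict) -> dict:
    """Return the checkpoints whose outputs differ between two runs."""
    return {name: (a[name], b[name]) for name in a if a[name] != b[name]}


baseline = run({"topic": "durable agents", "model": "model-a"})
replayed = run({"topic": "durable agents", "model": "model-b"},
               baseline=baseline, replay_from="summarize")
print(diff(baseline, replayed))
```

In this toy, `fetch` executes only once: the replay reuses its recorded output and re-executes `summarize` with the overridden model, so the diff contains only the `summarize` checkpoint.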

### LLMs and agent integrations

| Example                                                                 | Demonstrates                                                                                                                                                                                             | Related docs                                                                                           |
| ----------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------ |
| `features/llm/flow_with_llm.py`                                         | `kitaru.llm()` prompt-response tracking with usage metadata                                                                                                                                              | [Tracked LLM Calls](/kitaru/guides/llm-calls.md)                                                       |
| `integrations/pydantic_ai_agent/pydantic_ai_adapter.py`                 | Wrap a PydanticAI agent with granular Kitaru replay boundaries                                                                                                                                           | [PydanticAI Adapter](/kitaru/adapters/pydantic-ai.md)                                                  |
| `integrations/pydantic_ai_agent/pydantic_ai_streaming.py`               | Watch best-effort `pydantic_ai.stream.*` live events while `.wait()` returns the durable final answer                                                                                                    | [PydanticAI Adapter](/kitaru/adapters/pydantic-ai.md#streaming)                                        |
| `integrations/pydantic_ai_agent/pydantic_ai_sandbox_toolset.py`         | Let a PydanticAI model call `run_sandbox_command`; the dashboard shows `run_sandbox_command_tool` before the final answer checkpoint (`OPENAI_API_KEY` and one sandbox on your current stack required)   | [PydanticAI Adapter](/kitaru/adapters/pydantic-ai.md#sandbox-command-toolset)                          |
| `integrations/openai_agents_agent/openai_agents_adapter.py`             | Wrap an OpenAI Agents SDK agent with call-level or runner-call durability in a real API-backed support flow                                                                                              | [OpenAI Agents Adapter](/kitaru/adapters/openai-agents.md)                                             |
| `integrations/openai_agents_agent/openai_agents_sandbox_tool.py`        | Let an OpenAI agent call `kitaru_sandbox_command`, which runs a command through your current stack's sandbox and returns compact JSON                                                                    | [OpenAI Agents sandbox tool](/kitaru/adapters/openai-agents.md#sandbox-command-tool)                   |
| `integrations/openai_agents_agent/openai_agents_streaming.py`           | Watch best-effort `openai_agents.stream.*` live events while `.wait()` returns the durable `OpenAIRunResult`                                                                                             | [OpenAI Agents Adapter](/kitaru/adapters/openai-agents.md#streaming-with-kitaru-durability)            |
| `integrations/claude_agent_sdk_agent/claude_agent_sdk_adapter.py`       | Wrap one Claude Agent SDK invocation as one Kitaru checkpoint, with final text, session ID, usage/cost, and audit artifacts (`ANTHROPIC_API_KEY` or Claude SDK provider credentials required)            | [Claude Agent SDK Adapter](/kitaru/adapters/claude-agent-sdk.md)                                       |
| `integrations/claude_agent_sdk_agent/claude_agent_sdk_streaming.py`     | Watch best-effort `claude_agent_sdk.stream.*` live events while `.wait()` returns the durable `ClaudeRunResult` (`ANTHROPIC_API_KEY` or Claude SDK provider credentials required)                        | [Claude Agent SDK Adapter](/kitaru/adapters/claude-agent-sdk.md#live-streaming-with-kitaru-durability) |
| `integrations/gemini_interactions_agent/gemini_interactions_adapter.py` | Wrap one Gemini Interactions API response as one Kitaru checkpoint, with no-network previews, streaming mode, and an Antigravity managed-agent path                                                      | [Gemini Interactions Adapter](/kitaru/adapters/gemini-interactions.md)                                 |
| `integrations/google_adk_agent/google_adk_adapter.py`                   | Experimental Google ADK direct runner-call result capture, plus explicit ADK model/tool wrappers in a deterministic local run. Use an isolated no-dev `google-adk` environment.                          | [Google ADK Adapter](/kitaru/adapters/google-adk.md)                                                   |
| `integrations/google_adk_agent/google_adk_workflow.py`                  | Persisted Kitaru flow using ADK calls mode, explicit `KitaruADKModel` / `KitaruADKTool`, deterministic tool-confirmation resume, and structured output. Use an isolated no-dev `google-adk` environment. | [Google ADK Adapter](/kitaru/adapters/google-adk.md)                                                   |
| `integrations/langgraph_agent/langgraph_adapter.py`                     | Local `graph_call` interrupt/resume demo, plus OpenAI-backed `calls` mode with LangChain model/tool checkpoints and deterministic local ticket tools                                                     | [LangGraph Adapter](/kitaru/adapters/langgraph.md)                                                     |
| `integrations/langgraph_agent/langgraph_streaming.py`                   | Watch best-effort `langgraph.stream.*` live events from a local graph-call stream while `.wait()` returns the durable `LangGraphRunResult`                                                               | [LangGraph Adapter](/kitaru/adapters/langgraph.md#graph-call-streaming)                                |
| `end_to_end/coding_agent/agent.py` | A tool-using coding agent whose LLM calls and tool decisions are visible as durable execution state | [Tracked LLM Calls](/kitaru/guides/llm-calls.md) |
| `end_to_end/news_scout/scout.py` | PydanticAI news monitor with per-model/per-tool checkpoints, explicit run inputs, and remote-secret image config | [Examples index](/kitaru/getting-started/examples.md) |
| `end_to_end/openai_research_bot/research_bot.py` | Multi-agent OpenAI research bot with planner/writer runner checkpoints, submitted search fan-out, and published report artifacts | [Research bot section](/kitaru/adapters/openai-agents.md#end-to-end-research-bot-example) |
| `end_to_end/compliance_review/README.md` | Four-stage Claude Agent SDK audit: checkpointed turns, partial replay, and durable wait/resume conversation | [Replay and Overrides](/kitaru/guides/replay-and-overrides.md) |
| `features/mcp/mcp_query_tools.py` | Query executions and data through the Kitaru MCP server | [MCP Server](/kitaru/agent-native/mcp-server.md) |

{% hint style="info" %}
The LLM and most adapter examples require additional dependencies and provider API keys. The Gemini Interactions example has `--help` and `--dry-run` paths that require no credentials or network. The Google ADK examples have local no-provider paths, but they must run in an isolated no-dev `google-adk` environment while local/dev extras remain intentionally blocked. The OpenAI Agents sandbox-tool example also needs your current stack to have exactly one sandbox component. The LangGraph `graph_call` strategy is deterministic and local; the LangGraph `calls` strategy requires `langgraph-openai` and `OPENAI_API_KEY`. Check each example's README before running a real model-backed example.
{% endhint %}
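
A quick preflight check can catch a missing API key before a model-backed run starts. The sketch below is an assumption-laden illustration, not part of Kitaru: `REQUIRED_ENV` and `missing_env` are hypothetical names, the mapping covers only requirements stated on this page, and it does not model alternatives such as Claude SDK provider credentials.

```python
import os
from typing import Mapping, Optional

# Requirements as stated on this page; extend after checking each README.
REQUIRED_ENV: dict[str, list[str]] = {
    "integrations/pydantic_ai_agent/pydantic_ai_sandbox_toolset.py": ["OPENAI_API_KEY"],
    "integrations/claude_agent_sdk_agent/claude_agent_sdk_adapter.py": ["ANTHROPIC_API_KEY"],
    "integrations/claude_agent_sdk_agent/claude_agent_sdk_streaming.py": ["ANTHROPIC_API_KEY"],
}


def missing_env(example: str, env: Optional[Mapping[str, str]] = None) -> list[str]:
    """Return the required variables that are unset or empty for an example."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV.get(example, []) if not env.get(name)]


if __name__ == "__main__":
    for example in REQUIRED_ENV:
        gaps = missing_env(example)
        status = "ok" if not gaps else "missing " + ", ".join(gaps)
        print(f"{example}: {status}")
```

Examples that are not listed in the mapping report no missing variables, so treat an empty result as "nothing known to be missing" rather than a guarantee that the example will run.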

## If you'd rather build up primitive-by-primitive first

Agent Harness Platform is the recommended starting point for most readers — it's structured as a tour and each stage's commit message points at the docs page that explains the primitive being introduced. If you'd rather see each primitive in isolation before reading them woven together, follow this path:

1. [Quickstart](/kitaru/getting-started/quickstart.md) — `@flow` + `@checkpoint` in 6 lines.
2. `features/basic_flow/first_working_flow.py` — the same idea as a runnable file.
3. `features/basic_flow/flow_with_logging.py` — [Logging](/kitaru/core-concepts/logging.md).
4. `features/checkpoint_streaming/checkpoint_streaming.py` — [Checkpoint Live Events](/kitaru/guides/checkpoint-streaming.md).
5. `features/basic_flow/flow_with_artifacts.py` — [Artifacts](/kitaru/guides/artifacts.md).
6. `features/execution_management/wait_and_resume.py` — [Wait, Input, and Resume](/kitaru/guides/wait-and-resume.md).
7. `features/replay/replay_with_overrides.py` — [Replay and Overrides](/kitaru/guides/replay-and-overrides.md).
8. `features/llm/flow_with_llm.py` — [Tracked LLM Calls](/kitaru/guides/llm-calls.md).
9. `features/sandbox/active_stack_sandbox_command.py` — [Stacks](/kitaru/agent-runtime-stacks/stacks.md#use-the-active-stack-sandbox-from-python). This example runs the sandbox command inside a tracked flow checkpoint. It needs your current stack to have one sandbox; `uv run kitaru stack create sandbox-demo` creates a local one.
10. `integrations/pydantic_ai_agent/pydantic_ai_adapter.py` — [PydanticAI Adapter](/kitaru/adapters/pydantic-ai.md).
11. `integrations/pydantic_ai_agent/pydantic_ai_streaming.py` — [PydanticAI streaming](/kitaru/adapters/pydantic-ai.md#streaming).
12. `integrations/pydantic_ai_agent/pydantic_ai_sandbox_toolset.py` — [PydanticAI sandbox toolset](/kitaru/adapters/pydantic-ai.md#sandbox-command-toolset).
13. `integrations/openai_agents_agent/openai_agents_adapter.py` — [OpenAI Agents Adapter](/kitaru/adapters/openai-agents.md).
14. `integrations/openai_agents_agent/openai_agents_sandbox_tool.py` — [OpenAI Agents sandbox tool](/kitaru/adapters/openai-agents.md#sandbox-command-tool).
15. `integrations/openai_agents_agent/openai_agents_streaming.py` — [OpenAI Agents streaming](/kitaru/adapters/openai-agents.md#streaming-with-kitaru-durability).
16. `integrations/claude_agent_sdk_agent/claude_agent_sdk_adapter.py` — [Claude Agent SDK Adapter](/kitaru/adapters/claude-agent-sdk.md).
17. `integrations/claude_agent_sdk_agent/claude_agent_sdk_streaming.py` — [Claude Agent SDK streaming](/kitaru/adapters/claude-agent-sdk.md#live-streaming-with-kitaru-durability).
18. `integrations/gemini_interactions_agent/gemini_interactions_adapter.py` — [Gemini Interactions Adapter](/kitaru/adapters/gemini-interactions.md).
19. `integrations/google_adk_agent/google_adk_adapter.py` — [Google ADK Adapter](/kitaru/adapters/google-adk.md). Run it with isolated `--no-dev --extra google-adk`.
20. `integrations/google_adk_agent/google_adk_workflow.py` — [Google ADK Adapter](/kitaru/adapters/google-adk.md). Persisted calls-mode workflow with explicit model/tool checkpoints.
21. `integrations/langgraph_agent/langgraph_adapter.py` — [LangGraph Adapter](/kitaru/adapters/langgraph.md).
22. `integrations/langgraph_agent/langgraph_streaming.py` — [LangGraph streaming](/kitaru/adapters/langgraph.md#graph-call-streaming).
23. `end_to_end/openai_research_bot/research_bot.py` — [Research bot](/kitaru/adapters/openai-agents.md#end-to-end-research-bot-example).
24. `features/mcp/mcp_query_tools.py` — [MCP Server](/kitaru/agent-native/mcp-server.md).
25. [**Agents guide**](https://docs.zenml.io/user-guides/agents-guide) — the same primitives, woven into one runnable agent harness platform.

