For the complete documentation index, see llms.txt. This page is also available as Markdown.

Overview

The mental model behind Kitaru's durable execution primitives.

Kitaru sits in the runtime layer of your agent stack. Underneath it are the harness your team picked (Pydantic AI, LangGraph, Claude Agent SDK, raw Python) and the model(s) that harness drives. On top sits the platform your org already runs (auth, observability, policy, UI). Kitaru's job is to make the run durable — checkpoints, replay, resume, wait states, versioned deployments — without forcing a graph DSL or a rewrite of your control flow.

Start with Harness, Runtime, Platform if you want the big-picture split, or How It Works for the three-planes model (control / orchestration / execution) and what-runs-where in local dev vs production.

Core ideas

Concept
What it is

Flow

The outer durable boundary around your workflow

Checkpoint

A unit of work inside a flow whose output is persisted

Execution

A single run of a flow, identified by a unique ID

Structured metadata

Key-value data you attach to executions and checkpoints with kitaru.log()

Runtime log storage

Where runtime logs are sent (configured separately from structured metadata)

Active stack

The default execution target used when no per-run stack=... override is passed

What you can use today

Kitaru's current release includes:

  • @flow — mark a function as a durable workflow

  • @checkpoint — mark a function as a persisted work unit

  • kitaru.log() — attach structured metadata to the current scope

  • kitaru.wait() — pause a flow until external input is supplied

  • kitaru.llm() — make tracked model calls with prompt/response capture

  • kitaru.connect() — connect to a Kitaru server

  • kitaru.configure() — set process-local runtime defaults

  • kitaru.save() / kitaru.load() — persist and load named artifacts in checkpoints

  • kitaru.list_stacks() / kitaru.current_stack() / kitaru.use_stack() — manage the default stack

  • KitaruClient — inspect executions, fetch logs, resolve waits, retry, replay, and browse artifacts

  • FlowHandle — interact with a running or finished execution

All of the primitives listed here ship today. Some capabilities are backend-dependent — for example, runtime log retrieval requires a server-backed connection — but they are part of the supported Kitaru surface.

Explore the concepts

Last updated

Was this helpful?