> For the complete documentation index, see [llms.txt](https://docs.zenml.io/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.zenml.io/kitaru/adapters/gemini-interactions.md).

# Gemini Interactions

This adapter makes Gemini Interactions API turns replayable inside a Kitaru flow: each stable Gemini interaction response becomes one durable checkpoint, so you can replay the surrounding flow without re-running Gemini for turns that already finished. Use it when a Gemini call is one step in a larger durable workflow.

{% hint style="info" %}
**Using Google ADK instead?** Use the [Google ADK adapter](/kitaru/adapters/google-adk.md) if your application runs a `google.adk` agent or runner. This page is for direct Gemini Interactions API calls, including Antigravity managed-agent runs.
{% endhint %}

The [Google Gemini Interactions API](https://ai.google.dev/) is a hosted interaction runtime: you send a request to Gemini or a Google-managed agent, Google runs the interaction, and you receive a response. Kitaru does **not** replace that runtime. It wraps an outer durable boundary around it:

```
one stable Gemini interaction response = one Kitaru checkpoint
```

A Gemini checkpoint behaves like any other checkpoint in a flow. Consider:

```
collect input → ask Gemini to analyze it → write report → wait for approval → publish
```

If Gemini finishes the analysis and the later `write report` checkpoint fails, Kitaru can replay the flow and reuse the completed Gemini interaction result instead of calling Google again. You keep the interaction ID, output text, status, usage when available, safe step summaries, and Kitaru artifacts that help you inspect what happened.

The important word is **stable**. The adapter only saves a successful checkpoint when Gemini reaches a durable status:

* `completed`: Gemini produced a final response.
* `requires_action`: Gemini reached a handoff point and needs your code, a human, or another system to do something before the interaction can continue.

In-progress background states are not saved as successful checkpoints. Poll them again by interaction ID instead of treating them as finished work.

## The mental model

Think of Gemini Interactions as the **hosted worker** and Kitaru as the **outer workflow recorder**.

Gemini still runs the inside of the interaction:

```
Gemini interaction
  ├─ Gemini model or Google-managed agent planning
  ├─ interaction steps
  ├─ built-in tools, web, code execution, or hosted MCP work
  ├─ Antigravity sandbox files and environment reuse
  └─ completed / requires_action response
```

Kitaru records the outside:

```
@kitaru.flow
  └─ gemini_summary_gemini_interaction checkpoint
      └─ one call to client.interactions.create(...) or .get(...)
```

So Kitaru can honestly say:

> “This Gemini interaction reached a stable response. On replay, I can return that saved boundary result without asking Google to run the same interaction again.”

Kitaru cannot honestly say:

> “I can rewind to Gemini's third internal step, restore Google's hosted sandbox, and replay only the remaining tool work.”

A concrete failure story:

```
1. Antigravity inspects a repository in a Google-managed environment.
2. Gemini returns a completed interaction with a summary.
3. Kitaru stores that result as a checkpoint.
4. A later Kitaru checkpoint fails while writing your own report.
5. You replay the flow.
```

On replay, Kitaru can return the saved `GeminiInteractionResult` without starting another Antigravity job. But Kitaru did not snapshot Google's remote sandbox filesystem. If a file, report, or decision must be durable in your own workflow, return it from the interaction or write it in a later Kitaru-owned checkpoint.

## What you get

The adapter gives existing Gemini Interactions users:

* one durable Kitaru checkpoint around each stable Gemini interaction response
* replay-skip for completed Gemini turns inside larger Kitaru flows
* a typed `GeminiInteractionResult` with status, interaction IDs, output text, model/agent, environment ID when reported, usage, estimated cost when Kitaru can estimate one automatically or a configured calculator returns one, timing, and warnings
* safe step summaries from `interaction.steps` so your flow can respond to handoff points without storing raw provider payloads by default. Step summaries keep metadata such as type, status, and call IDs; `text_preview` is only filled for clearly identified final assistant/model output.
* optional streaming methods for watching a long Gemini interaction while it is running, without changing the final checkpoint result Kitaru saves
* a clear `requires_action` path for function results and human approval gates
* explicit `cache_identity` support when the same runner/request can point at different Google projects, regions, credential aliases, or client setups
* redacted request manifests, output, usage, event-log, and run-summary artifacts by default
* opt-in raw provider payload capture when you need deeper debugging
* a convenience Antigravity request preset used by the adapter example

Google's Interactions API is still a preview/Beta-style surface. Treat schemas, agent names, and hosted-agent behavior as more likely to change than stable Gemini text-generation APIs.

## Usage and cost statistics

When `save_usage=True` (the default), the Gemini adapter records two related things after a stable provider response:

```
Gemini returns usage metadata
  -> result.usage keeps the raw token metadata
  -> Kitaru logs one llm_usage_v1 statistics record for the interaction
```

That canonical statistics record uses Gemini token fields such as `prompt_token_count` / `promptTokenCount`, `candidates_token_count` / `candidatesTokenCount`, `cached_content_token_count` / `cachedContentTokenCount`, and `thoughts_token_count` / `thoughtsTokenCount` when Google reports them. Kitaru normally rolls those token counts into execution-level LLM usage summary fields and flat statistics keys when the execution finishes. `FlowHandle.wait()` and `FlowHandle.get()` can populate missing summaries for older executions or executions where the finish-time summary was not written.

The canonical usage record's `status` describes the provider call, not the next thing your application must do. So a Gemini interaction that returns `GeminiInteractionResult.status == "requires_action"` still writes a canonical LLM usage record with `status="completed"`: Google returned a stable response, and that response is asking your code to send a follow-up turn.

Google reports token usage metadata for these calls, but Kitaru does not receive a provider-reported per-call dollar cost from Gemini Interactions. By default, Kitaru estimates with `genai-prices` when the interaction names a Gemini model and includes priceable input/output/cache-read token counts. That value is stored as `estimated_cost_usd`, not `actual_cost_usd`.

If Kitaru cannot identify the model, cannot price the model, or only has token categories that `genai-prices` does not price, the record keeps the tokens and leaves the cost empty. Kitaru does not add `thoughts_token_count` / `thoughtsTokenCount` to output tokens. It also does not subtract thought tokens from Gemini candidate/output token counts unless Google's usage contract proves those counts include thoughts.

If you know the pricing you want to use, pass `cost_calculator=` to `KitaruGeminiInteractionsRunner`. Kitaru calls it with a `GeminiUsageSummary` containing the requested/resolved model fields and normalized token counts, then records the returned non-negative number as `estimated_cost_usd`. Return `None` when your calculator cannot estimate the call; Kitaru leaves the cost empty and records the usage as a no-estimate token record.

```python
from kitaru.adapters.gemini import GeminiUsageSummary, KitaruGeminiInteractionsRunner


def calculate_gemini_cost(usage: GeminiUsageSummary) -> float | None:
    if usage.input_tokens is None or usage.output_tokens is None:
        return None
    # Replace these example rates with your own contract or billing source.
    return (usage.input_tokens / 1_000_000 * 0.35) + (
        usage.output_tokens / 1_000_000 * 1.05
    )


runner = KitaruGeminiInteractionsRunner(
    name="gemini_summary",
    cost_calculator=calculate_gemini_cost,
)
```

User calculators take priority over the built-in estimate. If you disable automatic estimates with `KITARU_LLM_ESTIMATED_COSTS=off`, or if pricing is not possible, `records_without_cost_count` can still increase. It means “this provider call had no dollar-cost value attached,” not “usage tracking failed.” Token counts can still be present and aggregated.

Set `save_usage=False` when you do not want Gemini usage persisted. That disables both the raw usage artifact and the canonical `llm_usage_v1` metadata record for that interaction, even if Gemini-specific events are still enabled.

## Install

Add the Gemini extra. Include `local` if you want the local Kitaru server and dashboard:

```bash
uv add "kitaru[gemini,local]"
```

Initialize the project once:

```bash
kitaru init
kitaru login        # local server; add a URL for a deployed one
kitaru status
```

For real Google calls, authenticate one of two ways.

Most users set a Gemini API key (the AI Studio / Developer API backend):

```bash
export GEMINI_API_KEY='<your-gemini-api-key>'
# GOOGLE_API_KEY is also accepted by the example wrapper.
```

If your organization blocks raw API keys, use Application Default Credentials (ADC) through Vertex AI instead. ADC is picked up automatically after you log in with `gcloud`, so you set the backend, project, and region rather than a key:

```bash
gcloud auth application-default login
export GOOGLE_GENAI_USE_VERTEXAI=true
export GOOGLE_CLOUD_PROJECT='<your-gcp-project-id>'
export GOOGLE_CLOUD_LOCATION=global
```

On Vertex AI the Interactions API currently serves agent interactions (`agent=`, such as Antigravity), not raw model interactions, and only in the `global` location. Use `model=` interactions with an API key, and `agent=` interactions on Vertex. Antigravity defaults to `background=True` because some Vertex / Chiliagon managed-agent paths require it. If a preview endpoint explicitly rejects background mode, pass `background=False` as a foreground escape hatch.

{% hint style="info" %}
Migrating an existing Gemini Interactions or Antigravity managed-agent project? The [`zenml-io/kitaru-skills`](https://github.com/zenml-io/kitaru-skills) package includes `/kitaru:kitaru-gemini-interactions-migration` for wrapping stable interaction responses and checking polling, `requires_action`, function-result, and Google-owned-internals boundaries. See [Agent Skills](/kitaru/agent-native/claude-code-skill.md).
{% endhint %}

## Minimal flow

Use `GeminiInteractionRequest.start(...)` for a fresh interaction. Exactly one of `model=` or `agent=` must be set.

```python
from kitaru import flow
from kitaru.adapters.gemini import (
    GeminiInteractionRequest,
    GeminiInteractionResult,
    KitaruGeminiInteractionsRunner,
)

runner = KitaruGeminiInteractionsRunner(
    name="gemini_summary",
    # Set this when the same request/runner name may target different Google
    # projects, regions, credential aliases, or client configurations.
    cache_identity="my-project/us-central1",
)

@flow
def summarize(topic: str) -> GeminiInteractionResult:
    request = GeminiInteractionRequest.start(
        f"Explain {topic} in three plain sentences.",
        model="gemini-3.5-flash",
        metadata={"example": "minimal_gemini_interaction"},
    )
    return runner.run_sync(request)

result = summarize.run("Kitaru checkpoints").wait()
print(result.output_text)
print(result.interaction_id)
```

The checkpoint name is derived from the runner name. In this example, the adapter-created checkpoint is named:

```
gemini_summary_gemini_interaction
```

On replay, if that checkpoint is already complete and cache/replay rules allow reuse, Kitaru serves the saved `GeminiInteractionResult`. Gemini is not called again for that completed interaction boundary.

## How a run works, step by step

When `runner.run_sync(GeminiInteractionRequest.start(...))` executes inside a Kitaru flow, this is the concrete sequence:

```
1. Kitaru opens one synthetic checkpoint.
2. The adapter builds a Gemini Interactions API request.
3. The adapter calls client.interactions.create(...).
4. Gemini runs the hosted interaction.
5. The adapter waits for a stable status: completed or requires_action.
6. It extracts output text, IDs, status, usage, environment, and step summaries.
7. It saves configured artifacts: manifest, output, usage, event log, and summary.
8. It returns GeminiInteractionResult as the checkpoint output.
```

For poll requests, step 3 is `client.interactions.get(...)` instead. That matters for background jobs: polling an existing interaction checks the job you already started; it does not create a duplicate remote job.

## Streaming

Use streaming when you want to watch a Gemini interaction while Google is still working. The replay promise stays the same:

```
streaming changes how you watch the run
streaming does not change what Kitaru saves for replay
```

A foreground model streamed run looks like this:

```
1. Your flow calls runner.run_stream_sync(request) or awaits runner.run_stream(request).
2. Kitaru opens the same one Gemini interaction checkpoint.
3. The adapter calls Gemini with interactions.create(..., stream=True).
4. Live events are published while Gemini emits stream updates.
5. Kitaru reconstructs the final interaction result from the stream.
6. Only completed or requires_action returns normally.
7. Kitaru saves and returns the final GeminiInteractionResult.
```

A background managed-agent streamed run uses a slightly different provider route:

```
1. Kitaru calls interactions.create(..., background=True) once.
2. Gemini returns one interaction id.
3. Kitaru observes that same id with interactions.get(id, stream=True) when possible.
4. If that stream drops or ends before a stable result, Kitaru polls that same id.
5. Within that same live invocation, Kitaru never starts a second provider job after the interaction id is known.
```

So if the flow later replays from a completed checkpoint, the saved final result may come back immediately. Gemini is not called again, and fresh live stream events should not be expected on that checkpoint cache hit. Think of the stream like the window in a train: useful while the train is moving, but the ticket Kitaru saves is still the final arrival record.

### Synchronous streaming

```python
request = GeminiInteractionRequest.start(
    "Explain why durable checkpoints help long-running AI workflows.",
    model="gemini-3.5-flash",
)
result = runner.run_stream_sync(request)
print(result.output_text)
print(result.metadata.get("stream"))
```

### Async streaming

```python
result = await runner.run_stream(request)
```

Use `run_stream_sync(...)` from normal synchronous flow code. If you are already inside an event loop, use `await runner.run_stream(...)`; the sync method refuses to run inside an active event loop so it does not hide an `asyncio` deadlock.

### Live event constants

The streaming live-event names are exported from `kitaru.adapters.gemini`:

```python
from kitaru.adapters.gemini import (
    GEMINI_STREAM_COMPLETED,
    GEMINI_STREAM_EVENT,
    GEMINI_STREAM_EVENT_KINDS,
    GEMINI_STREAM_FAILED,
    GEMINI_STREAM_STARTED,
    GEMINI_STREAM_TERMINAL_EVENT_KINDS,
)
```

Their values are:

| Constant                  | Event name                             | Meaning                                                                                                                                                                               |
| ------------------------- | -------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `GEMINI_STREAM_STARTED`   | `gemini_interactions.stream.started`   | Kitaru started draining a Gemini stream.                                                                                                                                              |
| `GEMINI_STREAM_EVENT`     | `gemini_interactions.stream.event`     | A safe normalized stream update, such as interaction status, step start/stop, text delta metadata, tool-argument metadata, thought metadata, media metadata, provider error, or done. |
| `GEMINI_STREAM_COMPLETED` | `gemini_interactions.stream.completed` | The adapter finalized a stable provider result and is returning it to the surrounding checkpoint. This terminal event is flushed.                                                     |
| `GEMINI_STREAM_FAILED`    | `gemini_interactions.stream.failed`    | The provider stream, finalization, artifact capture, strict event persistence, or cancellation failed. This terminal event is flushed.                                                |

`GEMINI_STREAM_EVENT_KINDS` contains all four names. `GEMINI_STREAM_TERMINAL_EVENT_KINDS` contains the completed and failed names.

### Privacy defaults

Live stream events are best-effort observability. If event publishing fails, the Gemini provider result is still allowed to finish. By default, live events avoid raw content. They can include safe metadata such as:

* adapter name, runner name, public surface, and `scope="interaction"`
* event type, event ID, interaction ID, status, and category
* step index, step ID, step type, tool name, and call ID
* whether usage was present

By default, live events do **not** include raw prompts, full assistant content, tool arguments, function results, thought-summary text, raw SDK objects, image/audio/document/video payloads, sandbox files, or secrets.

If you deliberately want short streamed model-output text deltas in live events, opt in on the capture policy:

```python
from kitaru.adapters.gemini import GeminiInteractionCapturePolicy

runner = KitaruGeminiInteractionsRunner(
    name="gemini_summary",
    capture=GeminiInteractionCapturePolicy(include_stream_text_deltas=True),
)
```

The text deltas are clipped for display. Tool arguments and thought content still stay out of the default live payloads. This setting also changes the streaming cache identity, so Kitaru does not silently mix checkpoints created with different stream-display policies.

The runnable `examples/integrations/gemini_interactions_agent/` script is more visual by design: `--stream` enables clipped text-delta display by default so you can confirm streaming in a terminal. Pass `--hide-text-deltas` there if you want the example watcher to show event labels only.

### Stable result semantics

The public return value is still `GeminiInteractionResult`. Streaming metadata is stored under `result.metadata["stream"]` and includes the event count, counts by provider event type, last event ID when reported, final status, truncation flags, accumulated step count, the reconstruction policy, which observation route was used, and whether same-id fallback polling was needed. These are inspection details, not new top-level result fields.

Kitaru only completes normally for:

* `completed`
* `requires_action`

A streamed `requires_action` result is the same handoff point as a non-streamed one: save the checkpoint, run your local code or wait for a human, then send a matching `GeminiInteractionRequest.function_result(...)` turn.

### Poll streaming

`GeminiInteractionRequest.poll(...)` is also streamable when the installed `google-genai` SDK exposes `interactions.get(..., stream=True)`:

```python
poll_request = GeminiInteractionRequest.poll(
    interaction_id="interaction_123",
    timeout_s=30,
)
result = runner.run_stream_sync(poll_request)
```

That streamed poll still checks the existing remote interaction. It does not create a duplicate Gemini job. Background streaming also depends on streamed `get(...)` when `timeout_s` is set, because Kitaru creates the background job without `stream=True` and then watches the returned id. If no timeout is set, Kitaru does not open an unbounded background stream: it creates the job, checks the same id once, and raises with a `GeminiInteractionRequest.poll(...)` continuation instruction if the status is still not stable. If the installed SDK does not support streamed `get(...)`, Kitaru raises a feature-availability error when you call the stream method; importing `kitaru.adapters.gemini` still works.

### Antigravity streaming caveats

`GeminiInteractionRequest.antigravity(...)` can be used with `run_stream_sync(...)` on the same Interactions streaming surface as other agent interactions. It is still preview-shaped and still one coarse checkpoint: Google owns the managed agent loop, sandbox, tool execution, environment reuse, and internal event schema.

The adapter's Antigravity preset now keeps `store=True` and defaults to `background=True`. With `timeout_s` set, Kitaru creates one background interaction, observes that same id with streaming when the backend supports it, and falls back to polling that same id if the stream is unavailable or incomplete. Without `timeout_s`, Kitaru creates/checks the same id once and asks you to continue with `GeminiInteractionRequest.poll(...)` if it is still running. Google's public Antigravity surfaces have not been consistent about background behavior during preview, so pass `background=False` only when a specific endpoint explicitly rejects background mode. Keep tasks non-destructive unless you intentionally want the managed agent to edit or execute things.

## Requests and continuation

### Start a new interaction

Use `GeminiInteractionRequest.start(...)` when you want a fresh Gemini turn:

```python
request = GeminiInteractionRequest.start(
    "Draft a customer-support reply for this incident.",
    model="gemini-3.5-flash",
    metadata={"ticket_type": "support"},
)
result = runner.run_sync(request)
```

### Continue an interaction

Google's Interactions API can continue server-side history with `previous_interaction_id`. When you continue an interaction, re-specify the model or agent and any tools, system instruction, generation config, or response format that should apply to the new turn.

```python
follow_up = GeminiInteractionRequest.resume(
    "Now turn that reply into a five-item checklist.",
    previous_interaction_id=result.interaction_id,
    model="gemini-3.5-flash",
)
follow_up_result = runner.run_sync(follow_up)
```

Set `store=True` when you need continuation. If `store=False`, Google should not be expected to keep enough server-side state for a later `previous_interaction_id` turn.

Continuation and Kitaru replay are related but different:

| Thing                     | What it means                                                                                |
| ------------------------- | -------------------------------------------------------------------------------------------- |
| Gemini `interaction_id`   | Google's handle for continuing or polling an interaction.                                    |
| Kitaru checkpoint         | Kitaru's durable workflow boundary for skipping completed workflow work.                     |
| Gemini hosted environment | Google's runtime/sandbox state. Kitaru records IDs and summaries, not a filesystem snapshot. |
| Kitaru artifact           | Data Kitaru saved around the boundary for audit/debugging.                                   |

## Antigravity managed-agent runs

`GeminiInteractionRequest.antigravity(...)` is the convenience path for the Google Antigravity managed-agent preview. It uses the adapter's centralized Antigravity agent ID, defaults `environment="remote"`, forces `store=True`, defaults `background=True`, and adds non-sensitive preview metadata.

```python
request = GeminiInteractionRequest.antigravity(
    "Inspect this repository and summarize the main test strategy. Do not edit files.",
    timeout_s=300,
)
result = runner.run_sync(request)
```

This is still one coarse interaction checkpoint. Kitaru records the stable Antigravity response and capture envelope. Google still owns the hosted agent loop, sandbox, web/code/tool execution, and environment reuse.

The runnable adapter example includes Antigravity support so you can exercise that environment path explicitly:

```bash
uv run python examples/integrations/gemini_interactions_agent/gemini_interactions_adapter.py --mode antigravity
```

Use Antigravity mode intentionally. It may be slower, costlier, and more preview-shaped than a simple model interaction. Set `timeout_s` when you want Kitaru to wait for a background Antigravity job to finish or reach `requires_action`. Without `timeout_s`, Kitaru creates/checks the background job without waiting forever; if the status is still in progress, it raises with a `GeminiInteractionRequest.poll(...)` continuation instruction for the same interaction id. If a preview endpoint explicitly rejects background mode, pass `background=False` for that endpoint only.

## `requires_action` and function results

A Gemini interaction can return `status="requires_action"`. That is not a final answer; it is a durable handoff point.

A typical flow looks like this:

```
1. Gemini asks for a function result or approval.
2. Kitaru saves the requires_action response as a completed checkpoint boundary.
3. Your flow performs the action, or pauses with kitaru.wait() for a human.
4. Your flow sends a follow-up function_result interaction.
```

The result's `steps` list summarizes the interaction steps so your flow can find a function call ID or action cue without putting raw prompt/tool payloads into top-level metadata.

```python
if result.status == "requires_action":
    call = next(step for step in result.steps if step.call_id)
    answer = GeminiInteractionRequest.function_result(
        previous_interaction_id=result.interaction_id,
        function_call_id=call.call_id,
        function_name=call.tool_name,
        function_result=[
            {"type": "text", "text": '{"approved": true}'},
        ],
        model="gemini-3.5-flash",
    )
    result = runner.run_sync(answer)
```

Keep human-in-the-loop waits at flow scope. For example, let the flow inspect the `requires_action` result, call `kitaru.wait()` if a person must decide, then send the later `function_result` interaction after the wait resumes. That keeps the pause visible to Kitaru instead of hiding it inside a provider-owned turn.

### Running caller-owned custom functions in Kitaru's sandbox

Kitaru also provides a helper for the narrow case where Gemini asks for a custom function and your application owns that function body. The helper does not run automatically. Your code must register the function names it is willing to run.

A concrete safe example is a function named `sandbox_python_version`:

```
Gemini interaction checkpoint
  -> Gemini returns status="requires_action"
  -> result.steps contains call_id="call_123" and tool_name="sandbox_python_version"

Kitaru-owned sandbox checkpoint
  -> your registry says sandbox_python_version means: run "python --version"
  -> Kitaru runs that command through kitaru.run_sandbox_command(...)
  -> non-zero exits are kept as result data instead of becoming Gemini-helper errors

Gemini interaction checkpoint
  -> your flow sends GeminiInteractionRequest.function_result(...)
  -> Gemini continues from the same interaction id
```

In code, the middle checkpoint looks like this. The checkpoint matters: `execute_gemini_sandbox_function_call(...)` runs a sandbox command, so Kitaru rejects direct flow-body use by default to avoid replay running the command a second time.

```python
import kitaru
from kitaru.adapters.gemini import (
    GeminiInteractionResult,
    GeminiSandboxFunctionExecution,
    GeminiSandboxFunctionSpec,
    execute_gemini_sandbox_function_call,
)

@kitaru.checkpoint
def run_requested_function(
    result: GeminiInteractionResult,
) -> GeminiSandboxFunctionExecution:
    return execute_gemini_sandbox_function_call(
        result,
        {
            "sandbox_python_version": GeminiSandboxFunctionSpec(
                function_name="sandbox_python_version",
                command="python --version",
            )
        },
    )
```

The returned `GeminiSandboxFunctionExecution` contains the matched call metadata, the local `SandboxCommandResult`, the content-block payload sent to Gemini, and the ready-to-send `GeminiInteractionRequest.function_result(...)`.

V1 intentionally does **not** parse model-supplied function arguments and only builds model-targeted continuations that use stored Gemini interaction history. It rejects provider-agent/Antigravity results, `agent=...` overrides, and `store=False` stateless continuations. Stateless function-result calls must send the full previous interaction history plus the function-result step, while this helper only has the stable `requires_action` summary. The helper matches by safe step metadata that Kitaru already exposes: step index, step type, call ID, and function name. Registered commands can be static, or they can be built from that safe call metadata. If you need model arguments later, that needs a separate opt-in design because those arguments live in raw provider payloads and may contain private user data.

The default function-result payload is a Gemini text content block containing a compact JSON object. That JSON object includes:

* `ok`
* `exit_code`
* `stdout` and `stderr`, clipped to a compact model-facing size by default
* stdout/stderr truncation flags, including whether the helper clipped the model-facing payload
* cleanup policy and cleanup success
* redaction flags and warnings when Kitaru changed model-facing stdout or stderr before sending it back to Gemini

It does not include environment variables, full stack IDs, sandbox IDs, session IDs, the command string, or a raw cleanup error string in the payload sent back to Gemini. The local `SandboxCommandResult` remains available to your own code for debugging.

Kitaru redacts the default model-facing stdout and stderr before building that payload. It removes exact values provided through `GeminiSandboxFunctionSpec.env`, common secret-like key/value patterns, and bearer or authorization tokens. This is best-effort protection, not a guarantee: a command can still print sensitive text in a shape no generic redactor recognizes. Design sandbox commands so they do not print secrets. If you provide a custom `result_payload_builder`, that builder owns its own redaction before returning anything that will be sent to Gemini.

This helper does not move Google-owned execution into Kitaru. Antigravity internals, built-in Gemini code execution, hosted MCP, web execution, and Google-owned tools still run wherever Google runs them. Kitaru can only checkpoint the custom function body that your application explicitly executes.

The runnable example includes a dry-run-first showcase:

```bash
uv run python examples/integrations/gemini_interactions_agent/gemini_interactions_adapter.py \
  --dry-run \
  --mode sandbox-function
```

## Polling background interactions

For background interactions, avoid accidentally starting duplicate remote jobs. If an interaction is already created and you need to check it later, use `GeminiInteractionRequest.poll(interaction_id=...)`. Poll requests call `client.interactions.get(...)`; they do not call `client.interactions.create(...)`.

```python
poll_request = GeminiInteractionRequest.poll(
    interaction_id="interaction_123",
    timeout_s=30,
)
result = runner.run_sync(poll_request)
```

If a background interaction has not reached `completed` or `requires_action`, the adapter raises `KitaruRuntimeError` instead of saving an unfinished response as a successful checkpoint. Continue polling the same `interaction_id`; do not retry by creating a fresh background request unless you intentionally want another remote job.

You can also poll with `runner.run_stream_sync(...)` or `await runner.run_stream(...)` when your installed Google SDK supports streamed `interactions.get(..., stream=True)`. That lets you watch the existing background interaction progress without creating a second remote job.

## Cache identity

The cache key uses the request, runner name, Kitaru strategy, installed `google-genai` SDK version, and optional `cache_identity`. Streaming calls also include a stream-specific cache surface plus the stream-display/reconstruction policy. Kitaru does not inspect live Google client internals such as project, region, or credentials.

That means this situation is risky without `cache_identity`:

```
same runner name + same request
  ├─ client A points at project/dev
  └─ client B points at project/prod
```

From Kitaru's view, those can look identical. If they should not share replay or cache behavior, give them different identities:

```python
dev_runner = KitaruGeminiInteractionsRunner(
    name="gemini_summary",
    cache_identity="dev-project/us-central1",
)
prod_runner = KitaruGeminiInteractionsRunner(
    name="gemini_summary",
    cache_identity="prod-project/us-central1",
)
```

Use `cache_identity` whenever project, region, credential alias, endpoint, or other client configuration changes the meaning of the same logical request. It must be a stable, non-secret string such as `"project/region"`; do not pass a live client object or anything whose `repr()` can change between processes.

## Result shape

`KitaruGeminiInteractionsRunner.run_sync(...)` returns a `GeminiInteractionResult` with fields such as:

* `status`
* `interaction_id`
* `previous_interaction_id`
* `output_text` when Gemini exposes text
* `model` or `agent`
* `environment_id` when Google reports one
* `steps`, a list of `GeminiInteractionStepSummary` records derived primarily from `interaction.steps`
* `usage` when reported by the SDK
* `estimated_cost_usd` when Kitaru can estimate one automatically or a configured calculator returns one
* `poll_count`, `duration_ms`, `sdk_version`, and non-sensitive `metadata`
* artifact names for captured request manifest, output, usage, event log, and run summary
* artifact names for raw input, raw interaction, and raw steps only when those captures are explicitly enabled
* `warnings` for best-effort capture or SDK-shape compatibility issues
* stream inspection metadata under `metadata["stream"]` when you used `run_stream(...)` or `run_stream_sync(...)`

The adapter treats `interaction.steps` as the primary response shape. If an older SDK exposes `outputs` instead, Kitaru can summarize those for compatibility and adds a warning. If the SDK omits `output_text`, Kitaru only derives fallback output from a clearly identified final assistant/model step. Prompt, tool, sandbox, or ambiguous timeline text is not merged into `output_text`.

Failed Gemini interactions raise an exception instead of returning a successful `GeminiInteractionResult`. If the failure happens inside a Kitaru checkpoint, the adapter records best-effort failure metadata before the exception propagates.

## Capture policy

By default, Kitaru saves the boundary data that is useful for replay inspection and audits:

* redacted request manifest
* output text
* usage when reported
* event log
* run summary
* safe step summaries on the returned result. Their `text_preview` field is disabled for prompt, tool, sandbox, and ambiguous timeline content by default.

It does **not** save raw prompts, raw interaction payloads, or raw step payloads unless you opt in. It also does not save raw stream transcripts; streaming live events are not saved as a durable transcript by this adapter version.

```python
from kitaru.adapters.gemini import GeminiInteractionCapturePolicy

runner = KitaruGeminiInteractionsRunner(
    name="gemini_summary",
    capture=GeminiInteractionCapturePolicy(
        # Raw prompt/provider payload capture is opt-in.
        save_input=False,
        save_raw_interaction=False,
        save_steps=False,
        # These remain enabled by default.
        save_output=True,
        save_usage=True,
        redact_request_manifest=True,
        # Live stream text deltas are hidden by default.
        include_stream_text_deltas=False,
    ),
)
```

Turn on raw capture only when you need it for debugging or audit review:

```python
runner = KitaruGeminiInteractionsRunner(
    name="debug_gemini_run",
    capture=GeminiInteractionCapturePolicy(
        save_input=True,
        save_raw_interaction=True,
        save_steps=True,
    ),
)
```

Treat raw provider payloads as conversation data. They may contain prompts, retrieved snippets, tool arguments, model output, or other sensitive material. The redacted request manifest removes common secret-like keys such as API keys, authorization headers, tokens, credentials, cookies, and passwords, but raw payload artifacts are not a secret store.

Capture failures are non-fatal by default because Google may already have succeeded by the time Kitaru tries to save artifacts. Retrying after a strict capture failure can duplicate provider-side work, so only enable `fail_on_artifact_capture_error=True` or `fail_on_event_persistence_error=True` when you understand that trade-off.

## Checkpoint strategy

The Gemini Interactions adapter currently supports one public strategy:

```python
KitaruGeminiInteractionsRunner(
    name="gemini_task",
    checkpoint_strategy="interaction",
)
```

`"interaction"` is also the default. It means one stable Gemini interaction response becomes one Kitaru checkpoint.

Strategies such as per-step, per-tool, web-call, code-execution, hosted-MCP, or Antigravity-internal checkpointing are not exposed because Kitaru does not own those call bodies. Google runs them inside its hosted interaction runtime. The adapter would be lying if it claimed it could replay those internals independently.

Caller-owned custom functions are supported through the explicit sandbox helper shown above: Gemini reaches `requires_action`, Kitaru runs a registered sandbox command, and a later interaction sends the matching function result. That only covers the function body Kitaru actually runs. It still does not make Google-owned managed-agent internals replayable.

## Recommended durability pattern

Put the Gemini runner call directly in the flow body so the adapter can create its own checkpoint around the interaction. Put side effects that must be durable in separate Kitaru checkpoints after Gemini returns.

```python
from pathlib import Path

import kitaru

@kitaru.checkpoint
def write_report(text: str, path: str) -> str:
    Path(path).write_text(text)
    return path

@kitaru.flow
def report_flow(topic: str) -> str:
    request = GeminiInteractionRequest.start(
        f"Write a short report about {topic}.",
        model="gemini-3.5-flash",
    )
    gemini_result = runner.run_sync(request)
    return write_report(gemini_result.output_text or "", "report.md")
```

That gives you a concrete sequence:

```
flow body asks Gemini
  -> adapter-created Gemini interaction checkpoint completes
  -> Kitaru-owned write_report checkpoint writes a durable file
```

If a later checkpoint fails, Kitaru can reuse the completed Gemini interaction result instead of calling Google again.

## Constraints and gotchas

* **Stable statuses only.** The adapter only saves successful checkpoints for `completed` and `requires_action`. Background work that is still running should be polled again by interaction ID.
* **Coarse durability.** One Gemini interaction response is the replay unit. Google-owned internal steps, hosted tools, hosted MCP work, web/code execution, and Antigravity sandbox mutations are not separate Kitaru checkpoints.
* **`requires_action` is a handoff point.** Use it to move work back into your flow: run local code, call `kitaru.wait()` for a human, then send a matching `function_result` interaction.
* **Use `cache_identity` for cross-client disambiguation.** If the same runner name and request can point at different projects, regions, credential aliases, endpoints, or client setups, set an explicit identity.
* **Raw provider capture is off by default.** Safe summaries and redacted manifests are captured by default; raw prompts/interactions/steps require an explicit opt-in. Streaming text deltas are also hidden by default and must be enabled with `include_stream_text_deltas=True`.
* **Streaming is observability, not replay state.** Stream events help you watch a live provider call. On a checkpoint cache hit, Kitaru may return the saved final result without replaying the live stream.
* **Antigravity environment support is adapter-example level.** The adapter gives you a preset for the Google-managed Antigravity environment path, but Google still owns the remote environment lifecycle and behavior.

## Runnable example

Run the educational integration example:

```bash
uv sync --extra local --extra gemini
uv run kitaru init
export GEMINI_API_KEY='<your-gemini-api-key>'
uv run python examples/integrations/gemini_interactions_agent/gemini_interactions_adapter.py
```

To inspect the example without making a Gemini API call:

```bash
uv run python examples/integrations/gemini_interactions_agent/gemini_interactions_adapter.py --help
uv run python examples/integrations/gemini_interactions_agent/gemini_interactions_adapter.py --dry-run --stream
uv run python examples/integrations/gemini_interactions_agent/gemini_interactions_adapter.py --dry-run --mode sandbox-function
```

The example supports:

* `--help` for smoke tests
* `--dry-run` for a no-network preview
* `--stream` to use `run_stream_sync(...)` for real calls and preview stream metadata in dry runs
* `--mode model` using `gemini-3.5-flash`
* `--mode sandbox-function` for caller-owned custom functions that run through Kitaru's active sandbox
* `--mode antigravity` as an explicit slower/costlier managed-agent demo

## Troubleshooting

### “Why did replay not resume inside Antigravity's internal work?”

Because Kitaru's boundary is the stable Gemini interaction response. Antigravity internal planning, sandbox files, hosted tools, and environment reuse happen inside Google's runtime. Kitaru can reuse the saved result of a completed interaction; it cannot restore an arbitrary midpoint inside Google's hosted agent loop.

### “Why did a background interaction raise instead of saving a checkpoint?”

It had not reached `completed` or `requires_action` yet. Poll the same `interaction_id` again with `GeminiInteractionRequest.poll(...)`. Do not create a fresh background interaction unless you deliberately want another Google job.

### “Why do I need `cache_identity`?”

Kitaru can see the request and runner name, but it does not inspect live Google client internals. If two clients use the same logical request but point at different projects, regions, credentials, or endpoints, `cache_identity` tells Kitaru those are different replay/cache worlds.

### “Where are the raw Gemini payloads?”

They are off by default. Enable `save_input=True`, `save_raw_interaction=True`, or `save_steps=True` in `GeminiInteractionCapturePolicy` when you deliberately want raw provider artifacts.

### “Can I checkpoint every Gemini internal step?”

Not in this adapter version. Kitaru only checkpoints work it can replay honestly. Gemini-hosted internal steps are observations from Kitaru's point of view, not Python call bodies Kitaru can rerun independently.

## Related guides

<table data-view="cards"><thead><tr><th></th><th></th><th data-hidden data-card-target data-type="content-ref"></th></tr></thead><tbody><tr><td><strong>Replay and overrides</strong></td><td>Re-run a flow with cached outputs for completed checkpoints</td><td><a href="/pages/ieoi9kJkRqHsTAPEUquq">/pages/ieoi9kJkRqHsTAPEUquq</a></td></tr><tr><td><strong>Wait, Input, and Resume</strong></td><td>How paused flows are resolved from Python, CLI, and MCP</td><td><a href="/pages/BUp6cWRuU8VUfknQKRto">/pages/BUp6cWRuU8VUfknQKRto</a></td></tr><tr><td><strong>Artifacts</strong></td><td>Save named outputs from Kitaru-owned checkpoints</td><td><a href="/pages/gZ9fn4QuMS1d9OUqDiuH">/pages/gZ9fn4QuMS1d9OUqDiuH</a></td></tr></tbody></table>


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.zenml.io/kitaru/adapters/gemini-interactions.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.