Rhone AI Gateway

Every call and run belongs to a session. Sessions are the continuity handle — they own context history, defaults, and lineage. Think of a session as a persistent conversation thread that survives across requests.

Calls vs Runs

A call is one model invocation. A run is an orchestrated execution over one or more calls, with support for tool execution, interrupts, and durable resume. Both are execution primitives, not continuity modes.

Blocks

The unified message format. Instead of provider-specific message shapes, VAI uses typed blocks: input, response, system, thinking, tool_call, and tool_result. One format across every provider.

Context Modes

Stateful (default) automatically maintains conversation history on the server. Stateless means caller-supplied full-context execution — not sessionless or non-persistent. Both work with calls and runs.

Data Layer

A derived query plane over your runtime data. Explicit query sources (session_summaries, run_summaries, call_summaries, browser_cleanup_jobs) with typed field catalogs, saved views, declared facets for metadata indexing, and structured exports — without exposing raw SQL.

Assessments & Feedback

Machine-generated quality scores (assessments) and user reactions (feedback signals) attached to runs, calls, and other canonical objects. Assessment profiles configure automatic scoring — zero-config by default, with profile overrides per request.

Documentation

Quickstart

API Reference

Provider Compatibility

SDKs & Libraries

Hosted Browser Operations

Safety & Guardrails

Data Layer

Evals & Quality

Core Concepts

Sessions

Calls vs Runs

Blocks

Context Modes

Data Layer

Assessments & Feedback