Documentation

Rhone AI Gateway is a session-first AI gateway built on the open VAI protocol. Route to any provider, maintain persistent context, and stream in real time.

Core Concepts

Sessions

Every call and run belongs to a session. Sessions are the continuity handle — they own context history, defaults, and lineage. Think of a session as a persistent conversation thread that survives across requests.

Calls vs Runs

A call is one model invocation. A run is an orchestrated execution over one or more calls, with support for tool execution, interrupts, and durable resume. Both are execution primitives, not continuity modes.
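
The relationship between a call and a run can be sketched as a loop. The example below is a toy illustration, not the gateway's implementation: `fake_model`, `TOOLS`, and `run` are hypothetical names, and the model is a stand-in that asks for a tool once and then answers. It shows one run spanning two calls with a tool execution in between; interrupts and durable resume are not modeled.

```python
# Toy sketch only: fake_model, TOOLS, and run are hypothetical names,
# not part of the Rhone AI Gateway or the VAI protocol.

def fake_model(context):
    """Stand-in for one model invocation (a call)."""
    results = [b for b in context if b["type"] == "tool_result"]
    if not results:
        # No tool output yet: ask for a tool to be executed.
        return {"type": "tool_call", "name": "add", "args": {"a": 2, "b": 3}}
    return {"type": "response", "content": f"The sum is {results[-1]['content']}"}


# Tools the run is allowed to execute on the model's behalf.
TOOLS = {"add": lambda a, b: a + b}


def run(context, max_calls=5):
    """Orchestrate calls until the model produces a response block."""
    context = list(context)  # do not mutate the caller's list
    calls = 0
    while calls < max_calls:
        block = fake_model(context)  # one call
        calls += 1
        context.append(block)
        if block["type"] == "response":
            return block, calls
        if block["type"] == "tool_call":
            output = TOOLS[block["name"]](**block["args"])
            context.append({"type": "tool_result", "content": str(output)})
    raise RuntimeError("run exceeded max_calls")


final, call_count = run([{"type": "input", "content": "What is 2 + 3?"}])
```

Here the single run makes two calls: the first returns a tool_call block, and the second, made after the tool result is added to the context, returns the response.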

Blocks

Blocks are the unified message format. Instead of provider-specific message shapes, VAI uses typed blocks: input, response, system, thinking, tool_call, and tool_result. One format across every provider.
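
As a rough illustration of what a typed block might look like, here is a minimal sketch. Only the six block type names come from the text above; the `Block` class and its `content` field are assumptions made for the example, not the VAI wire format.

```python
from dataclasses import dataclass

# The six block types named above.
BLOCK_TYPES = ("input", "response", "system", "thinking", "tool_call", "tool_result")


@dataclass(frozen=True)
class Block:
    """Hypothetical in-memory block; the real VAI schema may differ."""
    type: str
    content: str  # assumed field, for illustration only

    def __post_init__(self):
        # Reject anything outside the six known block types.
        if self.type not in BLOCK_TYPES:
            raise ValueError(f"unknown block type: {self.type!r}")


# A short exchange expressed as one list of typed blocks.
conversation = [
    Block("system", "You are a concise assistant."),
    Block("input", "What is the capital of France?"),
    Block("response", "Paris."),
]
```

Because every entry has the same shape and differs only by its type, the same list could in principle be handed to any provider adapter without reshaping.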

Context Modes

Stateful mode (the default) maintains conversation history on the server automatically. Stateless mode means the caller supplies the full context for each execution; it does not mean sessionless or non-persistent. Both modes work with calls and runs.
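
The difference between the two modes can be sketched with a toy in-memory session. `ToySession` and `build_context` are hypothetical names, and the choice to leave server-side history untouched in stateless mode is an assumption made for the example. The point illustrated is that both modes still belong to a session; they differ only in who supplies the context.

```python
from dataclasses import dataclass, field


@dataclass
class ToySession:
    """Hypothetical stand-in for a server-side session."""
    session_id: str
    history: list = field(default_factory=list)   # server-maintained context
    call_log: list = field(default_factory=list)  # every execution is recorded

    def build_context(self, blocks, mode="stateful"):
        if mode == "stateful":
            # The server appends the new blocks and sends the full history.
            self.history.extend(blocks)
            context = list(self.history)
        elif mode == "stateless":
            # The caller supplied the full context; use it as given.
            # Assumption: server-side history is left untouched.
            context = list(blocks)
        else:
            raise ValueError(f"unknown context mode: {mode!r}")
        # Either way, the execution is still recorded under the session.
        self.call_log.append({"mode": mode, "context_len": len(context)})
        return context


session = ToySession("sess_demo")
first = session.build_context([{"type": "input", "content": "Hi"}])
second = session.build_context([{"type": "input", "content": "And again"}])
explicit = session.build_context(
    [
        {"type": "system", "content": "Be brief."},
        {"type": "input", "content": "Start over"},
    ],
    mode="stateless",
)
```

The second stateful request sees both inputs, while the stateless request sees only the two blocks the caller passed in. All three executions appear in the session's call log.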

Data Layer

The Data Layer is a derived query plane over your runtime data. It provides explicit query sources (session_summaries, run_summaries, call_summaries, browser_cleanup_jobs) with typed field catalogs, saved views, declared facets for metadata indexing, and structured exports, all without exposing raw SQL.
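
To make the idea of explicit sources with typed field catalogs concrete, here is a small sketch of a query that is validated before it runs. The four source names come from the text above. Everything else is hypothetical: the `Query` class, the `run_query` helper, and the field names in `FIELD_CATALOG` are invented for illustration and do not describe the real catalogs.

```python
from dataclasses import dataclass, field

# The four query sources named above.
QUERY_SOURCES = {
    "session_summaries",
    "run_summaries",
    "call_summaries",
    "browser_cleanup_jobs",
}

# HYPOTHETICAL catalog: these field names and types are invented.
FIELD_CATALOG = {
    "run_summaries": {"run_id": str, "status": str, "call_count": int},
}


@dataclass
class Query:
    source: str
    filters: dict = field(default_factory=dict)

    def validate(self):
        """Check the source and each filter against the typed catalog."""
        if self.source not in QUERY_SOURCES:
            raise ValueError(f"unknown query source: {self.source!r}")
        catalog = FIELD_CATALOG.get(self.source, {})
        for name, value in self.filters.items():
            if name not in catalog:
                raise ValueError(f"unknown field for {self.source}: {name!r}")
            if not isinstance(value, catalog[name]):
                raise TypeError(f"{name} expects {catalog[name].__name__}")
        return self


def run_query(query, rows):
    """Filter in-memory rows by equality; stands in for the real query plane."""
    query.validate()
    return [r for r in rows if all(r.get(k) == v for k, v in query.filters.items())]


rows = [
    {"run_id": "r1", "status": "completed", "call_count": 2},
    {"run_id": "r2", "status": "failed", "call_count": 1},
]
completed = run_query(Query("run_summaries", {"status": "completed"}), rows)
```

Validating against a catalog is what lets a query plane reject unknown sources or mistyped fields up front, instead of accepting arbitrary SQL.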

Assessments & Feedback

Assessments are machine-generated quality scores; feedback signals are user reactions. Both attach to runs, calls, and other canonical objects. Assessment profiles configure automatic scoring, which is zero-config by default and supports profile overrides per request.