Routing, Knowledge Base, guardrails, and telemetry for AI apps

One API. 561 AI models.
Up to 95% less.*

Connect to OpenClaw Developer API Pricing

Measured route savings

82.8%

Live route comparisons

Tokens routed

26.6M

Requests

3,062

Based on live route comparisons.

tokaroo SDK

import { Tokaroo } from "tokaroo";

const client = new Tokaroo({
  apiKey: "tok_..."
});

const res = await client.chat.completions.create({
  model:    "auto",   // best value — Tokaroo picks for you
  messages: [{ role: "user", content: "Hello!" }],
});

// model: "fast" — prioritize speed
// model: "max"  — maximum capability

Already using OpenAI? One line to swap.

 // Before
-const openai = new OpenAI({ apiKey: process.env.OPENAI_KEY });

 // After — one line change
+const openai = new OpenAI({
+  baseURL: "https://api.tokaroo.com/v1",
+  apiKey:  process.env.TOKAROO_KEY,
+});
 // Everything else stays the same

Works with

OC OpenClaw NC NemoClaw SDK OpenAI SDK API Any OpenAI-compatible app

How it works

Create an account

Get your key

Your first payment funds the account and creates your first active API key.

That's it

Tokaroo handles routing, cost, fallback, caching, context, guardrails, and telemetry. You just use AI.

What you get

auto - fast - max

Pick your tradeoff. auto optimizes for value, fast for speed, max for capability. Tokaroo handles everything underneath.

200+ routed models, one key

Anthropic, DeepSeek, Google, Groq, moonshot, OpenAI, and OpenRouter - chat, images, speech, and video. You request auto, fast, or max; Tokaroo handles the model routing underneath.

Semantic cache

Similar questions get answered from cache. You never pay for the same answer twice.

Auto fallback

If one provider goes down mid-session, another picks up instantly. Your app never crashes on an outage.

Knowledge Base

Give agents durable memory, source-grounded context, entities, events, and feedback loops instead of one-off prompts.

Sources

Ingest files, URLs, docs, specs, and records as retrievable source material with chunks and citations.

Mission Harness

Track tasks, steps, artifacts, approvals, actions, outcomes, and traces so agent work becomes learnable.

Action Guardrails

Check read, write, spend, destructive, and regulated actions before tools run. Require approvals when risk is high.

Docs Studio

Create documents, reports, specs, guides, and machine-readable artifacts from source material and agent output.

Usage dashboard

See exactly what you spent, per request. Balance, history, and credits in one place.

Works everywhere

OpenAI SDK compatible. Drop into OpenClaw, NemoClaw, or any OpenAI-compatible app.

Pay only for what you use.

No seats. No subscriptions. Usage-based pricing - pay only for what you use.

Get started View pricing Read the docs

One API. 561 AI models.Up to 95% less.*

How it works

What you get

Pay only for what you use.

One API. 561 AI models.
Up to 95% less.*