
Documentation - v1.7.1

OmniRoute Docs

AI gateway for multi-provider LLMs. One endpoint for OpenAI, Anthropic, Gemini, GitHub Copilot, Claude Code, Cursor, and 20+ more providers.

Quick Start

  1. Install and run

    Run npx omniroute or clone from GitHub and run npm start.

  2. Create API key

    Go to Endpoint -> Registered Keys. Generate one key per environment.

  3. Connect providers

    Add provider accounts via OAuth login, API key, or free-tier auto-connect.

  4. Set client base URL

    Point your IDE or API client to https://<host>/v1. Use a provider prefix, for example gh/gpt-5.1-codex.
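In practice, step 4 means sending standard OpenAI-style requests to the gateway. A minimal sketch follows; the host, key value, and model are placeholders (substitute your own host and a key generated under Endpoint -> Registered Keys):

```python
import json
import urllib.request

# Placeholders: substitute your gateway host and your registered key.
BASE_URL = "https://omniroute.example.com/v1"  # i.e. https://<host>/v1
API_KEY = "YOUR_REGISTERED_KEY"

# The model ID carries the provider prefix (gh/ routes to GitHub Copilot).
payload = {
    "model": "gh/gpt-5.1-codex",
    "messages": [{"role": "user", "content": "Hello"}],
}

req = urllib.request.Request(
    f"{BASE_URL}/chat/completions",
    data=json.dumps(payload).encode(),
    headers={
        "Authorization": f"Bearer {API_KEY}",
        "Content-Type": "application/json",
    },
    method="POST",
)
# urllib.request.urlopen(req) would send the request; omitted here
# since the host above is only a placeholder.
```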

Features

Multi-Provider Routing

Route requests to 30+ AI providers through a single OpenAI-compatible endpoint. Supports chat, responses, audio, and image APIs.

Combos and Balancing

Create model combos with fallback chains and balancing strategies: round-robin, priority, random, least-used, and cost-optimized.
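The interaction between a fallback chain and a balancing strategy can be sketched as follows. This is an illustration of round-robin over a chain, not OmniRoute's internals, and the model IDs are examples only:

```python
from itertools import cycle

class Combo:
    """A combo holds an ordered fallback chain of prefixed model IDs.
    Round-robin rotates which entry is tried first; on failure, the
    caller would fall through to the remaining entries in order."""

    def __init__(self, providers):
        self.providers = providers
        self._rr = cycle(range(len(providers)))

    def ordered_attempts(self):
        """Return the chain starting at the round-robin position."""
        start = next(self._rr)
        n = len(self.providers)
        return [self.providers[(start + i) % n] for i in range(n)]

combo = Combo(["gh/gpt-5.1-codex", "cc/claude-sonnet", "openai/gpt-4.1"])
```

Priority balancing would simply keep the chain in its configured order; least-used and cost-optimized strategies would sort the chain by a live metric instead of rotating.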

Usage and Cost Tracking

Real-time token counting, cost calculation per provider/model, and detailed usage breakdown by API key and account.
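The per-model cost figure reduces to token counts times per-token prices. A sketch with a made-up price entry (real prices vary by provider and model):

```python
# Hypothetical (input, output) prices in USD per 1M tokens.
PRICES = {
    "openai/gpt-4.1": (2.00, 8.00),
}

def request_cost(model, input_tokens, output_tokens):
    """Cost of a single request, the figure aggregated per
    provider/model, API key, and account in the Usage tab."""
    inp, out = PRICES[model]
    return (input_tokens * inp + output_tokens * out) / 1_000_000
```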

Analytics Dashboard

Visual analytics with charts for requests, tokens, errors, latency, costs, and model popularity over time.

Health Monitoring

Live health checks, provider status, circuit breaker states, and automatic rate limit detection with exponential backoff.
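Exponential backoff after a detected rate limit follows the general pattern below; the base and cap values are illustrative, not OmniRoute's actual parameters:

```python
import random

def backoff_delay(attempt, base=1.0, cap=60.0):
    """Exponential backoff with full jitter: the delay window doubles
    per failed attempt, is capped, and a random fraction of it is
    taken so that many clients do not retry in lockstep."""
    return random.uniform(0, min(cap, base * 2 ** attempt))
```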

CLI Tools

Manage IDE configurations, export/import backups, discover codex profiles, and configure settings from the dashboard.

Security and Policies

API key authentication, IP filtering, prompt injection guard, domain policies, session management, and audit logging.

Cloud Sync

Sync your configuration to Cloudflare Workers for remote access with encrypted credentials and automatic failover.

Supported Providers

36 providers across three connection types.

Manage Providers

Free Tier

4 providers
iFlow AI (if/)
Qwen Code (qw/)
Gemini CLI (gc/)
Kiro AI (kr/)

OAuth

8 providers
Claude Code (cc/)
Antigravity (ag/)
OpenAI Codex (cx/)
GitHub Copilot (gh/)
Cursor IDE (cu/)
Kimi Coding (kmc/)
Kilo Code (kc/)
Cline (cl/)

API Key

24 providers
OpenRouter (openrouter/)
GLM Coding (glm/)
Kimi (kimi/)
Minimax Coding (minimax/)
Minimax (China) (minimax-cn/)
OpenAI (openai/)
Anthropic (anthropic/)
Gemini (gemini/)
DeepSeek (ds/)
Groq (groq/)
xAI (Grok) (xai/)
Mistral (mistral/)
Perplexity (pplx/)
Together AI (together/)
Fireworks AI (fireworks/)
Cerebras (cerebras/)
Cohere (cohere/)
NVIDIA NIM (nvidia/)
Nebius AI (nebius/)
SiliconFlow (siliconflow/)
Hyperbolic (hyp/)
Deepgram (dg/)
AssemblyAI (aai/)
NanoBanana (nb/)

Common Use Cases

Single endpoint for many providers

Point clients to one base URL and route by model prefix (for example: gh/, cc/, kr/, openai/).

Fallback and model switching with combos

Create combo models in the Dashboard and keep client config stable while providers rotate internally.

Usage, cost and debug visibility

Track tokens and cost by provider, account, and API key in Usage and Analytics tabs.

Client Compatibility

Cherry Studio

  • Base URL: https://<host>/v1
  • Chat endpoint: /chat/completions
  • Model recommendation: explicit prefix (gh/..., cc/...)

Codex / GitHub Copilot Models

  • Use model IDs with gh/.
  • Codex-family models auto-route to /responses.
  • Non-Codex models continue on /chat/completions.
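The endpoint selection described above can be sketched as a simple routing rule; the "codex appears in the model name" check is an assumption for illustration:

```python
def endpoint_for(model: str) -> str:
    """Pick the upstream endpoint for a prefixed model ID:
    Codex-family models on gh/ go to /responses, everything
    else stays on /chat/completions."""
    provider, _, name = model.partition("/")
    if provider == "gh" and "codex" in name:
        return "/v1/responses"
    return "/v1/chat/completions"
```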

Cursor IDE

  • Use cu/ prefix for Cursor models.
  • OAuth connection - login from the Providers page.
  • Supports both chat and responses endpoints.

Claude Code / Antigravity

  • Use cc/ (Claude) or ag/ (Antigravity) prefix.
  • OAuth connection with automatic token refresh.
  • Full streaming support for all models.

API Reference

Method  Path                      Notes
POST    /v1/chat/completions      OpenAI-compatible chat endpoint (default).
POST    /v1/responses             Responses API endpoint (Codex, o-series).
GET     /v1/models                Model catalog for all connected providers.
POST    /v1/audio/transcriptions  Audio transcription (Deepgram, AssemblyAI).
POST    /v1/images/generations    Image generation (NanoBanana).
POST    /chat/completions         Rewrite helper for clients without /v1.
POST    /responses                Rewrite helper for Responses without /v1.
GET     /models                   Rewrite helper for model discovery without /v1.

Model Prefixes

Use the provider prefix before the model name to route to a specific provider. Example: gh/gpt-5.1-codex routes to GitHub Copilot.

Prefix        Provider         Type
if/           iFlow AI         Free Tier
qw/           Qwen Code        Free Tier
gc/           Gemini CLI       Free Tier
kr/           Kiro AI          Free Tier
cc/           Claude Code      OAuth
ag/           Antigravity      OAuth
cx/           OpenAI Codex     OAuth
gh/           GitHub Copilot   OAuth
cu/           Cursor IDE       OAuth
kmc/          Kimi Coding      OAuth
kc/           Kilo Code        OAuth
cl/           Cline            OAuth
openrouter/   OpenRouter       API Key
glm/          GLM Coding       API Key
kimi/         Kimi             API Key
minimax/      Minimax Coding   API Key
minimax-cn/   Minimax (China)  API Key
openai/       OpenAI           API Key
anthropic/    Anthropic        API Key
gemini/       Gemini           API Key
ds/           DeepSeek         API Key
groq/         Groq             API Key
xai/          xAI (Grok)       API Key
mistral/      Mistral          API Key
pplx/         Perplexity       API Key
together/     Together AI      API Key
fireworks/    Fireworks AI     API Key
cerebras/     Cerebras         API Key
cohere/       Cohere           API Key
nvidia/       NVIDIA NIM       API Key
nebius/       Nebius AI        API Key
siliconflow/  SiliconFlow      API Key
hyp/          Hyperbolic       API Key
dg/           Deepgram         API Key
aai/          AssemblyAI       API Key
nb/           NanoBanana       API Key
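The prefix convention can be parsed with a simple split; the helper name here is hypothetical, for illustration only:

```python
def split_model(model_id: str):
    """Split a prefixed model ID into (provider_prefix, model_name).
    A bare ID with no '/' returns an empty prefix, which is the
    ambiguous case the Troubleshooting section warns about."""
    prefix, sep, name = model_id.partition("/")
    if not sep:
        return "", model_id
    return prefix, name
```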

Troubleshooting

  • If a client fails to route a model, specify an explicit provider/model (for example: gh/gpt-5.1-codex).
  • If you receive ambiguous-model errors, add a provider prefix instead of using a bare model ID.
  • For GitHub Codex-family models, keep the model as gh/<codex-model>; the router selects /responses automatically.
  • Use Dashboard > Providers > Test Connection before testing from IDEs or external clients.
  • If a provider shows an open circuit breaker, wait for the cooldown or check the Health page for details.
  • For OAuth providers, re-authenticate if tokens expire. Check the provider card status indicator.