
# Models & Providers

## Model ID Format

All model IDs follow the pattern `<provider>/<model-name>`:

```
gemini/gemini-2.5-flash
openrouter/anthropic/claude-3-5-sonnet
kimi/moonshot-v1-8k
minimax/MiniMax-M2.5
local/llama-3
```

The gateway parses the prefix to route requests to the correct provider.
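The split happens at the first slash only, since OpenRouter IDs embed a second slash inside the model name. A minimal sketch of that parsing step (the gateway's actual routing code may differ):

```typescript
// Split a model ID into provider prefix and model name.
// Only the first "/" delimits the provider: OpenRouter IDs such as
// "openrouter/anthropic/claude-3-5-sonnet" keep their inner slash.
function parseModelId(id: string): { provider: string; model: string } {
  const slash = id.indexOf("/");
  if (slash === -1) throw new Error(`invalid model ID: ${id}`);
  return { provider: id.slice(0, slash), model: id.slice(slash + 1) };
}
```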

## Providers

### Gemini (Google)

| Model | Prompt ($/1M) | Completion ($/1M) | Context |
| --- | --- | --- | --- |
| gemini/gemini-2.5-pro | $1.25 | $10.00 | 1M |
| gemini/gemini-2.5-flash | $0.15 | $0.60 | 1M |
| gemini/gemini-2.5-flash-lite | $0.10 | $0.40 | 1M |
| gemini/gemini-2.0-flash | $0.10 | $0.40 | 1M |
| gemini/gemini-1.5-pro | $1.25 | $5.00 | 2M |
| gemini/gemini-1.5-flash | $0.075 | $0.30 | 1M |

### Kimi (Moonshot AI)

| Model | Prompt ($/1M) | Completion ($/1M) | Context |
| --- | --- | --- | --- |
| kimi/kimi-k2.5 | $0.60 | $3.00 | 262k |
| kimi/moonshot-v1-8k | $0.20 | $2.00 | 8k |
| kimi/moonshot-v1-32k | $1.00 | $3.00 | 32k |
| kimi/moonshot-v1-128k | $2.00 | $5.00 | 131k |

> **INFO**
> kimi-k2.5 ignores the `temperature`, `top_p`, and penalty parameters.
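A provider adapter can simply drop these fields before forwarding the request. A sketch, assuming an OpenAI-style flat request object (the gateway's real request shape may differ):

```typescript
// Drop the sampling parameters that kimi-k2.5 ignores, forwarding the rest.
function stripIgnoredParams(
  params: Record<string, unknown>
): Record<string, unknown> {
  const { temperature, top_p, presence_penalty, frequency_penalty, ...rest } =
    params;
  return rest;
}
```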

### MiniMax

| Model | Prompt ($/1M) | Completion ($/1M) | Context |
| --- | --- | --- | --- |
| minimax/MiniMax-M2.7 | $0.30 | $1.20 | 204k |
| minimax/MiniMax-M2.5 | $0.118 | $0.95 | 196k |
| minimax/MiniMax-M2 | $0.255 | $1.00 | 196k |
| minimax/MiniMax-M1 | $0.40 | $1.76 | 1M |
| minimax/MiniMax-Text-01 | $0.20 | $1.10 | 1M |

> **INFO**
> MiniMax ignores the `presence_penalty` and `frequency_penalty` parameters.

### OpenRouter

Provides access to 400+ models. The model list is fetched dynamically from the OpenRouter API, and pricing comes from the same API response.

### Local

For self-hosted models (Ollama, vLLM, or any OpenAI-compatible endpoint). Pricing is free ($0/$0). The context window defaults to 4096 tokens when the model metadata does not specify one.
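That fallback can be expressed as follows (a sketch; the metadata field name is an assumption, not the gateway's actual schema):

```typescript
// Use the model's advertised context length when present, else 4096.
function contextWindow(meta?: { contextLength?: number }): number {
  return meta?.contextLength ?? 4096;
}
```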

## Pricing

Prices shown above are downstream costs (what the gateway pays the provider). The gateway applies a configurable markup (default 20%) on top:

```
effective cost = downstream cost × (1 + markup)
```
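For example, with the default 20% markup (the helper name below is illustrative, not a gateway API):

```typescript
// Effective per-million-token cost after applying the gateway markup.
function effectiveCost(downstreamPerMTok: number, markup = 0.2): number {
  return downstreamPerMTok * (1 + markup);
}

// gemini/gemini-2.5-flash prompt tokens: $0.15/1M downstream becomes
// roughly $0.18/1M effective.
```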

## Adding a New Provider

1. Implement the `LLMProvider` interface in `src/services/llm/`
2. Register the provider in `src/services/llm/index.ts`
3. Add the API key environment variable to `src/config.ts`
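The `LLMProvider` interface lives in the codebase; the shapes below are a hypothetical sketch to illustrate the implement-and-register flow, not the real signatures:

```typescript
// Hypothetical shapes — the real interface in src/services/llm/ may differ.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface CompletionResult {
  text: string;
  promptTokens: number;
  completionTokens: number;
}

interface LLMProvider {
  readonly name: string; // prefix used in model IDs, e.g. "local"
  complete(model: string, messages: ChatMessage[]): Promise<CompletionResult>;
}

// Minimal example provider that echoes the last message back.
class EchoProvider implements LLMProvider {
  readonly name = "echo";
  async complete(
    _model: string,
    messages: ChatMessage[]
  ): Promise<CompletionResult> {
    const last = messages[messages.length - 1];
    return { text: last.content, promptTokens: 0, completionTokens: 0 };
  }
}

// Registration maps the model-ID prefix to a provider instance,
// mirroring step 2 above.
const providers = new Map<string, LLMProvider>([["echo", new EchoProvider()]]);
```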

Released under the MIT License.