Bring Your Own Key (BYOK)
Use your own provider credentials. Keys are encrypted at rest; we only display the last 4 characters.
Provider credentials
OpenAI
OpenAI Responses
Anthropic
Openrouter
Chutes
AWS Bedrock
Paste JSON: {"accessKeyId":"...","secretAccessKey":"...","region":"us-east-1"}
Azure
Paste JSON: {"endpoint":"https://<endpoint>","apiKey":"<key>","deploymentName":"<deployment>","apiVersion":"2024-10-21-preview"}
Azure Responses
Paste JSON: {"endpoint":"https://<endpoint>","apiKey":"<key>","apiVersion":"2024-10-21-preview"}
Azure Anthropic
Paste JSON: {"endpoint":"https://<resource>.services.ai.azure.com/anthropic/","apiKey":"<key>"}
Groq
SambaNova
Vercel
Novita
MegaNova
SiliconFlow
Akash
NVIDIA
Google AI Studio
Z.AI (GLM)
GMICloud
Cerebras
DeepInfra
Hyperbolic
Morpheus
Fireworks
Together
Web Search Credentials
Tavily (Web Search)
For web search only. Used by /api/web and web-search enhancements.
Exa (Web Search)
For web search only. Supports neural/semantic search modes.
Kagi (Web Search)
For web search only. Used by /api/web and web-search enhancements.
Perplexity (Web Search)
For web search only. Used by /api/web and web-search enhancements.
Example: Call v1/chat/completions with BYOK
Replace YOUR_API_KEY and choose one of the model providers: openai | openai-responses | anthropic | openrouter | chutes | aws | azure | azure-responses | azure-anthropic-foundry | groq | sambanova | vercel | novita | meganova | siliconflow | akash | nvidia | google | zai | gmicloud | cerebras | deepinfra. Web search keys are used only for web search endpoints and enhancements.
BYOK FAQ
What is the pricing?
We charge a 5% markup on the normal at-cost API rates. Your provider bills you directly; we only charge the markup.
What if I use a free model via BYOK?
If the model that you use is free via the original provider where you use it from we base our 5% markup on what the lowest rate would be if you used the paid version of it. If you for example use an API key that has free requests for gemini-2.5-pro, we will charge the 5% markup based on regular gemini-2.5-pro pricing.
Which endpoint supports BYOK?
BYOK is currently supported only on the OpenAI-compatible endpoint /api/v1/chat/completions.
Where are web search keys used?
Web search keys are used for web search only (e.g., /api/web and web-search enhancements). They are not chat model providers.
Do I have to set x-byok-provider?
No. It’s optional. We auto-map based on the model. Setting it is recommended if you want to force a specific provider.
How do I use Google Gemini AI Studio?
Add your Google AI Studio key above, then set the header x-byok-provider: google when calling /api/v1/chat/completions. This forces routing through Google AI Studio for Gemini models.
What usage counts are used for billing?
We prefer provider-reported usage tokens when available (streaming or final usage). If unavailable, we estimate conservatively.
How are keys stored?
Keys are encrypted at rest. We only store the last 4 characters for display and never send keys back to the client.
