Bring Your Own Key (BYOK)

Use your own provider credentials. Keys are encrypted at rest; we only display the last 4 characters.

BYOK applies to the OpenAI-compatible /api/v1/chat/completions endpoint. Web-search providers are used by the /api/web endpoint and web-search enhancements, and have separate usage patterns.

Provider credentials

OpenAI

Not configured

OpenAI Responses

Not configured

Anthropic

Not configured

OpenRouter

Not configured

Chutes

Under maintenance

AWS Bedrock

Not configured

Paste JSON: {"accessKeyId":"...","secretAccessKey":"...","region":"us-east-1"}

Azure

Not configured

Paste JSON: {"endpoint":"https://<endpoint>","apiKey":"<key>","deploymentName":"<deployment>","apiVersion":"2024-10-21-preview"}

Azure Responses

Not configured

Paste JSON: {"endpoint":"https://<endpoint>","apiKey":"<key>","apiVersion":"2024-10-21-preview"}

Azure Anthropic

Not configured

Paste JSON: {"endpoint":"https://<resource>.services.ai.azure.com/anthropic/","apiKey":"<key>"}

Groq

Not configured

SambaNova

Not configured

Vercel

Not configured

Novita

Not configured

MegaNova

Not configured

SiliconFlow

Not configured

Akash

Not configured

NVIDIA

Under maintenance

Google AI Studio

Not configured

Ollama

Not configured

Z.AI (GLM)

Under maintenance

GMICloud

Not configured

Cerebras

Not configured

DeepInfra

Not configured

Hyperbolic

Disabled

Morpheus

Disabled

Fireworks

Disabled

Together

Disabled
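The JSON credentials for AWS Bedrock and Azure shown above can be produced programmatically instead of typed by hand, which avoids quoting mistakes. The following is a minimal sketch: the field names come from the "Paste JSON" hints on this page, while the function names are hypothetical helpers and not part of any official SDK.

```python
import json


def bedrock_credentials(access_key_id, secret_access_key, region="us-east-1"):
    """Return the JSON string to paste into the AWS Bedrock field."""
    return json.dumps({
        "accessKeyId": access_key_id,
        "secretAccessKey": secret_access_key,
        "region": region,
    })


def azure_credentials(endpoint, api_key, deployment_name, api_version):
    """Return the JSON string to paste into the Azure field."""
    return json.dumps({
        "endpoint": endpoint,
        "apiKey": api_key,
        "deploymentName": deployment_name,
        "apiVersion": api_version,
    })
```

Print the result and paste it into the matching provider field. Avoid writing real secrets to shell history or log files while doing so.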

Web Search Credentials

Tavily (Web Search)

Not configured

For web search only. Used by /api/web and web-search enhancements.

Exa (Web Search)

Not configured

For web search only. Supports neural/semantic search modes.

Kagi (Web Search)

Not configured

For web search only. Used by /api/web and web-search enhancements.

Perplexity (Web Search)

Not configured

For web search only. Used by /api/web and web-search enhancements.

Valyu (Web Search)

Not configured

For web search only. Supports web, news, and proprietary indexes.

Example: Call /api/v1/chat/completions with BYOK

# Optional but recommended: set x-byok-provider to force provider mapping when in doubt
curl -X POST https://nano-gpt.com/api/v1/chat/completions \
  -H "x-api-key: YOUR_API_KEY" \
  -H "x-use-byok: true" \
  -H "x-byok-provider: openai" \
  -H "content-type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "user", "content": "Say hello from BYOK"}
    ]
  }'
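The same request can be sketched in Python using only the standard library. The URL, header names, and body mirror the curl example above; the function name build_byok_request is a hypothetical helper, and the code only builds the request without sending it.

```python
import json
import urllib.request

BYOK_URL = "https://nano-gpt.com/api/v1/chat/completions"


def build_byok_request(api_key, model, prompt, provider=None):
    """Build (but do not send) a BYOK chat completion request."""
    headers = {
        "x-api-key": api_key,
        "x-use-byok": "true",
        "content-type": "application/json",
    }
    # x-byok-provider is optional; include it only to force a provider.
    if provider is not None:
        headers["x-byok-provider"] = provider
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(BYOK_URL, data=body, headers=headers, method="POST")
```

To send it, pass the returned object to urllib.request.urlopen and parse the response body as JSON. Note that this performs a real call that your provider will bill.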

BYOK FAQ

What is the pricing?

We charge a 5% markup on the normal at-cost API rates. Your provider bills you directly; we only charge the markup.

What if I use a free model via BYOK?

If the model is free through the provider you use it from, we base our 5% markup on the lowest rate that the paid version of that model would cost. For example, if you use an API key that has free requests for gemini-2.5-pro, we charge the 5% markup based on regular gemini-2.5-pro pricing.
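The arithmetic behind the markup can be sketched as follows. The 5% rate comes from this FAQ; the per-million-token prices in the usage note are made-up illustration values, not real provider rates, and the function name is hypothetical.

```python
from decimal import Decimal

MARKUP_RATE = Decimal("0.05")  # 5% markup, as stated in the FAQ


def byok_markup(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the markup in USD for one request.

    Prices are the provider's regular list prices per million tokens,
    passed as strings. For a free-tier key, pass the paid list prices,
    since the markup is based on those.
    """
    million = Decimal(1_000_000)
    list_cost = (
        Decimal(input_tokens) / million * Decimal(input_price_per_m)
        + Decimal(output_tokens) / million * Decimal(output_price_per_m)
    )
    return list_cost * MARKUP_RATE
```

With hypothetical prices of 1.00 USD per million input tokens and 4.00 USD per million output tokens, a request with 200,000 input tokens and 50,000 output tokens has a list cost of 0.40 USD, so byok_markup(200_000, 50_000, "1.00", "4.00") gives a markup of 0.02 USD. The result is the same whether the provider actually billed the request or served it for free.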

Which endpoint supports BYOK?

BYOK is currently supported only on the OpenAI-compatible endpoint /api/v1/chat/completions.

Where are web search keys used?

Web search keys are used for web search only (e.g., /api/web and web-search enhancements). They are not chat model providers.

Do I have to set x-byok-provider?

No, it is optional. We map the provider automatically based on the model. Set the header when you want to force a specific provider.

How do I use Google AI Studio (Gemini)?

Add your Google AI Studio key above, then set the header x-byok-provider: google when calling /api/v1/chat/completions. This forces routing through Google AI Studio for Gemini models.
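As a rough illustration, the headers and body for a Gemini call routed through Google AI Studio might be assembled like this. The header names and the value google come from this page; the helper name is hypothetical, and the default model ID gemini-2.5-pro is an assumption taken from the pricing example, so check the model list for the exact identifier.

```python
import json


def gemini_byok_parts(api_key, prompt, model="gemini-2.5-pro"):
    """Return (headers, json_body) for a BYOK call forced through Google AI Studio."""
    headers = {
        "x-api-key": api_key,
        "x-use-byok": "true",
        "x-byok-provider": "google",  # forces Google AI Studio routing
        "content-type": "application/json",
    }
    body = json.dumps({
        "model": model,  # assumed model ID, see lead-in
        "messages": [{"role": "user", "content": prompt}],
    })
    return headers, body
```

POST the body with these headers to /api/v1/chat/completions using any HTTP client.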

What usage counts are used for billing?

We prefer provider-reported usage tokens when available (streaming or final usage). If unavailable, we estimate conservatively.
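The "prefer reported usage, otherwise estimate" rule can be sketched on the client side for reconciling your own bills. This is not the service's actual billing code. It assumes the response follows the OpenAI-compatible shape with a usage object containing prompt_tokens and completion_tokens, and the fallback estimator (about four characters per token) is a hypothetical placeholder, since the real estimation method is not documented here.

```python
import math


def estimate_tokens(text):
    # Hypothetical placeholder heuristic: ~4 characters per token, rounded up.
    return math.ceil(len(text) / 4)


def resolve_usage(response, prompt_text, completion_text):
    """Prefer provider-reported token counts; fall back to an estimate."""
    usage = response.get("usage") or {}
    if "prompt_tokens" in usage and "completion_tokens" in usage:
        return {
            "prompt_tokens": usage["prompt_tokens"],
            "completion_tokens": usage["completion_tokens"],
            "source": "provider",
        }
    return {
        "prompt_tokens": estimate_tokens(prompt_text),
        "completion_tokens": estimate_tokens(completion_text),
        "source": "estimate",
    }
```

The source field records which path was taken, which helps when comparing your own totals against the billed amounts.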

How are keys stored?

Keys are encrypted at rest. We only store the last 4 characters for display and never send keys back to the client.
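Because only the last 4 characters are shown, a small local check can confirm which of your keys is the one saved here. The sketch below is a hypothetical client-side helper; it never sends the key anywhere.

```python
def key_suffix(key, n=4):
    """Return the last n characters of a key, as shown on this page."""
    return key[-n:]


def matches_saved(local_key, displayed_suffix):
    """True if the local key ends with the suffix displayed for the saved key."""
    return bool(displayed_suffix) and local_key.endswith(displayed_suffix)
```

A match on four characters is only a hint, not proof, since different keys can share the same ending.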