API

411 Total Models
342 Text Models
49 Image Models
20 Video Models

Quick Start

Get up and running in minutes

1. Create API Key

Generate your API key from the dashboard below

2. Use Our Examples

Copy code examples in Python, JavaScript, or cURL

3. Make Requests

Start using 400+ AI models instantly

API Keys

Manage your API authentication

You can create up to 5 API keys. Need more? Contact support
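Once a key exists, one common way to keep it out of source code is to read it from an environment variable. The sketch below assumes a variable named NANOGPT_API_KEY (a hypothetical name chosen for illustration, not one defined by the API) and builds the same Bearer header used in the code example on this page.

```python
import os


def auth_headers(env_var: str = "NANOGPT_API_KEY") -> dict:
    """Build request headers from an API key stored in the environment.

    The variable name is an assumption for this sketch; use whatever
    name fits your deployment.
    """
    api_key = os.environ.get(env_var)
    if not api_key:
        raise RuntimeError(f"Set the {env_var} environment variable first")
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
```

The returned dictionary can be passed as the headers argument of an HTTP client call.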


Documentation

Complete API reference and guides

Full API Documentation

For comprehensive API documentation, integration guides, and best practices, visit our documentation site.

Visit docs.nano-gpt.com

API Endpoints

Available endpoints and their capabilities

Chat Completions

POST /api/v1/chat/completions

OpenAI-compatible endpoint for text generation with streaming support
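Because the endpoint is described as OpenAI-compatible, a streamed response can probably be read the way OpenAI streams are: as server-sent event lines of the form "data: {json}", ending with "data: [DONE]". The helper below is a minimal sketch under that assumption; the chunk layout (choices, delta, content) follows OpenAI's format and has not been checked against this API.

```python
import json
from typing import Iterable, Iterator, Union


def iter_stream_text(lines: Iterable[Union[str, bytes]]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style streaming lines (assumed format)."""
    for line in lines:
        if isinstance(line, bytes):
            line = line.decode("utf-8")
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and SSE comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream marker in the OpenAI format
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text
```

With the requests library, this would typically be fed from response.iter_lines() after posting with "stream": True in the JSON body and stream=True on the call; the "stream" request field is likewise assumed from the OpenAI format.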

Image Generation

POST /api/generate-image

Generate images with DALL-E, Midjourney, Flux, and more

Video Generation

POST /api/generate-video

Create videos with Kling, Veo, Hunyuan, and other providers

Audio & Speech

POST /api/text-to-speech

Text-to-speech and speech-to-text capabilities

Models List

GET /api/v1/models

Get available models with pricing information
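As a rough illustration, the model list could be fetched and reduced to model IDs as shown below. The URL combines the base URL from the code example on this page with the path above. The response shape (a top-level "data" list of objects with an "id" field) is an assumption borrowed from OpenAI's models endpoint, and whether this endpoint requires the Authorization header is not stated here.

```python
import json
import urllib.request

MODELS_URL = "https://nano-gpt.com/api/v1/models"


def fetch_models(api_key: str) -> dict:
    """GET the model list and return the decoded JSON body."""
    request = urllib.request.Request(
        MODELS_URL,
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def model_ids(payload: dict) -> list[str]:
    """Extract sorted model IDs, assuming an OpenAI-style {"data": [...]} body."""
    return sorted(item["id"] for item in payload.get("data", []) if "id" in item)
```

Calling model_ids(fetch_models(api_key)) would then give a list of identifiers such as those shown under Available Models, provided the assumed shape holds.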

TEE Attestation

GET /api/v1/tee/attestation

Secure GPU-TEE attestation for verified inference

For the full endpoint list, see our Documentation

Available Models

Browse and search our model catalog

ASI1 Mini

asi1-mini

ASI-1 Mini introduces next-level adaptive reasoning and context-aware decision-making. It features native reasoning support with four dynamic reasoning modes (Multi-Step, Complete, Optimized, and Short Reasoning) and selects among them to balance depth, efficiency, and precision. Whether tackling complex, multi-layered problems or delivering concise, high-impact insights, ASI-1 Mini tailors its reasoning to the task at hand. Note: this model is currently rate limited.

Input: $1.70/1M
Output: $1.70/1M

Aion 1.0

aion-labs/aion-1.0

Aion Labs' most powerful reasoning model, with high performance across reasoning and coding.

Input: $6.80/1M
Output: $13.60/1M

Aion 1.0 mini (DeepSeek)

aion-labs/aion-1.0-mini

A distilled version of the DeepSeek-R1 model that excels in reasoning domains like mathematics, coding, and logic.

Input: $1.19/1M
Output: $2.38/1M

Amazon Nova Lite 1.0

amazon/nova-lite-v1

Amazon's new lower cost model. Can handle up to 300k input tokens, with faster output but less thorough understanding than Amazon's Nova Pro.

Input: $0.10/1M
Output: $0.41/1M

Amazon Nova Micro 1.0

amazon/nova-micro-v1

Amazon's lowest cost model. Comparable to GPT-4o-mini and Gemini 1.5 Flash, with the fastest output.

Input: $0.06/1M
Output: $0.24/1M

Amazon Nova Pro 1.0

amazon/nova-pro-v1

Amazon's new flagship model. Can handle up to 300k input tokens, with comparable performance to ChatGPT and Claude 3.5 Sonnet.

Input: $1.36/1M
Output: $5.44/1M

Amoral Gemma3 27B v2

soob3123/amoral-gemma3-27B-v2

Amoral Gemma3 27B v2 is a 27B parameter model that is a more advanced version of Gemma3 27B.

Input: $0.51/1M
Output: $0.51/1M

Anubis 70B v1

TheDrummer/Anubis-70B-v1

Llama 3.3 finetune for roleplaying.

Input: $0.85/1M
Output: $0.85/1M

Anubis Pro 105b v1

anubis-pro-105b-v1

An upscaled version of Llama 3.3 70B with 50% more layers. Finetuned further to make use of its new layers.

Input: $1.36/1M
Output: $1.70/1M

Athene V2 Chat

Nexusflow/Athene-V2-Chat

An open-weights LLM on par with GPT-4o across benchmarks.

Input: $0.85/1M
Output: $0.85/1M

Azure gpt-4-turbo

azure-gpt-4-turbo

Azure version of OpenAI gpt-4-turbo. ⚠️ WARNING: OpenAI may retain and use data sent to this model for training purposes.

Input: $17.00/1M
Output: $51.00/1M

Azure gpt-4o

azure-gpt-4o

Azure version of OpenAI gpt-4o. ⚠️ WARNING: OpenAI may retain and use data sent to this model for training purposes.

Input: $4.25/1M
Output: $17.00/1M

Code Examples

Get started with code samples in multiple languages

A simple example to get you started with the NanoGPT API

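import requests

API_KEY = "YOUR_API_KEY"  # replace with a key created in the API Keys section
BASE_URL = "https://nano-gpt.com/api/v1"

headers = {
    "Authorization": f"Bearer {API_KEY}",
    "Content-Type": "application/json",
}

# Simple chat completion
response = requests.post(
    f"{BASE_URL}/chat/completions",
    headers=headers,
    json={
        "model": "chatgpt-4o-latest",
        "messages": [
            {"role": "user", "content": "Hello, how are you?"}
        ],
    },
    timeout=60,  # avoid waiting indefinitely on a stalled connection
)
response.raise_for_status()  # raise on 4xx/5xx instead of printing an error body

print(response.json())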
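The example above prints the whole JSON body. To pull out only the assistant's reply, a small helper along these lines may be useful. It assumes the OpenAI-style response layout (choices[0].message.content), which the endpoint's "OpenAI-compatible" description suggests but this page does not spell out.

```python
def reply_text(body: dict) -> str:
    """Return the first choice's message text from an OpenAI-style response body."""
    choices = body.get("choices") or []
    if not choices:
        # Error responses usually carry no choices; show the body to aid debugging.
        raise ValueError(f"No choices in response: {body!r}")
    return choices[0]["message"]["content"]
```

For instance, print(reply_text(response.json())) would show just the generated text.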

Pricing

Simple, transparent pricing for all models

All prices are pay-as-you-go with no monthly fees. Prices shown are per 1 million tokens for text models.
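As a worked example of per-million-token pricing, the sketch below estimates the cost of a request from its token counts. It assumes input and output tokens are billed separately at the listed rates, with no minimum charge or rounding; the function name and the token counts are illustrative only.

```python
from decimal import Decimal


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_m: str, output_price_per_m: str) -> Decimal:
    """Estimate cost in USD from token counts and per-1M-token prices."""
    million = Decimal(1_000_000)
    total = (Decimal(input_tokens) * Decimal(input_price_per_m)
             + Decimal(output_tokens) * Decimal(output_price_per_m))
    return total / million


# amazon/nova-lite-v1 is listed at $0.10 input and $0.41 output per 1M tokens.
# 200,000 input tokens and 50,000 output tokens:
#   200000 * 0.10 / 1e6 + 50000 * 0.41 / 1e6 = 0.02 + 0.0205 = 0.0405
cost = estimate_cost(200_000, 50_000, "0.10", "0.41")
print(f"${cost}")
```

Prices are passed as strings so that Decimal keeps them exact rather than inheriting floating-point error.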

For detailed pricing information and bulk discounts, please contact our sales team or visit our Discord community.

Get notified about API updates.

We will only use this to contact you about updates to how the API works. You can unsubscribe at any time.