API

497 Total Models
396 Text Models
75 Image Models
26 Video Models

Quick Start

Get up and running in minutes

1

Create API Key

Generate your API key from the dashboard below

2

Use Our Examples

Copy code examples in Python, JavaScript, or cURL

3

Make Requests

Start using 300+ AI models instantly

API Keys

Manage your API authentication

You can create up to 10 API keys. Need more? Contact support

No API keys yet

Create your first API key to get started

Documentation

Complete API reference and guides

Full API Documentation

For comprehensive API documentation, integration guides, and best practices, visit our documentation site.

Visit docs.nano-gpt.com

API Endpoints

Available endpoints and their capabilities

Chat Completions

POST /api/v1/chat/completions

OpenAI-compatible endpoint for text generation with streaming support

Image Generation

POST /api/generate-image

Generate images with DALL-E, Midjourney, Flux, and more

Video Generation

POST /api/generate-video

Create videos with Kling, Veo, Hunyuan, and other providers

Audio & Speech

POST /api/text-to-speech

Text-to-speech and speech-to-text capabilities

Models List

GET /api/v1/models

Get available models with pricing information

TEE Attestation

GET /api/v1/tee/attestation

Secure GPU-TEE attestation for verified inference

For the full endpoint list, see our Documentation

Available Models

Browse and search our model catalog

ASI1 Mini

asi1-mini
ASI-1 Mini introduces next-level adaptive reasoning, context-aware decision-making. It features native reasoning support with four dynamic reasoning modes, intelligently selecting from Multi-Step, Complete, Optimized, and Short Reasoning, balancing depth, efficiency, and precision. Whether tackling complex, multi-layered problems or delivering concise, high-impact insights, ASI-1 Mini ensures reasoning is always tailored to the task at hand. Note: this model is rate limited at the moment.
API Input: $1.00/1M
API Output: $1.00/1M

Aion 1.0

aion-labs/aion-1.0
Aion Labs most powerful reasoning model with high performance across reasoning and coding.
API Input: $4.00/1M
API Output: $7.99/1M

Aion 1.0 mini (DeepSeek)

aion-labs/aion-1.0-mini
A distilled version of the DeepSeek-R1 model that excels in reasoning domains like mathematics, coding, and logic.
API Input: $0.80/1M
API Output: $1.39/1M

Amazon Nova Lite 1.0

amazon/nova-lite-v1
Amazon's new lower cost model. Can handle up to 300k input tokens, with faster output but less thorough understanding than Amazon's Nova Pro.
API Input: $0.06/1M
API Output: $0.24/1M

Amazon Nova Micro 1.0

amazon/nova-micro-v1
Amazon's lowest cost model. Comparable to GPT-4o-mini and Gemini 1.5 Flash, with the fastest output.
API Input: $0.04/1M
API Output: $0.14/1M

Amazon Nova Pro 1.0

amazon/nova-pro-v1
Amazon's new flagship model. Can handle up to 300k input tokens, with comparable performance to ChatGPT and Claude 3.5 Sonnet.
API Input: $0.80/1M
API Output: $3.20/1M

Amoral Gemma3 27B v2

soob3123/amoral-gemma3-27B-v2
Amoral Gemma3 27B v2 is a 27B parameter model that is a more advanced version of Gemma3 27B.
API Input: $0.30/1M
API Output: $0.30/1M

Anubis 70B v1

TheDrummer/Anubis-70B-v1
L3.3 finetune for roleplaying.
API Input: $0.49/1M
API Output: $0.49/1M

Anubis Pro 105b v1

anubis-pro-105b-v1
An upscaled version of Llama 3.3 70B with 50% more layers. Finetuned further to make use of its new layers.
API Input: $0.80/1M
API Output: $1.00/1M

ArliAI RpR Ultra 235B

ArliAI-RpR-Ultra-235B
ArliAI's first big model, currently in preview/testing for roleplaying and storytelling
API Input: $0.51/1M
API Output: $0.51/1M

Athene V2 Chat

Nexusflow/Athene-V2-Chat
An open-weights LLM on-par with GPT-4o across benchmarks.
API Input: $0.49/1M
API Output: $0.49/1M

Azure gpt-4-turbo

azure-gpt-4-turbo
Azure version of OpenAI gpt-4-turbo. ⚠️ WARNING: OpenAI may retain and use data sent to this model for training purposes.
API Input: $10.00/1M
API Output: $30.00/1M

Code Examples

Get started with code samples in multiple languages

A simple example to get you started with the NanoGPT API

import requests

API_KEY = "YOUR_API_KEY"
BASE_URL = "https://nano-gpt.com/api/v1"

headers = {
    "Authorization": f"Bearer {API_KEY}",
    "Content-Type": "application/json"
}

# Simple chat completion
response = requests.post(
    f"{BASE_URL}/chat/completions",
    headers=headers,
    json={
        "model": "chatgpt-4o-latest",
        "messages": [
            {"role": "user", "content": "Hello, how are you?"}
        ]
    }
)

print(response.json())

Pricing

Simple, transparent pricing for all models

All prices are pay-as-you-go with no monthly fees. Prices shown are API prices per 1 million tokens for text models.

For detailed pricing information and bulk discounts, please contact our sales team or visit our Discord community.

Get notified about API updates.

We will only use this to contact you updates to how the API works. You can unsubscribe at any time.