API
API keys
Generate up to 5 API keys to use NanoGPT in other applications. If you require more keys, please contact us at support@nano-gpt.com and we will help you out.
Authenticate by including your API key as a HTTP header: x-api-key: API_KEY
Name | Status | Created | API Key |
---|
API Reference
Documentation is not yet complete. The below example code can be used in Python, NanoGPTjs is a great starting point for JS users.
If you encounter issues or need further information please contact support@nano-gpt.com
Text models
POST https://nano-gpt.com/api/talk-to-gpt
Name | Model | Description |
---|---|---|
ChatGPT 4o | chatgpt-4o-latest | OpenAI's current recommended model, the well-known ChatGPT. |
OpenAI o1 | o1-preview | OpenAI's new flagship series of reasoning models for solving hard problems. Useful when tackling complex problems in science, coding, math, and similar fields |
OpenAI o1-mini | o1-mini | A fast, cost-efficient version of OpenAI's o1 reasoning model tailored to coding, math, and science use cases. |
Claude 3.5 Sonnet | claude-3-5-sonnet-20240620 | Anthropic's most intelligent model, offering even better results on many subjects than GPT-4o. |
Gemini 1.5 Pro Old | google/gemini-pro-1.5 | Older version of Google's next-generation model with a breakthrough 4 million context window. Comparable to GPT-4o. |
Gemini 1.5 Pro | google/gemini-pro-1.5-exp | Google's next-generation model with a breakthrough 4 million context window. Comparable to GPT-4o. |
Llama 3.1 Large | Meta-Llama-3-1-405B-Instruct-FP8 | Meta's largest Llama 3.1 405B model via an open permissionless network. Note: in testing phase, therefore temporarily 90% discounted. |
Llama 3.2 Medium | meta-llama/llama-3.2-90b-vision-instruct | Medium-size (and capability) version of Meta's newest model (3.2 series). |
Grok 2 | x-ai/grok-2 | Grok-2 is xAI's frontier language model, the one used on X. Claims state-of-the-art reasoning capabilities, best for complex and multi-step use cases. |
Llama 3 Lumimaid 70B | neversleep/llama-3-lumimaid-70b | A Llama 3 70B finetune trained on curated roleplay data. Extremely uncensored and suitable for NSFW. |
WizardLM-2 8x22B | microsoft/wizardlm-2-8x22b | Microsoft's advanced Wizard model. The most popular role-playing model. |
Llama 3.1 Large | accounts/fireworks/models/llama-v3p1-405b-instruct | Meta's largest and most capable Llama model. Competitive with GPT-4o and Claude 3.5 Sonnet. |
GPT 4o | gpt-4o-2024-08-06 | OpenAI's precusor to ChatGPT-4o. Great on English text and code, with significant improvements on text in non-English languages. |
Llama 3.1 Medium | accounts/fireworks/models/llama-v3p1-70b-instruct | Meta's updated version of their medium Llama model. Slightly lesser performance than Llama Large, but cheaper. |
Llama 3.1 Medium | llama-3.1-70b-instruct | Meta's GPT-4 level model. Cheaper than GPT-4 and Claude 3, with similar performance according to most. |
GPT 4o mini | gpt-4o-mini | OpenAI's most cost-efficient small model. Cheaper and smarter than GPT-3.5 (the original ChatGPT), but less performant than gpt-4o |
Perplexity Online | llama-3.1-sonar-huge-128k-online | The bigger version of the Perplexity model that is able to browse the web and access up-to-date information. |
Llama 3.1 Large | meta-llama/llama-3.1-405b-instruct | Meta's largest and most capable Llama model. Competitive with GPT-4o and Claude 3.5 Sonnet. |
Claude 3 Opus | claude-3-opus-20240229 | Anthropic's flagship model, outperforming GPT-4 on most benchmarks. |
Gemini 1.5 Flash | google/gemini-flash-1.5-exp | Google's fastest multimodal model with great performance for diverse, repetitive tasks and a 4 million context window. |
Gemini 1.5 Flash | google/gemini-flash-1.5 | Older version of Google's fastest multimodal model with great performance for diverse, repetitive tasks and a 4 million context window. |
Perplexity Online Medium | llama-3.1-sonar-large-128k-online | A Perplexity model that is able to browse the web and access up-to-date information. |
Hermes 3 Large | nousresearch/hermes-3-llama-3.1-405b | Llama 3.1 405b with the brakes taken off. An uncensored model, aligned to the user. |
Hermes 3 Large | nousresearch/hermes-3-llama-3.1-405b:extended | Llama 3.1 405b with the brakes taken off. An uncensored model, aligned to the user. |
MythoMax 13B | gryphe/mythomax-l2-13b | One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. |
DeepSeek V2.5 | deepseek/deepseek-chat | Combination of DeepSeek V2 Chat and Coder, integrating capabilities from both. |
Qwen2.5 72B | qwen/qwen-2.5-72b-instruct | Great multilingual support, strong at mathematics and coding, supports roleplay and chatbots. |
EVA Qwen2.5 14B | eva-unit-01/eva-qwen-2.5-14b | Based on Qwen2.5-14b, specializing in RP and creative writing, fine-tuned with a mix of synthetic and natural data. |
Dolphin 2.6 Mixtral 8x7b | cognitivecomputations/dolphin-mixtral-8x7b | Designed for instruction following, conversational, and coding. |
GPT 4 Turbo | gpt-4-turbo-preview | Can take in the largest messages (up to 300 pages of context), and all round seen as one of the best in class models. |
GPT 4o | gpt-4o | OpenAI's most advanced model. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper. |
GPT 3.5 Turbo | gpt-3.5-turbo | Older model. Brought ChatGPT to the mainstream, seen as dated nowadays. 90% cheaper than GPT-4-Turbo, recommended for very simple tasks. |
Gemini 1.5 Flash | gemini-1.5-flash-001 | Google's fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million context window. |
Gemini 1.5 Pro | gemini-1.5-pro-001 | Google's next-generation model with a breakthrough 1 million context window. Comparable to GPT-4o. |
Playground | free-model | Use a randomly selected free model to test our service. |
Magnum v2 72B | anthracite-org/magnum-v2-72b | From the creators of Goliath. Aimed at achieving prose quality similar to Claude Opus 3, trained on 55 million tokens of curated Roleplay data. |
Rocinante 12B | thedrummer/rocinante-12b | Designed for engaging storytelling and rich prose. Expanded vocabulary with unique and expressive word choices, enhanced creativity and captivating stories. |
Dolphin 2.9.2 Mixtral 8x22B | cognitivecomputations/dolphin-mixtral-8x22b | Successor to Dolphin 2.6 Mixtral 8x7b. Great for instruction following, conversational, and coding. |
Llama 3.1 70b Instruct | meta-llama/llama-3.1-70b-instruct | Optimized for high quality dialogue usecases. |
Llama 3.1 8b Instruct | meta-llama/llama-3.1-8b-instruct | Fast and efficient for simple purposes. |
L3 Euryale 70B | sao10k/l3-euryale-70b | A 70B parameter model from SAO10K, offering high-quality text generation. |
Mistral Tiny | mistralai/mistral-tiny | Powered by Mistral-7B-v0.2, best used for large batch processing tasks where cost is a significant factor but reasoning capabilities are not crucial. |
Mistral 7B Instruct | mistralai/mistral-7b-instruct | Optimized for speed with decent context length |
Llama 3 70b Instruct | meta-llama/llama-3-70b-instruct | Optimized for high quality dialogue usecases. |
WizardLM-2 7B | microsoft/wizardlm-2-7b | Finetune of Mistral 7B Instruct, very fast. |
Cohere: Command R | cohere/command-r | 35B parameter model that performs conversational language tasks at a higher quality, more reliably, and with a longer context than previous models. It can be used for complex workflows like code generation, retrieval augmented generation (RAG), tool use, and agents |
Nous Hermes 3 70B Instruct | nousresearch/hermes-3-llama-3.1-70b | Generalist language model including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. |
Mistral Nemo | mistralai/mistral-nemo | 12B parameter model with multilingual support. |
Llama 3.2 3b Instruct | meta-llama/llama-3.2-3b-instruct | Small model optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization |
Llama 3.1 8B (decentralized) | Meta-Llama-3-1-8B-Instruct-FP8 | Meta's Llama 3.1 8B model via an open permissionless network |
Note! The endpoint for the Gemini models is /api/talk-to-gemini
Image models
POST https://nano-gpt.com/api/generate-image
Name | Model | Description |
---|---|---|
Flux Pro V1.1 | flux-pro/v1.1 | The current best scoring model across all image models tested. |
Flux Schnell | flux/schnell | Fast and high-quality image generation - the cheaper version of the Flux range of models. |
DALL-E-3 | dall-e-3 | OpenAI's most well-known image model. |
DALL-E-3 HD | dall-e-3-hd | OpenAI's most well-known image model, now in HD quality. |
Flux Realism | flux-realism | Incredibly photorealistic image generation. Generate people, animals, landscapes that are hard to distinguish from reality. |
Playground V2.5 | playground-v25 | Playground V2.5 outperforms SDXL in many user tests. Suitable for a broad range of images. |
Proteus | proteus-v0.2 | A versatile image generation model with high-quality outputs. |
Realistic Vision V5.1 | realisticVisionV51_v51VAE_94301.safetensors | Realistic Vision generates realistic-looking humans. It can also generate realistic objects, animals and landscapes. |
Uber Realistic | uberRealisticPornMerge_urpmv12_4979.safetensors | Generates realistic-looking NSFW images. |
Stable Diffusion 3 Medium | sd3_base_medium.safetensors | Excels at photorealism, typography, and prompt following. Works best in 1024x1024. |
Dreamshaper XL | dreamshaper_8_93211.safetensors | Dreamshaper generates realistic and anime/illustration-style images, and is best suited to sci-fi and fantasy scenes. |
ReV Animated | revAnimated_v122.safetensors | ReV Animated specialized in fantasy, anime and semi-realistic landscapes. |
Stable Diffusion XL | fast-sdxl | Cheap and powerful text-to-image model that generates pictures rapidly. |
Flux Pro V1 | flux-pro | Older version of Flux V1.1. Exceptional quality and prompt adherence. |