API
API keys
Generate up to 5 API keys to use NanoGPT in other applications. If you require more keys, please contact us at support@nano-gpt.com and we will help you out.
Authenticate by including your API key as a HTTP header: x-api-key: API_KEY
Name | Status | Created | API Key |
---|
Get notified about API updates.
We will only use this to contact you updates to how the API works. You can unsubscribe at any time.
API Reference
Documentation is not yet complete. The below example code can be used in Python, NanoGPTjs is a great starting point for JS users.
If you encounter issues or need further information please contact support@nano-gpt.com
Text models
POST https://nano-gpt.com/api/talk-to-gpt
Name | Model | Description |
---|---|---|
ChatGPT 4o | chatgpt-4o-latest | OpenAI's current recommended model, the well-known ChatGPT. |
Gemini Experimental | gemini-exp-1114 | Google's newest experimental model as of 14 November 2024. Tops the leaderboards on many independent benchmarks. |
OpenAI o1 | o1-preview | OpenAI's new flagship series of reasoning models for solving hard problems. Useful when tackling complex problems in science, coding, math, and similar fields |
OpenAI o1-mini | o1-mini | A fast, cost-efficient version of OpenAI's o1 reasoning model tailored to coding, math, and science use cases. |
Gemini 1.5 Pro | google/gemini-pro-1.5 | Google's next-generation model with a breakthrough 4 million context window. Comparable to GPT-4o. |
Gemini 1.5 Pro Exp | google/gemini-pro-1.5-exp | Experimental version of Google's next-generation model with a breakthrough 4 million context window. Comparable to GPT-4o. |
Grok 2 | x-ai/grok-beta | Grok-2 is xAI's frontier language model, the one used on X. Claims state-of-the-art reasoning capabilities, best for complex and multi-step use cases. |
Claude 3.5 Sonnet | claude-3-5-sonnet-20241022 | Anthropic's updated most intelligent model, offering even better results on many subjects than GPT-4o. |
Claude 3.5 Sonnet Old | claude-3-5-sonnet-20240620 | Anthropic's most intelligent model, offering even better results on many subjects than GPT-4o. |
Claude 3.5 Haiku | claude-3-5-haiku-20241022 | Anthropic's updated faster and cheaper model, offering good results on chatbots and coding. |
Claude 3 Opus | claude-3-opus-20240229 | Anthropic's flagship model, outperforming GPT-4 on most benchmarks. |
Yi Lightning | yi-lightning | Chinese-developed multilingual (English, Chinese and others) model by 01.ai that's very fast and cheap, yet scores high on independent leaderboards. |
GPT 4o mini | gpt-4o-mini | OpenAI's most cost-efficient small model. Cheaper and smarter than GPT-3.5 (the original ChatGPT), but less performant than gpt-4o |
GLM-4 Plus | glm-4-plus | GLM high-intelligence flagship model with 128K context window |
Nvidia Nemotron | nvidia/llama-3.1-nemotron-70b-instruct | Nvidia's latest Llama fine-tune optimized for instruction following. Early results hints that it might outperform models such as GPT-4o and Claude 3.5 Sonnet. |
Llama 3.1 Large | Meta-Llama-3-1-405B-Instruct-FP8 | Note: comes with a 90% discount currently, enjoy! Meta's largest Llama 3.1 405B model. Open-source, run through an open permissionless crypto network (no central provider). |
Hermes 3 Large | nousresearch/hermes-3-llama-3.1-405b | Llama 3.1 405b with the brakes taken off. An uncensored model, aligned to the user. |
Qwen Turbo | qwen-turbo | Alibaba's fastest and cheapest model. Suitable for simple tasks, fast and low cost, with a 1 million token context window. |
Qwen Max | qwen-max | Alibaba's flagship model. Suitable for complex tasks, with the strongest reasoning ability |
Qwen Plus | qwen-plus | Alibaba's balanced model. Fast, cheap, yet still very powerful. |
Qwen Long 10M | qwen-long | Alibaba's huge context window model. Takes in up to 10 million tokens, which is equivalent to dozens of books. |
Qwen 2.5 Coder 32b | qwen/qwen-2.5-coder-32b-instruct | The latest series of Code-Specific Qwen large language models. |
Yi Large | yi-large | Large version of Yi Lightning with a 32k context window, but more expensive. |
Yi Medium 200k | yi-medium-200k | Medium version of Yi Lightning with a huge 200k context window |
Yi Medium | yi-medium | Medium version of Yi. |
Yi Spark | yi-spark | Small and powerful, lightweight and fast model. Provides enhanced mathematical operation and code writing capabilities. |
Mistral Large 2411 | mistralai/mistral-large | Upgrade to Mistral's flagship model. It is fluent in English, French, Spanish, German, and Italian, with high grammatical accuracy, with a long context window. |
Lumimaid v0.2 | neversleep/llama-3.1-lumimaid-70b | Upgrade to Llama 3 Lumimaid 70B. A Llama 3.1 70B finetune trained on curated roleplay data. Extremely uncensored and suitable for NSFW. |
Inflection 3 Pi | inflection/inflection-3-pi | A chatbot with emotional intelligence. Has access to recent news, excels in scenarios like customer support and roleplay. Mirrors your conversation style. |
Inflection 3 Productivity | inflection/inflection-3-productivity | Optimized for instruction following. Good at tasks that require precise adherence to provided guidelines. Has access to recent news. |
WizardLM-2 8x22B | microsoft/wizardlm-2-8x22b | Microsoft's advanced Wizard model. The most popular role-playing model. |
SorcererLM 8x22B | raifle/sorcererlm-8x22b | Advanced roleplaying model with reasoning and emotional intelligence for engaging interactions, contextual awareness and enhanced narrative depth |
Llama 3.1 Large | accounts/fireworks/models/llama-v3p1-405b-instruct | Meta's largest and most capable Llama model. Competitive with GPT-4o and Claude 3.5 Sonnet. |
GPT 4o | gpt-4o-2024-08-06 | OpenAI's precusor to ChatGPT-4o. Great on English text and code, with significant improvements on text in non-English languages. |
Llama 3.2 Medium | meta-llama/llama-3.2-90b-vision-instruct | Medium-size (and capability) version of Meta's newest model (3.2 series). |
Llama 3.1 Medium | accounts/fireworks/models/llama-v3p1-70b-instruct | Meta's updated version of their medium Llama model. Slightly lesser performance than Llama Large, but cheaper. |
Llama 3.1 Medium | llama-3.1-70b-instruct | Meta's GPT-4 level model. Cheaper than GPT-4 and Claude 3, with similar performance according to most. |
Perplexity Online | llama-3.1-sonar-huge-128k-online | The bigger version of the Perplexity model that is able to browse the web and access up-to-date information. |
Llama 3.1 Large | meta-llama/llama-3.1-405b-instruct | Meta's largest and most capable Llama model. Competitive with GPT-4o and Claude 3.5 Sonnet. |
Gemini 1.5 Flash | google/gemini-flash-1.5-exp | Experimental version of Google's fastest multimodal model with great performance for diverse, repetitive tasks and a 2 million words context window. |
Gemini 1.5 Flash | google/gemini-flash-1.5 | Google's fastest multimodal model with great performance for diverse, repetitive tasks and a 2 million words context window. |
Perplexity Online Medium | llama-3.1-sonar-large-128k-online | A Perplexity model that is able to browse the web and access up-to-date information. |
MythoMax 13B | gryphe/mythomax-l2-13b | One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. |
DeepSeek V2.5 | deepseek/deepseek-chat | Combination of DeepSeek V2 Chat and Coder, integrating capabilities from both. |
GLM-4 | glm-4 | High-intelligence model with 128K context window |
GLM-4 Long | glm-4-long | Extended context model supporting up to 1M tokens |
Yi Large Turbo | yi-large-turbo | Super cost-effective, excellent performance. Balanced high-precision tuning based on performance, inference speed, and cost. |
Qwen2.5 72B | qwen/qwen-2.5-72b-instruct | Great multilingual support, strong at mathematics and coding, supports roleplay and chatbots. |
EVA Qwen2.5 32B | eva-unit-01/eva-qwen-2.5-32b | Full-parameter finetune of Qwen2.5-32B on mixture of synthetic and natural data. It uses Celeste 70B 0.1 data mixture, greatly expanding it to improve versatility, creativity and flavor of the resulting model. |
EVA Qwen2.5 14B | eva-unit-01/eva-qwen-2.5-14b | Based on Qwen2.5-14b, specializing in RP and creative writing, fine-tuned with a mix of synthetic and natural data. |
Dolphin 2.6 Mixtral 8x7b | cognitivecomputations/dolphin-mixtral-8x7b | Designed for instruction following, conversational, and coding. |
GPT 4 Turbo | gpt-4-turbo-preview | Can take in the largest messages (up to 300 pages of context), and all round seen as one of the best in class models. |
GPT 4o | gpt-4o | OpenAI's most advanced model. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper. |
GPT 3.5 Turbo | gpt-3.5-turbo | Older model. Brought ChatGPT to the mainstream, seen as dated nowadays. 90% cheaper than GPT-4-Turbo, recommended for very simple tasks. |
Gemini 1.5 Flash | gemini-1.5-flash-001 | Google's fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million context window. |
Gemini 1.5 Pro | gemini-1.5-pro-001 | Google's next-generation model with a breakthrough 1 million context window. Comparable to GPT-4o. |
Playground | free-model | Use a randomly selected free model to test our service. |
Magnum v4 72B | anthracite-org/magnum-v4-72b | Upgraded model of Magnum V2 72B. From the creators of Goliath. Aimed at achieving prose quality similar to Claude Opus 3, trained on 55 million tokens of curated Roleplay data. |
Rocinante 12B | thedrummer/rocinante-12b | Designed for engaging storytelling and rich prose. Expanded vocabulary with unique and expressive word choices, enhanced creativity and captivating stories. |
Dolphin 2.9.2 Mixtral 8x22B | cognitivecomputations/dolphin-mixtral-8x22b | Successor to Dolphin 2.6 Mixtral 8x7b. Great for instruction following, conversational, and coding. |
Llama 3.1 70b Instruct | meta-llama/llama-3.1-70b-instruct | Optimized for high quality dialogue usecases. |
Llama 3.1 8b Instruct | meta-llama/llama-3.1-8b-instruct | Fast and efficient for simple purposes. |
L3 Euryale 70B | sao10k/l3-euryale-70b | A 70B parameter model from SAO10K, offering high-quality text generation. |
Mistral Tiny | mistralai/mistral-tiny | Powered by Mistral-7B-v0.2, best used for large batch processing tasks where cost is a significant factor but reasoning capabilities are not crucial. |
Mistral 7B Instruct | mistralai/mistral-7b-instruct | Optimized for speed with decent context length |
Llama 3 70b Instruct | meta-llama/llama-3-70b-instruct | Optimized for high quality dialogue usecases. |
WizardLM-2 7B | microsoft/wizardlm-2-7b | Finetune of Mistral 7B Instruct, very fast. |
Cohere: Command R | cohere/command-r | 35B parameter model that performs conversational language tasks at a higher quality, more reliably, and with a longer context than previous models. It can be used for complex workflows like code generation, retrieval augmented generation (RAG), tool use, and agents |
Nous Hermes 3 70B Instruct | nousresearch/hermes-3-llama-3.1-70b | Generalist language model including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. |
Mistral Nemo | mistralai/mistral-nemo | 12B parameter model with multilingual support. |
Llama 3.2 3b Instruct | meta-llama/llama-3.2-3b-instruct | Small model optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization |
Llama 3 Lumimaid 70B | neversleep/llama-3-lumimaid-70b | A Llama 3 70B finetune trained on curated roleplay data. Extremely uncensored and suitable for NSFW. |
Magnum v2 72B | anthracite-org/magnum-v2-72b | From the creators of Goliath. Aimed at achieving prose quality similar to Claude Opus 3, trained on 55 million tokens of curated Roleplay data. |
Llama 3.1 8B (decentralized) | Meta-Llama-3-1-8B-Instruct-FP8 | Meta's Llama 3.1 8B model via an open permissionless network |
GLM-4 AirX | glm-4-airx | Fastest GLM-4 variant with 8K context window |
GLM-4 Air | glm-4-air | High-performance model with 128K context window |
GLM-4 FlashX | glm-4-flashx | Fast and cost-effective model with 128K context window |
GLM-4 Flash | glm-4-flash | Extremely cheap model with 128K context window |
Note! The endpoint for the Gemini models is /api/talk-to-gemini
Image models
POST https://nano-gpt.com/api/generate-image
Name | Model | Description |
---|---|---|
Recraft V3 | recraft-v3 | The current best scoring model across all image models tested. |
Flux Pro V1.1 | flux-pro/v1.1 | Excellent image quality, prompt adherence, and output diversity. |
Flux Pro V1.1 Ultra | flux-pro/v1.1-ultra | 4K version of Flux Pro V1.1. Excellent image quality, prompt adherence, and output diversity. |
Flux Schnell | flux/schnell | Fast and high-quality image generation - the cheaper version of the Flux range of models. |
SD 3.5 Large | stable-diffusion-v35-large | Stable Diffusion's newest model. Generates a wide variety of images reflecting different styles without complex prompting. |
Ideogram V2 | ideogram-ai/ideogram-v2 | An excellent image model with state of the art inpainting, prompt comprehension and especially text rendering. |
Ideogram V2 Turbo | ideogram-ai/ideogram-v2-turbo | A fast image model with state of the art inpainting, prompt comprehension and especially text rendering. |
Flux Realism | flux-realism | Incredibly photorealistic image generation. Generate people, animals, landscapes that are hard to distinguish from reality. |
DALL-E-3 | dall-e-3 | OpenAI's most well-known image model. |
DALL-E-3 HD | dall-e-3-hd | OpenAI's most well-known image model, now in HD quality. |
SD 3.5 Large Turbo | stable-diffusion-v35-large/turbo | Turbo version of Stable Diffusion's newest model. Faster and cheaper performance while still maintaining great prompt adherence and quality. |
Playground V2.5 | playground-v25 | Playground V2.5 outperforms SDXL in many user tests. Suitable for a broad range of images. |
Proteus | proteus-v0.2 | A versatile image generation model with high-quality outputs. |
Promptchan | promptchan | The best NSFW image generation. High-quality image generation with lots of customization options. |
Uber Realistic | uberRealisticPornMerge_urpmv12_4979.safetensors | Generates realistic-looking NSFW images. |
Stable Diffusion 3 Medium | sd3_base_medium.safetensors | Excels at photorealism, typography, and prompt following. Works best in 1024x1024. |
Dreamshaper XL | dreamshaper_8_93211.safetensors | Dreamshaper generates realistic and anime/illustration-style images, and is best suited to sci-fi and fantasy scenes. |
ReV Animated | revAnimated_v122.safetensors | ReV Animated specialized in fantasy, anime and semi-realistic landscapes. |
Stable Diffusion XL | fast-sdxl | Cheap and powerful text-to-image model that generates pictures rapidly. |
Flux Pro V1 | flux-pro | Older version of Flux V1.1. Exceptional quality and prompt adherence. |