Discover AI language models for conversations, coding, and creative writing
Kimi K2.5 TEE
Kimi K2.5 is Moonshot AI's native multimodal model built on Kimi K2 with ~15T mixed visual and text tokens, delivering strong general reasoning, visual coding, and agentic tool-calling. Running inside a TEE (Trusted Execution Environment), with verifiably no logging by the provider.
Context
128.0K
Max Output
65.5K
Date Added
Jan 29, 2026
Pricing
Input:
$0.60/1M
Output:
$3.00/1M
Est./msg:
$0.0021
Kimi K2.5 Thinking TEE
Kimi K2.5 with extended thinking capabilities. Moonshot AI's native multimodal model with strong general reasoning, visual coding, and agentic tool-calling. Running inside a TEE (Trusted Execution Environment), with verifiably no logging by the provider.
Features
Context
128.0K
Max Output
65.5K
Date Added
Jan 29, 2026
Pricing
Input:
$0.60/1M
Output:
$3.00/1M
Est./msg:
$0.0021
GLM 4.7 TEE
GLM-4.7 is a next-gen GLM series text model with stronger reasoning, long-context chat, and reliable tool use. Running inside a TEE (Trusted Execution Environment), with verifiably no logging by the provider.
Benchmarks (Artificial Analysis)
Intelligence
34.1
Coding
32.0
Math
48.0
Speed
117.9
Context
131.0K
Max Output
65.5K
Date Added
Jan 29, 2026
Pricing
Input:
$0.85/1M
Output:
$3.30/1M
Est./msg:
$0.0025
Kimi K2.5 (Official API)
Official API route for Kimi K2.5. Uses the original Moonshot AI model version and routes directly to the Moonshot API.
Benchmarks (Artificial Analysis)
Intelligence
46.8
Coding
39.5
Speed
110.3
Features
Context
256.0K
Max Output
65.5K
Date Added
Jan 27, 2026
Pricing
Input:
$0.60/1M
Output:
$3.00/1M
Est./msg:
$0.0021
View Providers
Kimi K2.5 Thinking (Official API)
Official API route for Kimi K2.5 Thinking. Uses the original Moonshot AI model version and routes directly to the Moonshot API.
Benchmarks (Artificial Analysis)
Intelligence
46.8
Coding
39.5
Speed
110.3
Features
Context
256.0K
Max Output
65.5K
Date Added
Jan 27, 2026
Pricing
Input:
$0.60/1M
Output:
$3.00/1M
Est./msg:
$0.0021
View Providers
Kimi K2.5
Kimi K2.5 is Moonshot AI's native multimodal model built on Kimi K2 with ~15T mixed visual and text tokens, delivering strong general reasoning, visual coding, and agentic tool-calling. This route uses instant (non-thinking) mode for faster responses.
Benchmarks (Artificial Analysis)
Intelligence
46.8
Coding
39.5
Speed
110.3
Features
Context
256.0K
Max Output
65.5K
Date Added
Jan 26, 2026
Pricing
Input:
$0.60/1M
Output:
$3.00/1M
Est./msg:
$0.0021
View Providers
Kimi K2.5 Thinking
Kimi K2.5 with thinking mode enabled. Built on Kimi K2 with ~15T mixed visual and text tokens, it excels at general reasoning, visual coding, and agentic tool-calling. Produces reasoning traces for complex multi-step workflows.
Benchmarks (Artificial Analysis)
Intelligence
46.8
Coding
39.5
Speed
110.3
Features
Context
256.0K
Max Output
65.5K
Date Added
Jan 26, 2026
Pricing
Input:
$0.60/1M
Output:
$3.00/1M
Est./msg:
$0.0021
View Providers
Qwen3 Max 2026-01-23
Qwen3 Max is Alibaba's flagship Qwen 3 reasoning model with native tool use (web search, web extractor, code interpreter) and a 256K context window.
Benchmarks (Artificial Analysis)
Intelligence
31.3
Coding
26.4
Math
80.7
Speed
31.1
Context
256.0K
Max Output
32.8K
Date Added
Jan 26, 2026
Pricing
Input:
$1.20/1M
Output:
$6.00/1M
Est./msg:
$0.0042
Olmo 3.1 32B Think
Allen Institute's open-source Olmo 3.1 32B Think, tuned for deep reasoning and multi-step problem solving with a 65k context window.
Benchmarks (Artificial Analysis)
Intelligence
14.2
Coding
9.8
Math
77.3
Speed
68.9
Features
Context
65.5K
Max Output
8.2K
Date Added
Jan 25, 2026
Pricing
Input:
$0.15/1M
Output:
$0.50/1M
Est./msg:
$0.0004
Olmo 3.1 32B Instruct
Allen Institute's open-source Olmo 3.1 32B Instruct, optimized for responsive instruction following and multi-turn dialogue with a 65k context window.
Context
65.5K
Max Output
8.2K
Date Added
Jan 25, 2026
Pricing
Input:
$0.20/1M
Output:
$0.60/1M
Est./msg:
$0.0005
MiniMax M2-her
Dialogue-focused MiniMax model optimized for role-playing and multi-turn conversations with rich role settings.
Context
65.5K
Max Output
2.0K
Date Added
Jan 24, 2026
Pricing
Input:
$0.30/1M
Output:
$1.21/1M
Est./msg:
$0.0009
L3.3 70B Loki v2.0
CrucibleLab's Loki v2.0 is a Llama 3.3 70B roleplay/storytelling finetune tuned for immersive, expressive prose with strong character consistency and long-form narrative flow.
Context
16.4K
Max Output
16.4K
Date Added
Jan 22, 2026
Pricing
Input:
$0.49/1M
Output:
$0.49/1M
Est./msg:
$0.0007
GLM 4.7 Flash Original
GLM-4.7-Flash is a lightweight 30B model optimized for coding and agentic tasks. Balances high performance with efficiency, perfect for local deployment. Routed directly via Z-AI (Zhipu) subscription.
Benchmarks (Artificial Analysis)
Intelligence
34.1
Coding
32.0
Math
48.0
Speed
117.9
Features
Context
200.0K
Max Output
128.0K
Date Added
Jan 19, 2026
Pricing
Input:
$0.07/1M
Output:
$0.40/1M
Est./msg:
$0.0003
View Providers
GLM 4.7 Flash Original Thinking
GLM-4.7-Flash with extended thinking capabilities for complex reasoning. Lightweight 30B model optimized for coding and agentic tasks.
Features
Context
200.0K
Max Output
128.0K
Date Added
Jan 19, 2026
Pricing
Input:
$0.07/1M
Output:
$0.40/1M
Est./msg:
$0.0003
View Providers
GLM 4.7 Flash
GLM-4.7-Flash is a lightweight 30B model optimized for coding and agentic tasks. Balances high performance with efficiency.
Benchmarks (Artificial Analysis)
Intelligence
34.1
Coding
32.0
Math
48.0
Speed
117.9
Features
Context
200.0K
Max Output
128.0K
Date Added
Jan 19, 2026
Pricing
Input:
$0.07/1M
Output:
$0.40/1M
Est./msg:
$0.0003
View Providers
GLM 4.7 Flash Thinking
GLM-4.7-Flash with extended thinking capabilities for complex reasoning. Lightweight 30B model optimized for coding and agentic tasks.
Benchmarks (Artificial Analysis)
Intelligence
42.0
Coding
36.3
Math
95.0
Speed
98.6
Features
Context
200.0K
Max Output
128.0K
Date Added
Jan 19, 2026
Pricing
Input:
$0.07/1M
Output:
$0.40/1M
Est./msg:
$0.0003
View Providers
GPT 5.2 Codex
Coding-focused GPT-5.2 variant with optimized routing.
Benchmarks (Artificial Analysis)
Intelligence
49.0
Coding
43.0
Speed
144.6
Features
Context
400.0K
Max Output
128.0K
Date Added
Jan 14, 2026
Pricing
Input:
$1.75/1M
Output:
$14.00/1M
Est./msg:
$0.0088
Hermes 3 70B
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, better roleplaying, reasoning, multi-turn conversation, and long context coherence. This 70B model is a competitive finetune of Llama-3.1-70B focused on aligning LLMs to the user with powerful steering capabilities.
Benchmarks (Artificial Analysis)
Intelligence
10.6
Speed
42.7
Context
65.5K
Max Output
8.2K
Date Added
Jan 7, 2026
Pricing
Input:
$0.41/1M
Output:
$0.41/1M
Est./msg:
$0.0006
View Providers
MiroThinker v1.5 235B
MiroThinker is the official implementation of the MiroMind Research Agent Project. It is an open-source search agent designed to advance tool-augmented reasoning and information-seeking capabilities, enabling complex real-world research workflows across diverse challenges.
Context
32.8K
Max Output
4.0K
Date Added
Jan 7, 2026
Pricing
Input:
$0.30/1M
Output:
$1.20/1M
Est./msg:
$0.0009
The Drummer Cydonia 24B v4.3
Cydonia 24B v4.3 continues TheDrummer's Cydonia series with updated tuning on Mistral Small.
Context
32.8K
Max Output
32.8K
Date Added
Dec 25, 2025
Pricing
Input:
$0.10/1M
Output:
$0.12/1M
Est./msg:
$0.0002
The Drummer Magidonia 24B v4.3
Magidonia 24B v4.3 is a new 24B Drummer finetune built for rich, creative roleplay.
Context
32.8K
Max Output
32.8K
Date Added
Dec 25, 2025
Pricing
Input:
$0.10/1M
Output:
$0.12/1M
Est./msg:
$0.0002
GLM 4.6 Derestricted v5
Derestricted GLM 4.6 tuned for open-ended creative writing and roleplay with relaxed filters.
Benchmarks (Artificial Analysis)
Intelligence
30.1
Coding
30.2
Math
44.3
Speed
29.3
Context
131.1K
Date Added
Dec 23, 2025
Pricing
Input:
$0.40/1M
Output:
$1.50/1M
Est./msg:
$0.0011
GLM 4.7 Original
GLM-4.7 is a next-gen GLM series text model with stronger reasoning, long-context chat, and reliable tool use. Routed directly via Z-AI (Zhipu).
Benchmarks (Artificial Analysis)
Intelligence
34.1
Coding
32.0
Math
48.0
Speed
117.9
Features
Context
200.0K
Max Output
65.5K
Date Added
Dec 22, 2025
Pricing
Input:
$0.15/1M
Output:
$0.80/1M
Est./msg:
$0.0006
GLM 4.7 Original Thinking
GLM-4.7 original with extended thinking capabilities for complex reasoning.
Benchmarks (Artificial Analysis)
Intelligence
42.0
Coding
36.3
Math
95.0
Speed
98.6
Features
Context
200.0K
Max Output
65.5K
Date Added
Dec 22, 2025
Pricing
Input:
$0.15/1M
Output:
$0.80/1M
Est./msg:
$0.0006
GLM 4.7
GLM-4.7 is a next-gen GLM series text model with stronger reasoning, long-context chat, and reliable tool use.
Benchmarks (Artificial Analysis)
Intelligence
34.1
Coding
32.0
Math
48.0
Speed
117.9
Features
Context
200.0K
Max Output
65.5K
Date Added
Dec 22, 2025
Pricing
Input:
$0.15/1M
Output:
$0.80/1M
Est./msg:
$0.0006
View Providers
GLM 4.7 Thinking
GLM-4.7 with extended thinking capabilities for enhanced reasoning on complex tasks.
Benchmarks (Artificial Analysis)
Intelligence
42.0
Coding
36.3
Math
95.0
Speed
98.6
Features
Context
200.0K
Max Output
65.5K
Date Added
Dec 22, 2025
Pricing
Input:
$0.15/1M
Output:
$0.80/1M
Est./msg:
$0.0006
View Providers
Manta Mini 1.0
Lightweight tier optimized for speed and cost.
Context
8.2K
Max Output
8.2K
Date Added
Dec 20, 2025
Pricing
Input:
$0.02/1M
Output:
$0.16/1M
Est./msg:
$0.0001
Manta Flash 1.0
The flagship model for balanced reasoning and context-rich dialogue. Perfect for AI roleplay, storytelling, and assistant tasks with a 16K window.
Context
16.4K
Max Output
16.4K
Date Added
Dec 20, 2025
Pricing
Input:
$0.02/1M
Output:
$0.16/1M
Est./msg:
$0.0001
Manta Pro 1.0
Tailored for deep reasoning, long-form generation, and RAG workloads. 32K token context window.
Context
32.8K
Max Output
32.8K
Date Added
Dec 20, 2025
Pricing
Input:
$0.06/1M
Output:
$0.50/1M
Est./msg:
$0.0003
MiniMax M2.1
MiniMax M2.1 builds on M2 with enhanced context understanding and improved complex tool use. 230B parameter MoE model (10B active) optimized for agentic workflows and long-horizon tasks.
Benchmarks (Artificial Analysis)
Intelligence
39.5
Coding
32.8
Math
82.7
Speed
72.1
Features
Context
200.0K
Max Output
131.1K
Date Added
Dec 19, 2025
Pricing
Input:
$0.33/1M
Output:
$1.32/1M
Est./msg:
$0.0010
View Providers