Discover AI language models for conversations, coding, and creative writing
GPT 5.3 Codex
Coding-focused GPT-5.3 variant with optimized routing.
Features
Context
400.0K
Max Output
128.0K
Date Added
Feb 24, 2026
Pricing
Input:
$1.75/1M
Output:
$14.00/1M
Cache:
Read $0.18/1M
Est./msg:
$0.0088
Qwen3.5 122B A10B
Qwen3.5 122B A10B is Alibaba's high-end native vision-language model in the Qwen 3.5 family with strong text and multimodal performance.
Benchmarks (Artificial Analysis)
Intelligence
39.9
Coding
37.4
Speed
81.0
Features
Context
260.1K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.40/1M
Output:
$3.20/1M
Est./msg:
$0.0020
Qwen3.5 122B A10B Thinking
Qwen3.5 122B A10B with extended reasoning enabled. Alibaba's high-end native vision-language model in the Qwen 3.5 family.
Benchmarks (Artificial Analysis)
Intelligence
32.5
Coding
24.5
Math
82.3
Speed
60.1
Features
Context
260.1K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.40/1M
Output:
$3.20/1M
Est./msg:
$0.0020
Qwen3.5 27B
Qwen3.5 27B is a native vision-language dense model optimized for fast responses while balancing quality and inference speed.
Benchmarks (Artificial Analysis)
Intelligence
39.9
Coding
37.4
Speed
81.0
Features
Context
260.1K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.30/1M
Output:
$2.40/1M
Est./msg:
$0.0015
Qwen3.5 27B Thinking
Qwen3.5 27B with extended reasoning enabled. A native vision-language dense model optimized for fast responses.
Benchmarks (Artificial Analysis)
Intelligence
32.5
Coding
24.5
Math
82.3
Speed
60.1
Features
Context
260.1K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.30/1M
Output:
$2.40/1M
Est./msg:
$0.0015
Qwen3.5 35B A3B
Qwen3.5 35B A3B is a native vision-language MoE model with hybrid attention designed for efficient inference and strong general performance.
Benchmarks (Artificial Analysis)
Intelligence
39.9
Coding
37.4
Speed
81.0
Features
Context
260.1K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.25/1M
Output:
$2.00/1M
Est./msg:
$0.0013
Qwen3.5 35B A3B Thinking
Qwen3.5 35B A3B with extended reasoning enabled. A native vision-language MoE model with hybrid attention.
Benchmarks (Artificial Analysis)
Intelligence
32.5
Coding
24.5
Math
82.3
Speed
60.1
Features
Context
260.1K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.25/1M
Output:
$2.00/1M
Est./msg:
$0.0013
Qwen3.5 Flash
Qwen3.5 Flash is the fastest and cheapest native Qwen 3.5 vision-language model with a 1M-token context window.
Benchmarks (Artificial Analysis)
Intelligence
39.9
Coding
37.4
Speed
81.0
Features
Context
991.8K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.10/1M
Output:
$0.40/1M
Est./msg:
$0.0003
Qwen3.5 Flash Thinking
Qwen3.5 Flash with extended reasoning enabled. The fastest and cheapest native Qwen 3.5 vision-language model.
Benchmarks (Artificial Analysis)
Intelligence
32.5
Coding
24.5
Math
82.3
Speed
60.1
Features
Context
991.8K
Max Output
65.5K
Date Added
Feb 24, 2026
Pricing
Input:
$0.10/1M
Output:
$0.40/1M
Est./msg:
$0.0003
Gemini 3.1 Pro (Preview High)
Gemini 3.1 Pro preview high-reasoning variant.
Benchmarks (Artificial Analysis)
Intelligence
57.0
Coding
55.5
Speed
109.2
Features
Context
1.0M
Max Output
65.5K
Date Added
Feb 21, 2026
Pricing
Input:
$2.00/1M
Output:
$12.00/1M
Cache:
Read $0.20/1M
Est./msg:
$0.0080
Gemini 3.1 Pro (Preview Low)
Gemini 3.1 Pro preview low-reasoning variant.
Benchmarks (Artificial Analysis)
Intelligence
57.0
Coding
55.5
Speed
109.2
Features
Context
1.0M
Max Output
65.5K
Date Added
Feb 21, 2026
Pricing
Input:
$2.00/1M
Output:
$12.00/1M
Cache:
Read $0.20/1M
Est./msg:
$0.0080
Gemini 3.1 Pro (Preview)
Gemini 3.1 Pro preview is built for tasks where simple answers are not enough. Stronger core reasoning for complex coding, math, and long-context workflows, with multimodal support and a reported 77.1% verified score on ARC-AGI-2. NOTE: Inputs > 200k tokens are charged at 2x input and 1.5x output rates.
Benchmarks (Artificial Analysis)
Intelligence
57.0
Coding
55.5
Speed
109.2
Features
Context
1.0M
Max Output
65.5K
Date Added
Feb 19, 2026
Pricing
Input:
$2.00/1M
Output:
$12.00/1M
Cache:
Read $0.20/1M
Est./msg:
$0.0080
Claude Sonnet 4.6
Claude Sonnet 4.6 is Anthropic's most capable Sonnet yet — a full upgrade across coding, computer use, long‑context reasoning, agent planning, and design. Features a 1M token context window in beta. Approaches Opus‑level intelligence at Sonnet pricing.
Benchmarks (Artificial Analysis)
Intelligence
44.3
Coding
46.4
Speed
55.4
Features
Context
1.0M
Max Output
128.0K
Date Added
Feb 17, 2026
Pricing
Input:
$2.99/1M
Output:
$14.99/1M
Cache:
Read $0.30/1M · Write $3.74/1M (5m) / $5.98/1M (1h)
Est./msg:
$0.0105
Claude Sonnet 4.6 Thinking
Claude Sonnet 4.6 with extended thinking enabled for tougher coding, planning, and multi‑tool tasks. Ideal for long‑horizon agent workflows and complex problem solving.
Features
Context
1.0M
Max Output
128.0K
Date Added
Feb 17, 2026
Pricing
Input:
$2.99/1M
Output:
$14.99/1M
Cache:
Read $0.30/1M · Write $3.74/1M (5m) / $5.98/1M (1h)
Est./msg:
$0.0105
Qwen3.5 397B A17B
Qwen 3.5's open-source 397B MoE model (17B active params) with hybrid linear attention. Runs via open-source providers. Supports text, image, and video input with a 256K context window.
Benchmarks (Artificial Analysis)
Intelligence
39.9
Coding
37.4
Speed
81.0
Features
Context
258.0K
Max Output
65.5K
Date Added
Feb 16, 2026
Pricing
Input:
$0.60/1M
Output:
$3.60/1M
Est./msg:
$0.0024
Qwen3.5 397B A17B Thinking
Qwen 3.5's open-source 397B MoE model (17B active params) with hybrid linear attention and extended reasoning. Runs via open-source providers. Supports text, image, and video input with a 256K context window.
Benchmarks (Artificial Analysis)
Intelligence
45.0
Coding
41.3
Speed
62.9
Features
Context
258.0K
Max Output
65.5K
Date Added
Feb 16, 2026
Pricing
Input:
$0.60/1M
Output:
$3.60/1M
Est./msg:
$0.0024
Qwen3.5 Plus
Qwen 3.5 Plus is a commercial model with hybrid linear attention and sparse MoE architecture. Supports text, image, and video input with a 1M context window.
Benchmarks (Artificial Analysis)
Intelligence
39.9
Coding
37.4
Speed
81.0
Features
Context
983.6K
Max Output
65.5K
Date Added
Feb 16, 2026
Pricing
Input:
$0.40/1M
Output:
$2.40/1M
Est./msg:
$0.0016
Qwen3.5 Plus Thinking
Qwen 3.5 Plus with extended reasoning. A commercial model with hybrid linear attention and sparse MoE architecture. Supports text, image, and video input with a 1M context window.
Features
Context
983.6K
Max Output
65.5K
Date Added
Feb 16, 2026
Pricing
Input:
$0.40/1M
Output:
$2.40/1M
Est./msg:
$0.0016
Doubao Seed 2.0 Code Preview
Code-focused preview model in the Doubao Seed 2.0 family. Supports a 256k context window and up to 128k output tokens. ⚠️ Note: This model routes through ByteDance, a Chinese entity - privacy and logging guarantees may be limited.
Context
256.0K
Max Output
128.0K
Date Added
Feb 14, 2026
Pricing
Input:
$0.78/1M
Output:
$3.89/1M
Est./msg:
$0.0027
Doubao Seed 2.0 Lite
Lower-cost variant in the Doubao Seed 2.0 family for fast general usage. Supports a 256k context window and up to 32k output tokens. ⚠️ Note: This model routes through ByteDance, a Chinese entity - privacy and logging guarantees may be limited.
Context
256.0K
Max Output
32.0K
Date Added
Feb 14, 2026
Pricing
Input:
$0.15/1M
Output:
$0.87/1M
Est./msg:
$0.0006
Doubao Seed 2.0 Mini
Smallest and most affordable model in the Doubao Seed 2.0 family. Supports a 256k context window and up to 32k output tokens. ⚠️ Note: This model routes through ByteDance, a Chinese entity - privacy and logging guarantees may be limited.
Context
256.0K
Max Output
32.0K
Date Added
Feb 14, 2026
Pricing
Input:
$0.05/1M
Output:
$0.48/1M
Est./msg:
$0.0003
Doubao Seed 2.0 Pro
Highest-capability general model in the Doubao Seed 2.0 family. Supports a 256k context window and up to 128k output tokens. ⚠️ Note: This model routes through ByteDance, a Chinese entity - privacy and logging guarantees may be limited.
Context
256.0K
Max Output
128.0K
Date Added
Feb 14, 2026
Pricing
Input:
$0.78/1M
Output:
$3.88/1M
Est./msg:
$0.0027
Molmo 2 8B
Allen Institute's open-source Molmo 2 8B, a vision-language model built on Qwen3-8B with SigLIP 2 vision backbone. Supports image, video, and multi-image understanding with strong spatial grounding and object tracking. Included in the subscription.
Benchmarks (Artificial Analysis)
Coding
4.4
Speed
114.4
Features
Context
36.9K
Max Output
36.9K
Date Added
Feb 14, 2026
Pricing
Input:
$0.20/1M
Output:
$0.20/1M
Est./msg:
$0.0003
MiniMax M2.5
MiniMax M2.5 is a productivity-focused flagship model that builds on M2.1 with stronger coding and real-world office workflow performance (Word, Excel, PowerPoint), plus better tool-use planning and token efficiency.
Benchmarks (Artificial Analysis)
Intelligence
42.0
Coding
37.4
Speed
53.3
Features
Context
204.8K
Max Output
131.1K
Date Added
Feb 12, 2026
Pricing
Input:
$0.30/1M
Output:
$1.20/1M
Est./msg:
$0.0009
GLM 5
GLM-5 is Zhipu's latest flagship model with advanced reasoning and instruction following. This is the open-source hosted version and it is included in the subscription.
Benchmarks (Artificial Analysis)
Intelligence
49.6
Coding
44.2
Speed
70.3
Features
Context
200.0K
Max Output
128.0K
Date Added
Feb 11, 2026
Pricing
Input:
$0.30/1M
Output:
$2.55/1M
Est./msg:
$0.0016
View Providers
GLM 5 Thinking
GLM-5 with extended thinking capabilities for complex reasoning. This is the open-source hosted version and it is included in the subscription.
Benchmarks (Artificial Analysis)
Intelligence
49.6
Coding
44.2
Speed
70.3
Features
Context
200.0K
Max Output
128.0K
Date Added
Feb 11, 2026
Pricing
Input:
$0.30/1M
Output:
$2.55/1M
Est./msg:
$0.0016
View Providers
GLM 5 Original
GLM-5 is Zhipu's latest flagship model with advanced reasoning and instruction following. Routed directly via Z-AI (Zhipu).
Benchmarks (Artificial Analysis)
Intelligence
49.6
Coding
44.2
Speed
70.3
Features
Context
200.0K
Max Output
128.0K
Date Added
Feb 11, 2026
Pricing
Input:
$1.00/1M
Output:
$3.20/1M
Est./msg:
$0.0026
View Providers
GLM 5 Original Thinking
GLM-5 original with extended thinking capabilities for complex reasoning.
Benchmarks (Artificial Analysis)
Intelligence
49.6
Coding
44.2
Speed
70.3
Features
Context
200.0K
Max Output
128.0K
Date Added
Feb 11, 2026
Pricing
Input:
$1.00/1M
Output:
$3.20/1M
Est./msg:
$0.0026
View Providers
GLM 5 TEE
GLM-5 is Z.AI's flagship model for complex systems engineering and long-horizon agent workflows. Running inside a TEE (Trusted Execution Environment), with verifiably no logging by the provider.
Benchmarks (Artificial Analysis)
Intelligence
49.6
Coding
44.2
Speed
70.3
Context
203.0K
Max Output
65.5K
Date Added
Feb 11, 2026
Pricing
Input:
$1.20/1M
Output:
$3.50/1M
Est./msg:
$0.0029
Claude 4.6 Opus
Claude Opus 4.6 is Anthropic's newest Opus model with stronger coding and agentic performance, plus a 1M-token context window in beta.
Benchmarks (Artificial Analysis)
Intelligence
46.4
Coding
47.6
Speed
67.3
Features
Context
1.0M
Max Output
128.0K
Date Added
Feb 5, 2026
Pricing
Input:
$5.00/1M
Output:
$25.01/1M
Cache:
Read $0.50/1M · Write $6.25/1M (5m) / $10.00/1M (1h)
Est./msg:
$0.0175