Private AI
Browse and discover the best AI language models for conversations, coding, and creative writing.
Kimi K2.7 Code
Kimi K2.7 Code is Moonshot AI's coding-focused agentic model built for long-horizon software engineering workflows. It supports native image input, tool calling, and forced thinking mode; instant/non-thinking mode is not supported.
Features
Context
262.1K
Max Output
65.5K
Date Added
Jun 12, 2026
Pricing
Input:
$0.95/1M
Output:
$4.00/1M
Est./msg:
$0.0029
Subscription
Included in subscription
· Uses 2x tokens per request
View Providers
Nvidia Nemotron 3 Ultra 550B TEE
Nvidia Nemotron 3 Ultra 550B running through Chutes TEE with encrypted transport and provider evidence support. Not included in the subscription.
Benchmarks (Artificial Analysis)
Intelligence
37.8
Coding
37.6
Speed
171.3
Features
Context
262.1K
Max Output
65.5K
Date Added
Jun 11, 2026
Performance
TPS
56
TTFT
8.7s
Pricing
Input:
$1.50/1M
Output:
$4.00/1M
Est./msg:
$0.0035
Subscription
Not included in subscription
MiMo V2.5 Pro UltraSpeed
MiMo V2.5 Pro UltraSpeed is Xiaomi's speed-focused 1T-parameter MiMo V2.5 Pro mode, built for near-instant coding assistance, real-time chat, live edits, and low-latency agent loops. Xiaomi reports up to roughly 1,000 tokens per second using its TileRT serving stack, FP4 expert quantization, and DFlash speculative decoding.
Benchmarks (Artificial Analysis)
Intelligence
42.2
Coding
45.5
Speed
44.8
Features
Context
1.0M
Max Output
131.1K
Date Added
Jun 10, 2026
Performance
TPS
354
TTFT
4.9s
Pricing
Input:
$1.50/1M
Output:
$3.00/1M
Cache:
Read $0.12/1M
Est./msg:
$0.0030
Subscription
Not included in subscription
DeepSeek V4 Flash TEE
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with a 1M-token context window. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Benchmarks (Artificial Analysis)
Intelligence
40.3
Coding
38.7
Speed
111.8
Context
1.0M
Max Output
1.0M
Date Added
Jun 9, 2026
Performance
TPS
74.4
TTFT
5.1s
Pricing
Input:
$0.20/1M
Output:
$0.40/1M
Cache:
Read $0.04/1M
Est./msg:
$0.0004
Subscription
Not included in subscription
Qwen3.6 35B A3B TEE
Qwen3.6 35B A3B is an open-weight MoE model from Alibaba's Qwen team with 35B total parameters and 3B active parameters per token. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Benchmarks (Artificial Analysis)
Intelligence
40.0
Coding
44.9
Speed
47.2
Context
262.1K
Max Output
262.1K
Date Added
Jun 9, 2026
Performance
TPS
105.1
TTFT
2.7s
Pricing
Input:
$0.20/1M
Output:
$1.27/1M
Est./msg:
$0.0008
Subscription
Not included in subscription
Linkup Research High
Linkup Research with high reasoning depth. Runs an async web research agent and returns a sourced answer for complex research tasks. Responses can take several minutes.
Max Output
32.8K
Date Added
Jun 5, 2026
Pricing
Fixed cost: $1.575
Subscription
Not included in subscription
Linkup Research Low
Linkup Research with low reasoning depth. Runs an async web research agent and returns a sourced answer for factual and multi-source questions. Responses can take several minutes.
Max Output
16.4K
Date Added
Jun 5, 2026
Pricing
Fixed cost: $0.2625
Subscription
Not included in subscription
Linkup Research Medium
Linkup Research with medium reasoning depth. Runs an async web research agent and returns a sourced answer for factual and multi-source questions. Responses can take several minutes.
Max Output
16.4K
Date Added
Jun 5, 2026
Pricing
Fixed cost: $0.525
Subscription
Not included in subscription
Linkup Research XHigh
Linkup Research with extra-high reasoning depth. Runs Linkup's most thorough async web research mode and returns a sourced answer. Responses can take several minutes.
Max Output
32.8K
Date Added
Jun 5, 2026
Pricing
Fixed cost: $2.625
Subscription
Not included in subscription
Nex N2 Pro
Nex AGI's open-source agentic reasoning model, post-trained on Qwen3.5-397B-A17B. It is built for agentic coding, software engineering, deep research, tool use, and long-horizon tasks with a 256K context window.
Features
Context
262.1K
Max Output
262.1K
Date Added
Jun 4, 2026
Performance
TPS
98.9
TTFT
9.8s
Pricing
Input:
$0.50/1M
Output:
$1.50/1M
Est./msg:
$0.0013
Subscription
Included in subscription
Nvidia Nemotron 3 Ultra 550B
Nvidia's Nemotron 3 Ultra 550B A55B model from the Nemotron 3 family. It uses a hybrid Mamba-Transformer MoE architecture. Provider-specific context limits vary, with the longest current route supporting up to 1M context.
Benchmarks (Artificial Analysis)
Intelligence
37.8
Coding
37.6
Speed
171.3
Features
Context
1.0M
Max Output
65.5K
Date Added
Jun 4, 2026
Pricing
Input:
$0.50/1M
Output:
$2.50/1M
Est./msg:
$0.0018
Subscription
Included in subscription
View Providers
Nvidia Nemotron 3 Ultra 550B Thinking
Nvidia's Nemotron 3 Ultra 550B A55B model from the Nemotron 3 family. It uses a hybrid Mamba-Transformer MoE architecture. Provider-specific context limits vary, with the longest current route supporting up to 1M context. Thinking enabled.
Benchmarks (Artificial Analysis)
Intelligence
37.8
Coding
37.6
Speed
171.3
Features
Context
1.0M
Max Output
65.5K
Date Added
Jun 4, 2026
Pricing
Input:
$0.50/1M
Output:
$2.50/1M
Est./msg:
$0.0018
Subscription
Included in subscription
View Providers
Qwen3.6 27B TEE
Qwen3.6 27B is a dense language model from Alibaba's Qwen team with text and image input, configurable thinking/reasoning behavior, and a native 262K context window. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Benchmarks (Artificial Analysis)
Intelligence
40.0
Coding
44.9
Speed
47.2
Features
Context
262.1K
Max Output
65.5K
Date Added
Jun 4, 2026
Pricing
Input:
$0.32/1M
Output:
$2.70/1M
Est./msg:
$0.0017
Subscription
Not included in subscription
MiMo V2.5 Pro Thinking
MiMo V2.5 Pro with Xiaomi thinking enabled for coding, long-context reasoning, and agentic orchestration.
Benchmarks (Artificial Analysis)
Intelligence
42.2
Coding
45.5
Speed
44.8
Features
Context
1.0M
Max Output
131.1K
Date Added
Jun 3, 2026
Pricing
Input:
$0.44/1M
Output:
$0.87/1M
Cache:
Read $0.04/1M
Est./msg:
$0.0009
Subscription
Included in subscription
View Providers
MiMo V2.5 Thinking
MiMo V2.5 with Xiaomi thinking enabled. It supports deep reasoning, tool calling, structured outputs, and web search with up to 1M context.
Benchmarks (Artificial Analysis)
Intelligence
40.1
Coding
42.1
Speed
79.5
Features
Context
1.0M
Max Output
131.1K
Date Added
Jun 3, 2026
Pricing
Input:
$0.14/1M
Output:
$0.28/1M
Cache:
Read $0.03/1M
Est./msg:
$0.0003
Subscription
Included in subscription
View Providers
Mistral Code Agent Latest
Mistral Code Agent Latest is Mistral's direct API alias for devstral-2512, an agentic coding model built for autonomous software engineering, tool use, and long-running code tasks.
Benchmarks (Artificial Analysis)
Intelligence
15.5
Coding
23.7
Math
36.7
Speed
75.2
Features
Context
262.1K
Max Output
32.8K
Date Added
Jun 2, 2026
Performance
TPS
77.9
TTFT
1.1s
Pricing
Input:
$0.40/1M
Output:
$2.00/1M
Est./msg:
$0.0014
Subscription
Included in subscription
Mistral Code Latest
Mistral Code Latest is Mistral's direct API alias for codestral-2508, a low-latency coding model for code generation, completion, fill-in-the-middle workflows, function calling, and structured output.
Features
Context
256.0K
Max Output
32.8K
Date Added
Jun 2, 2026
Pricing
Input:
$0.30/1M
Output:
$0.90/1M
Est./msg:
$0.0007
Subscription
Not included in subscription
MiniMax M3
MiniMax M3 is the non-thinking route for MiniMax's open-weights frontier model, built for coding, agent workflows, tool use, and multimodal understanding from step zero. It keeps native thinking disabled for faster direct answers. MiniMax reports 59.0% on SWE-Bench Pro and 66.0% on Terminal Bench 2.1, with Sparse Attention designed to scale context to 1M. It starts with a 512K context cap on NanoGPT for now.
Benchmarks (Artificial Analysis)
Intelligence
44.4
Coding
43.4
Speed
59.2
Features
Context
512.0K
Max Output
80.0K
Date Added
Jun 1, 2026
Pricing
Input:
$0.30/1M
Output:
$1.20/1M
Est./msg:
$0.0009
Subscription
Included in subscription
View Providers