Model Explorer
Browse the AI model landscape. Filter by capability, view timelines, and calculate costs.
Claude Fable 5
Anthropic's Mythos-class flagship with adaptive reasoning. Tops the Artificial Analysis index at launch.
Claude Haiku 4.5
Fastest Claude model. Great for high-volume, latency-sensitive applications.
Claude Opus 4.6
Most capable Claude model. Excels at complex reasoning, agentic tasks, and extended coding.
Claude Opus 4.7
Opus-class reasoning at a third of the old Opus price, with the context window stretched to 1M.
Claude Sonnet 4.5
Near-Opus intelligence at Sonnet pricing. Strong at coding, analysis, and general tasks.
DeepSeek R1
Open-source reasoning model. Competes with o1 at a fraction of the cost.
DeepSeek V3
Strong general-purpose model at extremely low cost. Open weights.
Gemini 2.0 Flash
Rock-bottom pricing with 1M context. Fast multimodal processing.
Gemini 2.5 Flash
Cheap thinking model with 1M context. Excellent value for reasoning tasks.
Gemini 2.5 Pro
Google's thinking model. Strong at coding and analysis, with a huge context window.
Gemini 3.5 Flash
Google's streaming-first Gemini 3.5 workhorse. Powers the Live API including real-time translation.
GLM-5
Z.ai's open-weights flagship. Self-hostable frontier-adjacent capability.
GPT-4.1
Million-token context, strong at coding and instruction following.
GPT-4o
Multimodal workhorse from OpenAI. Good all-rounder for text, vision, and audio.
GPT-5
Flagship OpenAI model. 400K context, strong across all tasks.
GPT-5 Mini
Budget GPT-5-class model with a large context window.
GPT-5 Nano
Cheapest frontier model. On-device capable, great for high-volume simple tasks.
GPT-5.5
OpenAI's flagship successor to GPT-5. Instant-mode routing picks effort per request.
Grok 4.3
xAI's fast frontier model. Aggressive pricing for the capability tier.
Kimi K2
Long-context reasoning model. Competitive pricing and strong performance.
Kimi K2.6
Moonshot's agent-focused update to K2. Strong tool use at commodity pricing.
Llama 4 Maverick
Strong open-source contender. 1M context, multimodal, competitive with proprietary models.
Llama 4 Scout
10M token context. MoE architecture, self-hostable, multimodal.
Mistral Large 3
European flagship. Excellent multilingual support, 256K context, strong at code.
Mistral Small 3.1
Very cheap and fast. Punches above its weight for simple to medium tasks.
o3
Deep reasoning model. Excels at maths, science, coding, and logic problems.
o3-mini
Budget reasoning model. Good value for tasks that need structured thinking.
Qwen 2.5 72B
Top-tier Chinese open-source model. Strong at coding and multilingual tasks.
28 of 28 models shown. Capability scores are approximate and based on public benchmarks. Prices per 1M tokens (USD).
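Since prices are quoted per 1M tokens, the cost of a single request can be estimated with simple arithmetic. The sketch below is illustrative only: it assumes input and output tokens are priced separately (the listing does not say), the names `Pricing` and `request_cost` are hypothetical, and the prices are placeholders rather than the price of any listed model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """USD per 1M tokens. Separate input/output rates are an assumption."""
    input_per_million: float
    output_per_million: float


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request, given per-1M-token prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * pricing.input_per_million
        + output_tokens * pricing.output_per_million
    )
    return total / 1_000_000


# Placeholder prices for illustration only, not taken from the listing.
example = Pricing(input_per_million=2.00, output_per_million=8.00)
cost = request_cost(example, input_tokens=250_000, output_tokens=50_000)
print(f"${cost:.2f}")
```

To compare models, substitute each model's published rates and your own expected token counts.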