All Models

Browse 53 models across 8 providers.

OpenAI

12 models

OpenAI's next-generation flagship — significant capability jump over GPT-4.1 with 1M context

$8 in$32 out1M

OGPT-5 Mini

Affordable GPT-5 intelligence — brings GPT-5 capability to cost-sensitive workloads

$0.6 in$2.4 out512k

OGPT-4.1

Latest flagship with 1M context window and strong coding/instruction following

$2 in$8 out1M

OGPT-4.1 Mini

Affordable intelligence with 1M context — best cost/performance in the 4.1 family

$0.4 in$1.6 out1M

OGPT-4.1 Nano

Smallest and cheapest GPT-4.1 model — ideal for simple tasks needing 1M context

$0.1 in$0.4 out1M

Oo4-mini

Fast, efficient reasoning model optimized for STEM and coding tasks

$1.1 in$4.4 out200k

Oo3

Advanced reasoning model at significantly reduced price (80% cut from launch)

$0.4 in$1.6 out200k

Oo1

OpenAI's original frontier reasoning model — deep thinking for the hardest problems

$15 in$60 out200k

OGPT-4o

Multimodal model with strong vision, audio, and text capabilities

$2.5 in$10 out128k

OGPT-4o Mini

Ultra-affordable model for high-volume tasks with good quality

$0.15 in$0.6 out128k

OGPT-4 Turbo

Previous generation GPT-4 Turbo — powerful but superseded by GPT-4o in cost-efficiency

$10 in$30 out128k

OGPT-3.5 Turbo

Classic fast model — still cost-effective for simple chat tasks and legacy integrations

$0.5 in$1.5 out16k

Anthropic

10 models

AClaude Fable 5

Anthropic's most capable widely released model — adaptive thinking always on, for the most demanding reasoning and long-horizon agentic work

$10 in$50 out1M

AClaude Opus 4.8

Most capable Opus-tier model — complex reasoning, long-horizon agentic coding, and high-autonomy work with adaptive thinking

$5 in$25 out1M

AClaude Opus 4.7

Most capable Claude model — step-change improvement in agentic coding

$5 in$25 out1M

AClaude Sonnet 4.6

Optimal balance of intelligence, cost, and speed with 1M context

$3 in$15 out1M

AClaude Opus 4.6

Previous flagship — strong reasoning and extended thinking support

$5 in$25 out1M

AClaude Opus 4.5

Most capable Claude 4 model with extended thinking — top performance on complex reasoning and coding

$15 in$75 out200k

AClaude Haiku 4.5

Fastest and most cost-efficient Claude with near-frontier intelligence

$1 in$5 out200k

AClaude 3.5 Sonnet

Previous generation Sonnet — high intelligence at moderate cost, widely used in production

$3 in$15 out200k

AClaude 3.5 Haiku

Previous generation fast model — great balance of speed and intelligence at low cost

$0.8 in$4 out200k

AClaude 3 Opus

Third-generation flagship — powerful reasoning, still used for demanding legacy workloads

$15 in$75 out200k

Google

10 models

GGemini 3 Ultra

Google's most powerful model — frontier reasoning, native multimodal, 2M context window

$10 in$30 out2M

GGemini 3 Pro

Gemini 3's balanced model — strong reasoning at a fraction of Ultra cost

$3.5 in$14 out1M

GGemini 3 Flash

Fast and capable Gemini 3 — ideal for real-time applications needing 1M context

$0.5 in$2 out1M

GGemini 3 Flash-Lite

Most affordable Gemini 3 model — high-volume tasks with 1M context at near-zero cost

$0.12 in$0.48 out1M

GGemini 2.5 Pro

Most capable Gemini model with deep reasoning and multimodal support

$1.25 in$10 out1M

GGemini 2.5 Flash

Best-in-class speed and efficiency for diverse tasks

$0.3 in$2.5 out1M

GGemini 2.5 Flash-Lite

Most cost-efficient Gemini model for high-volume, latency-sensitive workloads

$0.1 in$0.4 out1M

GGemini 2.0 Flash

Previous gen workhorse — fast multimodal model with excellent price-to-performance

$0.1 in$0.4 out1M

GGemini 2.0 Flash-Lite

Ultra-cheap previous gen model — suitable for high-volume simple generation tasks

$0.075 in$0.3 out1M

GGemini 1.5 Pro

First model with 2M token context window — great for massive document analysis

$1.25 in$5 out2M

Mistral

6 models

MMagistral Medium

Mistral's reasoning model — strong for complex analytical and math tasks

$2 in$5 out128k

MMistral Large 3

Frontier-level MoE model (675B total / 41B active params) at competitive price

$0.5 in$1.5 out256k

MMistral Medium 3

State-of-the-art performance at 8x lower cost than previous generation

$0.4 in$2 out128k

MMistral Small 3.1

Efficient small model for simple, high-volume tasks

$0.1 in$0.3 out128k

MCodestral

Code-specialized model with 256k context — optimized for fill-in-the-middle

$0.3 in$0.9 out256k

MMistral Nemo

Compact multilingual model with 128k context — great budget option for EU-compliance workloads

$0.1 in$0.3 out128k

DeepSeek

3 models

DDeepSeek R2

DeepSeek's second-generation reasoning model — stronger than R1 across all benchmarks at similar cost

$0.8 in$3.2 out128k

DDeepSeek R1

Open-source reasoning model matching o1-level performance at a fraction of the cost. Ideal for math, coding, logic.

$0.55 in$2.19 out64k

DDeepSeek Chat

Cost-efficient chat model with strong multilingual performance. Best price-to-quality for Asian languages.

$0.27 in$1.1 out64k

xAI

3 models

XGrok 3

xAI's flagship model with real-time web access and strong performance on coding and analysis.

$3 in$15 out128k

XGrok 3 Fast

xAI's high-throughput variant of Grok 3 — same intelligence as flagship with faster response times

$5 in$25 out128k

XGrok 3 Mini

xAI's efficient model optimized for fast responses and cost-sensitive workloads.

$0.3 in$0.5 out128k

Qwen

4 models

QQwen3.5 Flash

Alibaba's fastest model with 256k context window at near-zero cost. Best for ultra high-volume tasks.

$0.01 in$0.05 out256k

QQwen3 235B

Alibaba's large MoE model with exceptional price — $0.06/1M for both input and output tokens.

$0.06 in$0.06 out32k

QQwen3 30B

Alibaba's mid-size model — solid performance at near-zero cost for everyday tasks

$0.1 in$0.15 out32k

QQwen3 8B

Alibaba's small efficient model — one of the cheapest options for simple classification and generation

$0.05 in$0.1 out32k