
AI Model Pricing

Compare the pricing and capabilities of all AI models supported by OpenClaw. All prices are shown in USD per 1 million tokens.

OpenAI

Industry-leading models with excellent reasoning and code capabilities

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| GPT-4o (recommended) | `gpt-4o` | 128K | General purpose, vision, coding | $2.50 | $10.00 | Vision, function calling, JSON mode |
| GPT-4o-mini | `gpt-4o-mini` | 128K | Fast, cost-effective tasks | $0.15 | $0.60 | Vision, function calling, low latency |
| GPT-4 Turbo | `gpt-4-turbo` | 128K | Complex reasoning tasks | $10.00 | $30.00 | Vision, advanced reasoning, knowledge cutoff 2023 |
| GPT-3.5 Turbo | `gpt-3.5-turbo` | 16K | Simple tasks, legacy support | $0.50 | $1.50 | Fast, cost-effective, reliable |

Anthropic

Claude models excel at analysis, writing, and complex reasoning

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Claude 3.5 Sonnet (recommended) | `claude-3-5-sonnet-20241022` | 200K | Coding, analysis, writing | $3.00 | $15.00 | Excellent coding, long context, vision |
| Claude 3 Opus | `claude-3-opus-20240229` | 200K | Most complex tasks | $15.00 | $75.00 | Highest capability, deep analysis, research |
| Claude 3 Sonnet | `claude-3-sonnet-20240229` | 200K | Balanced performance | $3.00 | $15.00 | Reliable, good speed, versatile |
| Claude 3 Haiku | `claude-3-haiku-20240307` | 200K | Fast responses, simple tasks | $0.25 | $1.25 | Fastest, cost-effective, lightweight |

Google

Gemini models offer competitive pricing and large context windows

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Gemini 2.0 Flash (recommended) | `gemini-2.0-flash-exp` | 1M | Fast, multimodal tasks | $0.07 | $0.30 | 1M context; vision, audio, video |
| Gemini 1.5 Pro | `gemini-1.5-pro` | 2M | Long documents, analysis | $1.25 | $5.00 | 2M context, complex reasoning, multimodal |
| Gemini 1.5 Flash | `gemini-1.5-flash` | 1M | Fast, cost-effective | $0.07 | $0.30 | 1M context, speed, efficiency |

Moonshot AI

Chinese LLM with strong long-context capabilities

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Kimi K2.5 (recommended) | `kimi-k2.5` | 256K | Long context, Chinese tasks | $0.50 | $2.00 | 256K context, Chinese-optimized, reasoning |

DeepSeek

Cost-effective models with strong coding capabilities

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| DeepSeek V3 (recommended) | `deepseek-chat` | 64K | Coding, cost-effective tasks | $0.27 | $1.10 | Great value, coding, Chinese/English |
| DeepSeek R1 | `deepseek-reasoner` | 64K | Reasoning, math, logic | $0.55 | $2.19 | Chain-of-thought, reasoning, problem solving |

Mistral AI

Efficient open-source models with strong performance

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Mistral Large 2 (recommended) | `mistral-large-latest` | 128K | Complex tasks, reasoning | $2.00 | $6.00 | Multilingual, reasoning, coding |
| Mistral Medium | `mistral-medium` | 32K | Balanced performance | $0.90 | $2.70 | Fast, cost-effective, reliable |
| Mistral Small | `mistral-small` | 32K | Simple tasks, speed | $0.20 | $0.60 | Fastest, efficient, lightweight |
| Codestral | `codestral-latest` | 32K | Code generation | $0.30 | $0.90 | 80+ languages, fill-in-the-middle, code focus |

Perplexity

Search-augmented models with real-time information access

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Sonar Pro (recommended) | `sonar-pro` | 200K | Research, complex queries | $3.00 | $15.00 | Search-augmented, real-time data, citations |
| Sonar | `sonar` | 128K | General search queries | $1.00 | $1.00 | Fast search, cost-effective, real-time |

Cohere

Enterprise-grade models with strong RAG capabilities

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Command R+ (recommended) | `command-r-plus` | 128K | Enterprise RAG, complex tasks | $3.00 | $15.00 | RAG-optimized, tool use, multilingual |
| Command R | `command-r` | 128K | RAG, conversational | $0.50 | $1.50 | Balanced, RAG-ready, fast |
| Command | `command` | 4K | General tasks | $1.00 | $2.00 | Reliable, simple, cost-effective |

Groq

Ultra-fast inference with competitive pricing

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Llama 3.3 70B (recommended) | `llama-3.3-70b-versatile` | 128K | Fast inference, general tasks | $0.59 | $0.79 | Ultra-fast, 128K context, open source |
| Llama 3.1 8B | `llama-3.1-8b-instant` | 128K | Ultra-fast simple tasks | $0.05 | $0.08 | Fastest, cheapest, efficient |
| Mixtral 8x7B | `mixtral-8x7b-32768` | 32K | Balanced performance | $0.24 | $0.24 | MoE architecture, fast, reliable |

Together AI

Access to a wide range of open-source models

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Llama 3.3 70B (recommended) | `meta-llama/Llama-3.3-70B-Instruct-Turbo` | 128K | General purpose, chat | $0.88 | $0.88 | 128K context, open source, reliable |
| Qwen 2.5 72B | `Qwen/Qwen2.5-72B-Instruct-Turbo` | 128K | Multilingual, coding | $1.20 | $1.20 | Strong coding, multilingual, 128K context |
| DeepSeek V3 | `deepseek-ai/DeepSeek-V3` | 64K | Coding, reasoning | $1.25 | $1.25 | Strong coding, cost-effective, fast |

Fireworks AI

Fast inference API for open-source models

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Llama 3.3 70B (recommended) | `accounts/fireworks/models/llama-v3p3-70b-instruct` | 128K | Fast inference | $0.90 | $0.90 | Fast, reliable, 128K context |
| Mixtral 8x22B | `accounts/fireworks/models/mixtral-8x22b-instruct` | 64K | Complex reasoning | $1.20 | $1.20 | MoE, strong reasoning, 64K context |

xAI

Grok models with real-time X platform integration

| Model | ID | Context | Best for | Input / 1M | Output / 1M | Highlights |
| --- | --- | --- | --- | --- | --- | --- |
| Grok 2 (recommended) | `grok-2-latest` | 128K | Real-time info, analysis | $5.00 | $15.00 | X integration, real-time, uncensored |
| Grok 2 Mini | `grok-2-mini` | 128K | Fast responses | $0.60 | $2.00 | Fast, cost-effective, real-time |

About Pricing

Prices are shown in USD per 1 million tokens. Input tokens refer to the text you send to the model, while output tokens are the model's response. Context window indicates the maximum number of tokens the model can process in a single request. Prices may vary and are subject to change by the providers. Always check the official provider documentation for the most current pricing.
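Because input and output tokens are billed at different per-1M rates, the cost of a request is the sum of two simple proportions. A minimal sketch, using a few example models and the per-1M prices listed above (the `PRICES` dict and `request_cost` helper are illustrative, not part of any provider SDK):

```python
# Illustrative cost estimator: prices are USD per 1 million tokens,
# taken from the pricing tables above (subject to change by providers).
PRICES = {
    # model ID: (input price, output price) per 1M tokens
    "gpt-4o": (2.50, 10.00),
    "claude-3-5-sonnet-20241022": (3.00, 15.00),
    "gemini-1.5-flash": (0.07, 0.30),
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request: tokens / 1,000,000 * price-per-1M, summed for both directions."""
    in_price, out_price = PRICES[model]
    return input_tokens / 1_000_000 * in_price + output_tokens / 1_000_000 * out_price

# Example: 12,000 input tokens + 1,500 output tokens on GPT-4o
# = 0.012 * $2.50 + 0.0015 * $10.00 = $0.03 + $0.015 = $0.045
cost = request_cost("gpt-4o", 12_000, 1_500)
```

Note that output tokens are usually several times more expensive than input tokens, so for generation-heavy workloads the output rate dominates the bill.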