LLM API Cost Calculator

Compare API pricing across GPT-4o, Claude, Gemini 2.0 Flash, DeepSeek R1, and more. Calculate per-request costs and monthly estimates for your AI applications.

Usage Parameters

Input Tokens per Request

~75,000 words

Output Tokens per Request

~37,500 words

Requests per Day

3,000 requests/month

Filter Models

Provider

Tier

Compare Models

Cheapest (per request)

$0.030

Cheapest Model

Gemini 2.0 Flash

Cheapest Monthly

$90.00

Potential Savings

97%

Model	Provider	Input Cost	Output Cost	Per Request	Monthly Est.
1Gemini 2.0 Flash	Google	$0.010	$0.020	$0.030	$90.00
DeepSeek R1	DeepSeek	$0.055	$0.110	$0.165	$493.50
GPT-4o	OpenAI	$0.250	$0.500	$0.750	$2250.00
Claude Sonnet 4.5	Anthropic	$0.300	$0.750	$1.05	$3150.00

Current Pricing Reference (per 1M tokens)

Model	Provider	Input	Output	Context
GPT-4o	OpenAI	$2.50	$10.00	128K
GPT-4o Mini	OpenAI	$0.15	$0.60	128K
o1 (Reasoning)Advanced reasoning model	OpenAI	$15.00	$60.00	200K
o1-mini	OpenAI	$3.00	$12.00	128K
o3	OpenAI	$2.00	$8.00	200K
o3-mini	OpenAI	$0.55	$2.20	128K
Claude Opus 4.5Most capable Claude model	Anthropic	$5.00	$25.00	200K
Claude Sonnet 4.5Balanced performance	Anthropic	$3.00	$15.00	200K
Claude Haiku 4.5Fast and efficient	Anthropic	$1.00	$5.00	200K
Gemini 2.0 FlashBest value, 1M context	Google	$0.10	$0.40	1000K
Gemini 1.5 Pro2M context window	Google	$2.50	$10.00	2000K
DeepSeek R1Open-source reasoning model	DeepSeek	$0.55	$2.19	128K
DeepSeek V3Ultra low cost	DeepSeek	$0.27	$1.10	128K

Pricing updated January 2025. Prices may vary. Always check official provider documentation for current rates.

How to Use

Enter your average input tokens per request (prompts, context, etc.)
Enter your average output tokens per request (model responses)
Set how many API requests you make per day
Filter by provider or tier to narrow down options
Select which models you want to compare
View cost breakdown per request and monthly estimates

How to Use the LLM API Cost Calculator

Select AI Models to Compare

Choose from GPT-4o, Claude Sonnet, Gemini Flash, DeepSeek R1, and other popular AI models. Select multiple models to see a side-by-side cost comparison for your use case.

Enter Token Usage Estimates

Input your average input tokens (prompts, context) and output tokens (responses) per request. Use sample text to estimate if unsure - roughly 1 token equals 4 characters or 0.75 words.

Set Your Request Volume

Specify how many API requests you expect per day. The calculator will project daily and monthly costs based on your usage patterns and selected models.

Compare and Optimize Costs

Review the cost breakdown across all selected models. Identify the most cost-effective option for your needs, considering the tradeoff between capability and price.

Pro tip: Your data is processed entirely in your browser. Nothing is sent to any server, ensuring complete privacy.

Understanding AI API Pricing in 2025

Large Language Model (LLM) APIs have become essential for modern applications, but understanding their pricing can be complex. Most providers charge per token, with different rates for input (your prompts) and output (model responses). This calculator helps you compare costs across all major providers.

As of January 2025, the AI pricing landscape has become highly competitive. Google Gemini 2.0 Flash offers the lowest mainstream pricing at $0.10 per million input tokens, while DeepSeek continues to disrupt with even lower costs. Meanwhile, OpenAI and Anthropic compete on capability rather than price alone.

Key Pricing Tiers

Budget: Gemini Flash, DeepSeek, GPT-4o-mini - best for high-volume, cost-sensitive applications
Standard: GPT-4o, Claude Sonnet - balanced performance and cost for most use cases
Premium: Claude Opus - maximum capability for complex tasks requiring deep reasoning
Reasoning: o1, o1-mini - specialized models for complex multi-step reasoning tasks

Cost Optimization Tips

Use prompt caching when available (can reduce costs by 90%)
Consider batch APIs for non-real-time workloads (50% discount)
Choose the smallest model that meets your quality requirements
Optimize prompts to reduce input tokens without losing context
Use streaming to fail fast on poor generations

Frequently Asked Questions

How much does it cost to use GPT-4o API?

GPT-4o costs $2.50 per million input tokens and $10.00 per million output tokens as of January 2025. For a typical request with 1,000 input tokens and 500 output tokens, this works out to about $0.0075 per request.

Which AI model is the cheapest to use?

Google Gemini 2.0 Flash is currently the cheapest mainstream model at $0.10 per million input tokens and $0.40 per million output tokens. DeepSeek V3 is even cheaper at $0.27/$1.10. For OpenAI, GPT-4o-mini at $0.15/$0.60 is very cost-effective.

How do Claude API costs compare to OpenAI?

Claude Sonnet 4.5 ($3/$15 per 1M tokens) is slightly more expensive than GPT-4o ($2.50/$10) but offers 200K context. Claude Haiku 4.5 ($1/$5) is competitive with GPT-4o-mini ($0.15/$0.60) though more expensive.

What is a token in AI API pricing?

A token is approximately 4 characters or 0.75 words in English. For example, "Hello world" is about 2-3 tokens. Most AI APIs charge separately for input tokens (your prompts) and output tokens (model responses).

Is DeepSeek R1 cheaper than GPT-4?

Yes, DeepSeek R1 is significantly cheaper at $0.55 input / $2.19 output per million tokens, compared to GPT-4o at $2.50/$10. DeepSeek R1 offers competitive reasoning capabilities at a fraction of the cost.

Related Tools

Token Counter

Count tokens for OpenAI GPT-4, Claude 3, Gemini, and other LLMs. See token breakdown by model, estimate API costs instantly. Supports cl100k_base and other tokenizers.

Claude API Calculator

Calculate Anthropic Claude API costs. Compare Claude 3 Opus, Sonnet, and Haiku pricing.

DeepSeek Calculator

Calculate DeepSeek API costs and compare with other providers.

Gemini API Calculator

Calculate Google Gemini API costs. Compare Gemini Pro, Ultra, and Flash pricing.

View all AI Tools