LLM API Cost Calculator

Compare API pricing across GPT-4o, Claude, Gemini 2.0 Flash, DeepSeek R1, and more. Calculate per-request costs and monthly estimates for your AI applications.

Usage Parameters

Input tokens per request: 100,000 (~75,000 words)

Output tokens per request: 50,000 (~37,500 words)

Request volume: 3,000 requests/month

Cheapest (per request): $0.030

Cheapest Model: Gemini 2.0 Flash

Cheapest Monthly: $90.00

Potential Savings: 97%

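| Model | Provider | Input Cost | Output Cost | Per Request | Monthly Est. |
| --- | --- | --- | --- | --- | --- |
| Gemini 2.0 Flash | Google | $0.010 | $0.020 | $0.030 | $90.00 |
| DeepSeek R1 | DeepSeek | $0.055 | $0.110 | $0.165 | $493.50 |
| GPT-4o | OpenAI | $0.250 | $0.500 | $0.750 | $2,250.00 |
| Claude Sonnet 4.5 | Anthropic | $0.300 | $0.750 | $1.05 | $3,150.00 |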
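
The arithmetic behind the comparison table can be sketched in a few lines of Python. This is an illustration rather than the calculator's actual code: the function name `request_cost` is hypothetical, the rates are the per-1M-token prices from the pricing reference, and the workload (100,000 input tokens, 50,000 output tokens, 3,000 requests per month) is an assumption inferred from the figures shown.

```python
# Prices in USD per 1M tokens (input, output), copied from the pricing reference.
PRICES_PER_MILLION = {
    "Gemini 2.0 Flash": (0.10, 0.40),
    "DeepSeek R1": (0.55, 2.19),
    "GPT-4o": (2.50, 10.00),
    "Claude Sonnet 4.5": (3.00, 15.00),
}


def request_cost(model, input_tokens, output_tokens):
    """Return (input_cost, output_cost, total) in USD for a single request."""
    input_price, output_price = PRICES_PER_MILLION[model]
    input_cost = input_tokens * input_price / 1_000_000
    output_cost = output_tokens * output_price / 1_000_000
    return input_cost, output_cost, input_cost + output_cost


# Assumed workload: 100,000 input tokens, 50,000 output tokens, 3,000 requests/month.
for model in PRICES_PER_MILLION:
    _, _, total = request_cost(model, 100_000, 50_000)
    print(f"{model}: ${total:.4f} per request, ${total * 3000:,.2f} per month")
```

Under these assumptions the DeepSeek R1 request costs $0.1645 before rounding, which is why its monthly estimate is $493.50 rather than 3,000 times the displayed $0.165.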

Current Pricing Reference (per 1M tokens)

| Model | Provider | Input | Output | Context | Notes |
| --- | --- | --- | --- | --- | --- |
| GPT-4o | OpenAI | $2.50 | $10.00 | 128K | |
| GPT-4o Mini | OpenAI | $0.15 | $0.60 | 128K | |
| o1 (Reasoning) | OpenAI | $15.00 | $60.00 | 200K | Advanced reasoning model |
| o1-mini | OpenAI | $3.00 | $12.00 | 128K | |
| o3 | OpenAI | $2.00 | $8.00 | 200K | |
| o3-mini | OpenAI | $0.55 | $2.20 | 128K | |
| Claude Opus 4.5 | Anthropic | $5.00 | $25.00 | 200K | Most capable Claude model |
| Claude Sonnet 4.5 | Anthropic | $3.00 | $15.00 | 200K | Balanced performance |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 | 200K | Fast and efficient |
| Gemini 2.0 Flash | Google | $0.10 | $0.40 | 1M | Best value, 1M context |
| Gemini 1.5 Pro | Google | $2.50 | $10.00 | 2M | 2M context window |
| DeepSeek R1 | DeepSeek | $0.55 | $2.19 | 128K | Open-source reasoning model |
| DeepSeek V3 | DeepSeek | $0.27 | $1.10 | 128K | Ultra low cost |

Pricing updated January 2025. Prices may vary. Always check official provider documentation for current rates.

How to Use

  1. Enter your average input tokens per request (prompts, context, etc.)
  2. Enter your average output tokens per request (model responses)
  3. Set how many API requests you make per day
  4. Filter by provider or tier to narrow down options
  5. Select which models you want to compare
  6. View cost breakdown per request and monthly estimates

How to Use the LLM API Cost Calculator

Select AI Models to Compare

Choose from GPT-4o, Claude Sonnet, Gemini Flash, DeepSeek R1, and other popular AI models. Select multiple models to see a side-by-side cost comparison for your use case.

Enter Token Usage Estimates

Enter your average input tokens (prompts, context) and output tokens (responses) per request. If you are unsure, estimate from sample text: 1 token is roughly 4 characters or 0.75 words of English.
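
As a rough illustration of that rule of thumb, the sketch below averages the character-based and word-based estimates. The helper `estimate_tokens` is hypothetical and is not a real tokenizer; actual counts depend on each provider's tokenizer and on the language of the text.

```python
def estimate_tokens(text):
    """Rough token estimate for English text using the 4-chars / 0.75-words heuristics."""
    if not text:
        return 0
    by_chars = len(text) / 4              # ~4 characters per token
    by_words = len(text.split()) / 0.75   # ~0.75 words per token
    # Average the two heuristics, round to the nearest integer,
    # and report at least 1 token for any non-empty text.
    return max(1, round((by_chars + by_words) / 2))


sample = "Summarize the following support ticket in two sentences."
print(estimate_tokens(sample))
```

For exact numbers, count tokens with the provider's own tokenizer or read the usage figures returned with each API response.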

Set Your Request Volume

Specify how many API requests you expect per day. The calculator will project daily and monthly costs based on your usage patterns and selected models.
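
A minimal sketch of that projection, assuming a 30-day month (the calculator's exact convention is not stated, though 100 requests per day at 30 days matches the 3,000 requests/month shown in the results). The function name `project_costs` is hypothetical.

```python
DAYS_PER_MONTH = 30  # assumption: the tool's own month length is not documented


def project_costs(cost_per_request, requests_per_day, days_per_month=DAYS_PER_MONTH):
    """Project daily and monthly spend in USD from a per-request cost."""
    daily = cost_per_request * requests_per_day
    return {"daily": daily, "monthly": daily * days_per_month}


# Example: a $0.75 request (the GPT-4o row in the comparison table) at 100 requests/day.
projection = project_costs(cost_per_request=0.75, requests_per_day=100)
print(f"Daily: ${projection['daily']:,.2f}, monthly: ${projection['monthly']:,.2f}")
```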

Compare and Optimize Costs

Review the cost breakdown across all selected models. Identify the most cost-effective option for your needs, considering the tradeoff between capability and price.
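
One way to make this comparison concrete is to rank the monthly estimates and compute the gap between the cheapest and the most expensive selected model. The sketch below uses the monthly figures from the comparison table; treating "Potential Savings" as that cheapest-versus-priciest gap is an assumption about how the tool defines it, and both function names are hypothetical.

```python
# Monthly estimates in USD, taken from the comparison table.
monthly_estimates = {
    "Gemini 2.0 Flash": 90.00,
    "DeepSeek R1": 493.50,
    "GPT-4o": 2250.00,
    "Claude Sonnet 4.5": 3150.00,
}


def rank_by_cost(estimates):
    """Return (model, monthly_cost) pairs sorted from cheapest to most expensive."""
    return sorted(estimates.items(), key=lambda item: item[1])


def potential_savings(estimates):
    """Percentage saved by choosing the cheapest model over the most expensive one."""
    cheapest = min(estimates.values())
    priciest = max(estimates.values())
    return (1 - cheapest / priciest) * 100


for model, cost in rank_by_cost(monthly_estimates):
    print(f"{model}: ${cost:,.2f}/month")
print(f"Potential savings: {potential_savings(monthly_estimates):.0f}%")
```

Price is only one axis: a cheaper model that needs retries or longer prompts to reach acceptable quality can end up costing more per useful result.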

Pro tip: Your data is processed entirely in your browser. Nothing is sent to any server, ensuring complete privacy.

Understanding AI API Pricing in 2025

Large Language Model (LLM) APIs have become essential for modern applications, but understanding their pricing can be complex. Most providers charge per token, with different rates for input (your prompts) and output (model responses). This calculator helps you compare costs across all major providers.

As of January 2025, the AI pricing landscape has become highly competitive. Google Gemini 2.0 Flash offers the lowest mainstream pricing at $0.10 per million input tokens, and DeepSeek undercuts most other providers with V3 at $0.27 and R1 at $0.55 per million input tokens. Meanwhile, OpenAI and Anthropic compete on capability rather than price alone.

Key Pricing Tiers

  • Budget: Gemini Flash, DeepSeek, GPT-4o Mini - best for high-volume, cost-sensitive applications
  • Standard: GPT-4o, Claude Sonnet - balanced performance and cost for most use cases
  • Premium: Claude Opus - maximum capability for complex tasks requiring deep reasoning
  • Reasoning: o1, o1-mini - specialized models for complex multi-step reasoning tasks

Cost Optimization Tips

  • Use prompt caching when available (cached input tokens can cost up to 90% less, depending on the provider)
  • Consider batch APIs for non-real-time workloads (typically a 50% discount)
  • Choose the smallest model that meets your quality requirements
  • Optimize prompts to reduce input tokens without losing context
  • Use streaming so you can cancel poor generations early and stop paying for further output tokens
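
To see how the caching and batch tips affect a bill, here is a simplified model. The function `discounted_request_cost` and its parameters are hypothetical, the 90% and 50% figures come from the tips above, and the GPT-4o list prices serve only as example rates. Real providers differ on cache-write surcharges, minimum cacheable prompt sizes, and whether the two discounts can be combined.

```python
def discounted_request_cost(input_tokens, output_tokens, input_price, output_price,
                            cached_fraction=0.0, cache_discount=0.90,
                            batch_discount=0.0):
    """Per-request cost in USD under a simplified caching and batch-discount model.

    Prices are USD per 1M tokens. cached_fraction is the share of input tokens
    served from the prompt cache; discounts are fractions between 0 and 1.
    """
    cached = input_tokens * cached_fraction
    uncached = input_tokens - cached
    # Cached input tokens are billed at (1 - cache_discount) of the normal rate.
    billable_input = uncached + cached * (1 - cache_discount)
    input_cost = billable_input * input_price / 1_000_000
    output_cost = output_tokens * output_price / 1_000_000
    # The batch discount is applied to the whole request in this model.
    return (input_cost + output_cost) * (1 - batch_discount)


# Example rates: $2.50 input / $10.00 output per 1M tokens; 1,000 in, 500 out.
baseline = discounted_request_cost(1_000, 500, 2.50, 10.00)
with_cache = discounted_request_cost(1_000, 500, 2.50, 10.00, cached_fraction=0.8)
with_batch = discounted_request_cost(1_000, 500, 2.50, 10.00, batch_discount=0.5)
print(f"baseline ${baseline:.5f}, cached ${with_cache:.5f}, batch ${with_batch:.5f}")
```

In this example caching trims only the input side of the bill, so output-heavy workloads benefit more from batching or from a smaller model than from caching.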

Frequently Asked Questions

How much does it cost to use the GPT-4o API?

GPT-4o costs $2.50 per million input tokens and $10.00 per million output tokens as of January 2025. For a typical request with 1,000 input tokens and 500 output tokens, this works out to about $0.0075 per request.
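
Written out, the calculation for that example request is:

```latex
\[
\text{cost}
= \frac{1{,}000 \times \$2.50}{1{,}000{,}000}
+ \frac{500 \times \$10.00}{1{,}000{,}000}
= \$0.0025 + \$0.0050
= \$0.0075
\]
```

Two-thirds of the cost comes from the 500 output tokens, because output is priced at four times the input rate.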

Which AI model is the cheapest to use?

Google Gemini 2.0 Flash is currently the cheapest mainstream model at $0.10 per million input tokens and $0.40 per million output tokens. DeepSeek V3 is also inexpensive at $0.27/$1.10, though still more costly than Gemini 2.0 Flash. For OpenAI, GPT-4o Mini at $0.15/$0.60 is very cost-effective.

How do Claude API costs compare to OpenAI?

Claude Sonnet 4.5 ($3/$15 per 1M tokens) costs 20% more than GPT-4o ($2.50/$10) for input and 50% more for output, but offers a 200K context window versus 128K. Claude Haiku 4.5 ($1/$5) is Anthropic's lowest-priced model, but it still costs roughly 7 to 8 times more than GPT-4o Mini ($0.15/$0.60).

What is a token in AI API pricing?

A token is approximately 4 characters or 0.75 words in English. For example, "Hello world" is about 2-3 tokens. Most AI APIs charge separately for input tokens (your prompts) and output tokens (model responses).

Is DeepSeek R1 cheaper than GPT-4o?

Yes. DeepSeek R1 costs $0.55 input / $2.19 output per million tokens, compared to $2.50/$10 for GPT-4o, which is about 22% of GPT-4o's price. DeepSeek R1 offers competitive reasoning capabilities at a fraction of the cost.