LLM API Cost Calculator
Compare API pricing across GPT-4o, Claude, Gemini 2.0 Flash, DeepSeek R1, and more. Calculate per-request costs and monthly estimates for your AI applications.
Usage Parameters
~75,000 words
~37,500 words
3,000 requests/month
Filter Models
Cheapest (per request)
$0.030
Cheapest Model
Gemini 2.0 Flash
Cheapest Monthly
$90.00
Potential Savings
97%
| Model | Provider | Input Cost | Output Cost | Per Request | Monthly Est. |
|---|---|---|---|---|---|
1Gemini 2.0 Flash | $0.010 | $0.020 | $0.030 | $90.00 | |
DeepSeek R1 | DeepSeek | $0.055 | $0.110 | $0.165 | $493.50 |
GPT-4o | OpenAI | $0.250 | $0.500 | $0.750 | $2250.00 |
Claude Sonnet 4.5 | Anthropic | $0.300 | $0.750 | $1.05 | $3150.00 |
Current Pricing Reference (per 1M tokens)
| Model | Provider | Input | Output | Context |
|---|---|---|---|---|
GPT-4o | OpenAI | $2.50 | $10.00 | 128K |
GPT-4o Mini | OpenAI | $0.15 | $0.60 | 128K |
o1 (Reasoning)Advanced reasoning model | OpenAI | $15.00 | $60.00 | 200K |
o1-mini | OpenAI | $3.00 | $12.00 | 128K |
o3 | OpenAI | $2.00 | $8.00 | 200K |
o3-mini | OpenAI | $0.55 | $2.20 | 128K |
Claude Opus 4.5Most capable Claude model | Anthropic | $5.00 | $25.00 | 200K |
Claude Sonnet 4.5Balanced performance | Anthropic | $3.00 | $15.00 | 200K |
Claude Haiku 4.5Fast and efficient | Anthropic | $1.00 | $5.00 | 200K |
Gemini 2.0 FlashBest value, 1M context | $0.10 | $0.40 | 1000K | |
Gemini 1.5 Pro2M context window | $2.50 | $10.00 | 2000K | |
DeepSeek R1Open-source reasoning model | DeepSeek | $0.55 | $2.19 | 128K |
DeepSeek V3Ultra low cost | DeepSeek | $0.27 | $1.10 | 128K |
Pricing updated January 2025. Prices may vary. Always check official provider documentation for current rates.
How to Use
- Enter your average input tokens per request (prompts, context, etc.)
- Enter your average output tokens per request (model responses)
- Set how many API requests you make per day
- Filter by provider or tier to narrow down options
- Select which models you want to compare
- View cost breakdown per request and monthly estimates
How to Use the LLM API Cost Calculator
Select AI Models to Compare
Choose from GPT-4o, Claude Sonnet, Gemini Flash, DeepSeek R1, and other popular AI models. Select multiple models to see a side-by-side cost comparison for your use case.
Enter Token Usage Estimates
Input your average input tokens (prompts, context) and output tokens (responses) per request. Use sample text to estimate if unsure - roughly 1 token equals 4 characters or 0.75 words.
Set Your Request Volume
Specify how many API requests you expect per day. The calculator will project daily and monthly costs based on your usage patterns and selected models.
Compare and Optimize Costs
Review the cost breakdown across all selected models. Identify the most cost-effective option for your needs, considering the tradeoff between capability and price.
Pro tip: Your data is processed entirely in your browser. Nothing is sent to any server, ensuring complete privacy.
Understanding AI API Pricing in 2025
Large Language Model (LLM) APIs have become essential for modern applications, but understanding their pricing can be complex. Most providers charge per token, with different rates for input (your prompts) and output (model responses). This calculator helps you compare costs across all major providers.
As of January 2025, the AI pricing landscape has become highly competitive. Google Gemini 2.0 Flash offers the lowest mainstream pricing at $0.10 per million input tokens, while DeepSeek continues to disrupt with even lower costs. Meanwhile, OpenAI and Anthropic compete on capability rather than price alone.
Key Pricing Tiers
- Budget: Gemini Flash, DeepSeek, GPT-4o-mini - best for high-volume, cost-sensitive applications
- Standard: GPT-4o, Claude Sonnet - balanced performance and cost for most use cases
- Premium: Claude Opus - maximum capability for complex tasks requiring deep reasoning
- Reasoning: o1, o1-mini - specialized models for complex multi-step reasoning tasks
Cost Optimization Tips
- Use prompt caching when available (can reduce costs by 90%)
- Consider batch APIs for non-real-time workloads (50% discount)
- Choose the smallest model that meets your quality requirements
- Optimize prompts to reduce input tokens without losing context
- Use streaming to fail fast on poor generations
Frequently Asked Questions
How much does it cost to use GPT-4o API?
GPT-4o costs $2.50 per million input tokens and $10.00 per million output tokens as of January 2025. For a typical request with 1,000 input tokens and 500 output tokens, this works out to about $0.0075 per request.
Which AI model is the cheapest to use?
Google Gemini 2.0 Flash is currently the cheapest mainstream model at $0.10 per million input tokens and $0.40 per million output tokens. DeepSeek V3 is even cheaper at $0.27/$1.10. For OpenAI, GPT-4o-mini at $0.15/$0.60 is very cost-effective.
How do Claude API costs compare to OpenAI?
Claude Sonnet 4.5 ($3/$15 per 1M tokens) is slightly more expensive than GPT-4o ($2.50/$10) but offers 200K context. Claude Haiku 4.5 ($1/$5) is competitive with GPT-4o-mini ($0.15/$0.60) though more expensive.
What is a token in AI API pricing?
A token is approximately 4 characters or 0.75 words in English. For example, "Hello world" is about 2-3 tokens. Most AI APIs charge separately for input tokens (your prompts) and output tokens (model responses).
Is DeepSeek R1 cheaper than GPT-4?
Yes, DeepSeek R1 is significantly cheaper at $0.55 input / $2.19 output per million tokens, compared to GPT-4o at $2.50/$10. DeepSeek R1 offers competitive reasoning capabilities at a fraction of the cost.
Related Tools
Count tokens for OpenAI GPT-4, Claude 3, Gemini, and other LLMs. See token breakdown by model, estimate API costs instantly. Supports cl100k_base and other tokenizers.
Calculate Anthropic Claude API costs. Compare Claude 3 Opus, Sonnet, and Haiku pricing.
Calculate DeepSeek API costs and compare with other providers.
Calculate Google Gemini API costs. Compare Gemini Pro, Ultra, and Flash pricing.