Cost-check any AI API in 30 seconds.
?How it worksEnter your monthly usage and we rank every provider from cheapest to most expensive.For text: fill tokens-in / out / requests for accuracy, or just total tokens (we assume a 70/30 input/output split).Presets autofill realistic values. Prices are list rates — no Batch (-50%) or caching (-90%) discounts. An asterisk (*) flags providers with pricing caveats.
Pick your use case, plug in your real usage, and see which provider is the cheapest — and which gives you the best value when you factor quality in.
Cheapest providers for your usage
Pricing verified against provider docs on 2026-05-04. Always confirm with the provider before procurement — rates can shift mid-month, and tier discounts (Batch APIs, prompt caching, volume commitments) are not modeled here.
How we compute it
We multiply your monthly usage by each provider's published API rate and sort the list from cheapest to most expensive.
For text, the formula is (tokens_in × in_rate + tokens_out × out_rate) × requests / 1M. If you only enter total tokens, we assume a 70% input / 30% output split.
Prices are list rates verified against each provider's official pricing page on 2026-06-15. They don't include Batch API discounts (-50%) or prompt caching (-90% on cached input).
An asterisk (*) next to a provider means the rate has caveats — for example, ElevenLabs cost varies by tier, Gemini 3.1 Pro rises to $4/$18 beyond 200k context, and DeepSeek R1 bills reasoning tokens separately. Read the italic note below the row.
Need a SaaS subscription instead of an API? Try the Find my AI wizard — it ranks plug-and-play tools by your monthly budget.
FAQ
Token pricing, demystified.
A token is the unit AI providers use to bill API usage. Roughly, 1 token ≈ ¾ of an English word — so 1,000 tokens ≈ 750 words. Most providers charge separately for input tokens (your prompt and any context) and output tokens (the model's reply), and output is usually 3–5× more expensive than input.
Multiply your average tokens per request × your number of monthly requests, split into input and output, then multiply by each provider's per-token rate. The calculator above does this for every major provider in one click — enter tokens-in, tokens-out, and monthly request count, and it ranks providers from cheapest to most expensive.
It depends on what you're doing. For text and chat, DeepSeek, Gemini Flash and Claude Haiku are usually the cheapest. For TTS audio, ElevenLabs Flash and Google Cloud TTS lead. For STT, Deepgram and Whisper. For images, Flux Schnell and SDXL on Replicate. The ranking shifts when output volume is high — run your own usage numbers above for an answer matched to your real workload.
There's no flat monthly fee — the OpenAI API is pay-as-you-go. As a rough benchmark, GPT-4o costs about $2.50 per million input tokens and $10 per million output tokens. A "Pro" usage profile (≈25,000 requests/month with ~1,500 input and ~800 output tokens per request) typically lands between $300 and $500/month. Use the Pro preset above to model it instantly.
Input tokens are what you send to the model (your prompt, any context, files, system instructions). Output tokens are what the model generates back as a reply. Output is almost always priced higher than input because generation is more compute-intensive than reading. For most chat workloads, output costs 3–5× more per token than input.
No. The calculator shows list prices only — the rate every developer pays before negotiating volume or opting into Batch APIs (typically -50%) and prompt caching (typically -90% on cached input tokens). If you're running heavy automated workloads, your real cost may be 30–60% lower than what the calculator shows.
We re-verify rates against each provider's official pricing page monthly, and the verification date appears below every results table. AI pricing changes frequently — confirm the current rate with the provider directly before signing any contract or making a procurement decision.
A flat-rate SaaS subscription (ChatGPT Plus at $20/month, Claude Pro at $20/month, Gemini Advanced at $19.99/month) is cheaper for occasional, human-in-the-loop use. The API wins when you build a product, automate workflows, or run heavy daily volume. If you're not sure which side you're on, try the Find my AI wizard — it ranks SaaS plans by your monthly budget.
🍪 We use cookies
We use cookies to improve your experience and analyze site traffic. By clicking "Accept", you agree to our Cookie Policy.