AI Prompt Cost Calculator

What does this prompt cost at 10K, 100K, 10M calls a day — on Claude, GPT, Gemini or open models?

Pick a model, set input and output tokens per call, set a cache-hit rate, and get the per-call, per-day, per-month and per-year cost — across all major providers for comparison.
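The arithmetic behind those four figures can be sketched as follows. This is a minimal illustration, not the calculator's actual source: the function name `promptCost`, the parameter names, the example prices, the 30-day month and the 365-day year are all assumptions, and the 10% cached-input factor is the approximate figure this page quotes.

```javascript
// Hypothetical helper. Prices are placeholders in USD per 1M tokens, not real quotes.
const CACHED_INPUT_FACTOR = 0.10; // assumption: cached input billed at ~10% of the base rate

function promptCost({ inputTokens, outputTokens, cacheHitRate,
                      inputPricePerM, outputPricePerM, callsPerDay }) {
  // Split input tokens into cached and uncached portions.
  const cachedTokens = inputTokens * cacheHitRate;
  const freshTokens = inputTokens - cachedTokens;

  // Cached tokens are billed at a fraction of the normal input price.
  const inputCost =
    (freshTokens + cachedTokens * CACHED_INPUT_FACTOR) * inputPricePerM / 1e6;
  const outputCost = outputTokens * outputPricePerM / 1e6;

  const perCall = inputCost + outputCost;
  const perDay = perCall * callsPerDay;
  // Assumption: a month is 30 days and a year is 365 days.
  return { perCall, perDay, perMonth: perDay * 30, perYear: perDay * 365 };
}

// Example workload with made-up prices.
const example = promptCost({
  inputTokens: 2000, outputTokens: 500, cacheHitRate: 0.7,
  inputPricePerM: 3, outputPricePerM: 15, callsPerDay: 100000,
});
console.log(example);
```

Set `cacheHitRate` to 0 to model a provider without prompt caching.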

Model

Calls per day

Input tokens / call

Output tokens / call

Input cache hit rate (%)

Cached input tokens are billed at ~10% of the normal input rate on providers that support prompt caching.

Per call—

Per day—

Per month—

Per year—

Across all models (monthly)

Runs entirely in your browser — nothing you enter is sent to a server.

Free forever · No signup

Need this for real, on your stack?

These free tools are a taste of how we think. We’re a senior software team across Romania & Pakistan that ships deep technical work — platforms, infra, data and the gnarly bits in between.

Talk to our engineers →

Cache-hit rate is the lever

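For repetitive workloads (system prompts, RAG context), prompt caching bills cached input tokens at ~10% of the base input rate. At a 70% cache-hit rate, input cost falls to 0.3 + 0.7 × 0.1 = 0.37 of the uncached figure, a cut of ~63%. Output tokens are billed as usual, so the saving on the total bill depends on how much of it is input. Output quality is unchanged.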
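The effect of the cache-hit rate can be checked with a few lines of arithmetic. The sketch below assumes the ~10% cached-input factor quoted on this page; the function names and the `inputShareOfBill` parameter are hypothetical, introduced only for illustration.

```javascript
// Fraction of the uncached input cost that remains at a given cache-hit rate.
// cachedFactor = 0.10 is an assumption taken from the "~10% of base" figure.
function inputCostMultiplier(cacheHitRate, cachedFactor = 0.10) {
  return (1 - cacheHitRate) + cacheHitRate * cachedFactor;
}

// Fraction of the total bill saved, given the share of the uncached bill
// that comes from input tokens (0 to 1). Output cost is unaffected by caching.
function totalSavings(cacheHitRate, inputShareOfBill, cachedFactor = 0.10) {
  return inputShareOfBill * (1 - inputCostMultiplier(cacheHitRate, cachedFactor));
}

console.log(inputCostMultiplier(0.7)); // about 0.37, a ~63% cut in input cost
console.log(totalSavings(0.7, 0.5));   // about 0.315 when input is half the bill
```

The closer input is to the whole bill, the closer the total saving gets to the ~63% input saving; output-heavy workloads see less.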

Rates

Prices are a 2026 snapshot stored in weww-tools.js — edit there to reflect current quotes.
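As an illustration of how a rate table can drive the monthly comparison across models, here is a minimal sketch. It does not reproduce the contents or structure of weww-tools.js: the `RATES` object, its field names, the model names and every price below are hypothetical placeholders, and a 30-day month is assumed.

```javascript
// Hypothetical rate table. Model names and prices are placeholders, not real quotes.
// All prices are USD per 1M tokens.
const RATES = {
  "example-model-a": { input: 3.0, output: 15.0, cachedInput: 0.3 },
  "example-model-b": { input: 0.5, output: 1.5, cachedInput: null }, // null: no prompt caching
};

function monthlyCost(rate, { inputTokens, outputTokens, cacheHitRate, callsPerDay }) {
  // Without a cached rate, cached tokens are billed at the full input price.
  const cachedPrice = rate.cachedInput ?? rate.input;
  const cachedTokens = inputTokens * cacheHitRate;
  const perCall =
    ((inputTokens - cachedTokens) * rate.input +
      cachedTokens * cachedPrice +
      outputTokens * rate.output) / 1e6;
  return perCall * callsPerDay * 30; // assumption: 30-day month
}

// Monthly cost for every model in the table, cheapest first.
function compareMonthly(rates, workload) {
  return Object.entries(rates)
    .map(([model, rate]) => ({ model, monthly: monthlyCost(rate, workload) }))
    .sort((a, b) => a.monthly - b.monthly);
}

console.log(compareMonthly(RATES, {
  inputTokens: 2000, outputTokens: 500, cacheHitRate: 0.7, callsPerDay: 100000,
}));
```

Keeping prices in one table means that updating a quote is a one-line change and the comparison recomputes from it.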

Built by senior engineers at WeWorkWorldwide. We’re hiring across Romania & Pakistan — see open roles →. Or see how our team embeds with yours.

