🧠 Claude API Cost Calculator

Calculate Anthropic Claude API costs for all models. Enable prompt caching to see how much you save when reusing large system prompts or documents across requests.

Claude Model Comparison

| Model | Input / 1M tokens | Cached / 1M tokens | Output / 1M tokens | Context | Best For |
| --- | --- | --- | --- | --- | --- |
| Claude Opus 4.8 | $15.00 | $1.50 | $75.00 | 200K | Most capable, complex tasks |
| Claude Sonnet 4.6 | $3.00 | $0.30 | $15.00 | 200K | Balanced performance & cost |
| Claude Haiku 4.5 | $0.80 | $0.08 | $4.00 | 200K | Fast, lightweight tasks |

* Prices as of 2025. Check anthropic.com/pricing for the latest rates.
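
As a rough illustration of how the table turns into a per-request estimate, the following sketch multiplies token counts by the listed rates. The dictionary keys (`opus`, `sonnet`, `haiku`) and the `request_cost` helper are hypothetical names chosen for this example, not part of any Anthropic SDK, and the prices are simply copied from the table above, so they may be out of date.

```python
# Prices in USD per million tokens, copied from the comparison table above.
# The short keys are illustrative labels, not official model identifiers.
PRICES = {
    "opus": {"input": 15.00, "cached": 1.50, "output": 75.00},
    "sonnet": {"input": 3.00, "cached": 0.30, "output": 15.00},
    "haiku": {"input": 0.80, "cached": 0.08, "output": 4.00},
}


def request_cost(model, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of a single request.

    cached_tokens is the portion of input_tokens read from the cache;
    the remainder is billed at the normal input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    price = PRICES[model]
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * price["input"]
        + cached_tokens * price["cached"]
        + output_tokens * price["output"]
    )
    return total / 1_000_000


# Example: 2,000 input tokens and 500 output tokens on Sonnet, no caching.
print(f"${request_cost('sonnet', 2_000, 500):.4f}")
```

Keeping the rates in a single dictionary means only one place needs updating when prices change.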

Frequently Asked Questions

Prompt caching allows you to mark portions of your prompt (like system prompts or large documents) to be cached server-side. Subsequent requests that reuse cached content are charged at just 10% of the normal input price, saving up to 90% on repeated context.

If you have a 10,000-token system prompt used in every request, without caching that costs $3/M tokens × 10K tokens = $0.03 per request, or $30 per 1,000 requests, on Sonnet. With caching, that drops to $0.003 per request, or $3 per 1,000 requests: a 90% saving on that portion.
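
The arithmetic for a reused system prompt can be checked with a short sketch. It uses the Sonnet input and cached rates from the table ($3.00 and $0.30 per million tokens) and makes two simplifying assumptions: every request is a cache hit, and any extra charge for first writing the prompt to the cache is ignored. The `prompt_cost` helper is a hypothetical name for this example.

```python
# Sonnet rates in USD per million tokens, taken from the comparison table.
SONNET_INPUT_PER_M = 3.00
SONNET_CACHED_PER_M = 0.30


def prompt_cost(tokens, price_per_million, requests=1):
    """Cost in USD of sending `tokens` prompt tokens on each of `requests` requests."""
    return tokens * price_per_million / 1_000_000 * requests


# A 10,000-token system prompt sent with 1,000 requests.
uncached = prompt_cost(10_000, SONNET_INPUT_PER_M, requests=1_000)
cached = prompt_cost(10_000, SONNET_CACHED_PER_M, requests=1_000)

# Fraction of the system-prompt cost saved (assumes every request hits the cache).
saving = 1 - cached / uncached

print(f"uncached: ${uncached:.2f}  cached: ${cached:.2f}  saving: {saving:.0%}")
```

The saving applies only to the cached portion of the input; uncached input and output tokens are billed at their usual rates.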

Use Haiku for high-volume, straightforward tasks like classification, extraction, summarization, and chatbot responses. Use Sonnet when you need higher quality reasoning, coding help, or nuanced analysis. Opus is for the most complex tasks where quality is paramount.

System prompts are not billed differently: all input tokens (including system prompts, conversation history, and user messages) are charged at the same input token rate. Prompt caching, however, applies only to the reusable portions you mark for caching.

All current Claude models support 200,000-token context windows, among the largest available. This allows processing entire books, large codebases, or extensive documents in a single request.
