Question 1

Are these token counts exact?

Accepted Answer

For OpenAI models yes — we use the actual tiktoken encoder (o200k_base) running in your browser. For Claude, Gemini, and Llama, no public JavaScript tokenizer exists, so we use a chars-per-token heuristic calibrated against published averages (~3.5 for Claude, ~4 for Gemini, ~3.8 for Llama). Expect ±10% error on those.

Question 2

Are the prices current?

Accepted Answer

The prices shown by default are pinned to 2026-05-22 when this page was last updated. Click "Refresh prices" to pull live values from OpenRouter, which aggregates current rates across providers. The page falls back to the pinned values if the network call fails.

Question 3

Why does the same text produce different counts on different models?

Accepted Answer

Each model family uses its own tokenizer. Recent OpenAI models use o200k_base, which is more efficient on multilingual text than the older cl100k_base. Claude, Llama, and Gemini each have their own BPE vocabularies.

Question 4

Why doesn't the cost include output tokens?

Accepted Answer

Output token count is unpredictable — it depends on how long the model decides to respond. Most APIs charge 3-5x more for output than input, so total cost is roughly (input tokens) × (input price) + (output tokens) × (3-5 × input price). Multiply this tool's number by your expected response ratio.

Question 5

Where do the live prices come from?

Accepted Answer

OpenRouter (openrouter.ai), which aggregates pricing across LLM providers for routing purposes. They expose a public JSON API at /api/v1/models. We fetch it only when you click the Refresh button — the page makes zero network calls otherwise.

Question 6

Does this work offline?

Accepted Answer

Yes for the token counts. The tokenizer runs entirely in your browser. The live-price refresh button needs network; without it, the page shows the pinned static prices.

Question 7

Why is my Claude token count an estimate?

Accepted Answer

Anthropic does not publish their tokenizer as a runnable library. The only way to get an exact Claude token count is to call their API's count_tokens endpoint, which requires an API key and a network round-trip. The heuristic we use is accurate to within ~10% for English prose.

Model	Vendor	Tokens	$/1M input	Cost / call
GPT (latest)	OpenAI	41~est	$5.00	$0.000205
GPT-5.4 (mid-tier)	OpenAI	41~est	$2.50	$0.000102
Claude Opus (latest)	Anthropic	44~est	$5.00	$0.000220
Claude Sonnet (latest)	Anthropic	44~est	$3.00	$0.000132
Claude Haiku (latest)	Anthropic	44~est	$1.00	$0.000044
Gemini Pro (latest)	Google	39~est	$2.00	$0.000078
Gemini Flash (latest)	Google	39~est	$1.50	$0.000058
Llama 3.x (70B, self-hosted)	Meta	41~est	—	—

Token Counter for GPT, Claude, Llama

How to use the token counter for gpt, claude, llama

Formula & explanation

Examples

Frequently asked questions

Related developer tools tools

Related reading