ZeroTool Workbench

AI Token Counter

Free browser-based token counter for GPT-5, GPT-4.1, GPT-4o, Claude Sonnet 4.6, Gemini 3, Llama 3.3, and DeepSeek V3. Live count and cost. No upload, no API key.

100% Client-Side · Your data never leaves your browser · Free · No Sign-Up

Token counts & cost

Prices are reference values reviewed in May 2026. Verify on each provider's pricing page.

How to Use

  1. Paste or type your prompt into the input box. Counts and cost cards update as you type.
  2. Each model card shows three values — token count, estimated cost for that text as input, and the per-million input/output rates.
  3. Cards labeled exact use the OpenAI BPE encoder. Cards labeled approx apply a calibrated factor over the same encoder; expect 5 to 10 percent drift from the provider’s own count.
  4. Open Token Visualization to see how an exact-count model splits the text. Each colored chip is one token. Hover a chip to read its token id.
  5. Click Copy stats to grab a one-line summary in plain text — useful for sharing context windows in code review or chat.

What Is a Token?

Large language models read text as a sequence of tokens, not characters or words. A token is a sub-word unit produced by a Byte Pair Encoding (BPE) algorithm trained on internet-scale text. A common English word typically maps to a single token, while rare strings, URLs, and CJK characters can each take several tokens. Pricing, rate limits, and context windows are all measured in tokens.
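The greedy merge process at the heart of BPE can be sketched in a few lines. The merge table below is invented for illustration; real tables like cl100k_base contain on the order of a hundred thousand learned merges:

```python
def bpe_encode(word, merges):
    """Apply BPE merge rules to a word, best-ranked merge first.

    `merges` maps a symbol pair to its rank (lower = merged earlier),
    mimicking how a trained BPE table is consulted.
    """
    symbols = list(word)  # start from individual characters
    while len(symbols) > 1:
        # Find the adjacent pair with the best (lowest) merge rank.
        ranked = [(merges.get((a, b), float("inf")), i)
                  for i, (a, b) in enumerate(zip(symbols, symbols[1:]))]
        best_rank, i = min(ranked)
        if best_rank == float("inf"):
            break  # no learned merge applies; stop
        symbols[i:i + 2] = [symbols[i] + symbols[i + 1]]
    return symbols

# Toy merge table for illustration only.
toy_merges = {("t", "o"): 0, ("to", "k"): 1, ("e", "n"): 2, ("tok", "en"): 3}
print(bpe_encode("token", toy_merges))  # -> ['token']
print(bpe_encode("ten", toy_merges))   # -> ['t', 'en']
```

A word covered by the learned merges collapses to one token; an unfamiliar word stays split into more pieces, which is exactly why rare strings cost more.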

This tool uses OpenAI’s official cl100k_base and o200k_base tables, the same encoders used in production for GPT-3.5 through GPT-5. The o200k_base table doubled the vocabulary size relative to cl100k_base and tokenizes Chinese, Japanese, Korean, and Arabic with 20 to 40 percent fewer tokens. If you write multilingual prompts, the savings are real money.

Why Count Locally?

Public token counters that send prompts to a remote endpoint are convenient until they aren’t. System prompts and proprietary instructions routinely contain trade secrets, customer data, or unreleased product copy. A network round-trip leaks all of it. The browser-side approach in this tool keeps every byte on your machine and works in airplane mode after first load.

Cross-Provider Cost Comparison

Token counts are not directly comparable across providers because each tokenizer carves text differently. A 1,000-character paragraph might cost 250 OpenAI tokens but only 230 Claude tokens, or vice versa. Cost is the real comparable metric. The cost cards multiply token count by the published per-million input rate so you can compare providers on the same yardstick. For chat-heavy applications, lower-priced models like DeepSeek V3 and Llama 3.3 usually win on raw rate, while GPT-5 and Claude Sonnet 4.6 win on quality per dollar for complex reasoning.
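The arithmetic behind each cost card is a single multiplication. A minimal sketch, using placeholder model names and rates rather than the page's live reference values:

```python
# Hypothetical per-million-input-token rates in USD.
# Check each provider's pricing page for real numbers.
RATES_PER_M_INPUT = {
    "model-a": 1.25,
    "model-b": 0.27,
}

def input_cost_usd(tokens: int, model: str) -> float:
    """Cost of sending `tokens` input tokens to `model`."""
    return tokens * RATES_PER_M_INPUT[model] / 1_000_000

# The same text produces different token counts per provider,
# so compare costs, not counts.
print(input_cost_usd(250, "model-a"))
print(input_cost_usd(230, "model-b"))
```

Note that the comparison only works when each model's own token count feeds its own rate; multiplying one provider's count by another's rate mixes yardsticks.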

Privacy and Network Behavior

After the initial HTML and JavaScript load, this page makes no network requests for token data. The BPE vocabularies are bundled into the JavaScript chunk that ships with the page. Your input never appears in HTTP requests, analytics events, or error logs. You can verify this with the browser DevTools Network tab — once the page is ready, type freely and watch zero new requests appear.

FAQ

Is my prompt sent to any server?

No. Tokenization runs entirely in your browser. The cl100k_base and o200k_base BPE vocabularies ship with the page, and the encoder reads only your textarea. Your prompt never leaves the tab — there is no network call after the initial page load.

How accurate are non-OpenAI counts?

GPT-5, GPT-4.1, GPT-4o, GPT-4 Turbo, and GPT-3.5 counts are exact via OpenAI's official BPE tables. Claude, Gemini, Llama, and DeepSeek use a heuristic adjustment over the o200k_base baseline, calibrated against published samples — typically within 5 to 10 percent of the provider's own tokenizer. For billing-critical estimates, use the provider's count_tokens API on a representative prompt.
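The heuristic adjustment amounts to a scaling factor applied to the o200k_base count. A sketch, where the factor value is illustrative rather than the constant this page actually ships:

```python
def approx_provider_tokens(o200k_count: int, factor: float = 1.08) -> int:
    """Scale an o200k_base token count toward another provider's tokenizer.

    `factor` is a calibration constant fitted against sample texts;
    1.08 here is illustrative, not this page's shipped value.
    """
    return round(o200k_count * factor)

print(approx_provider_tokens(250))  # 250 * 1.08 -> 270
```

Because the factor is fitted on averages, individual prompts drift above or below it, which is where the 5 to 10 percent error band comes from.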

Are the prices live?

Prices are static reference values reviewed in May 2026. Headline rates change without notice, especially for batch tiers, regional billing, and prompt caching. Always verify against the provider's pricing page before committing budget.

What size of input does this handle?

Up to roughly 500 KB of text per session. Beyond that the textarea and tokenizer can stutter on lower-end devices. For batch workloads, count locally with OpenAI's official tokenizer (pip install tiktoken); for Claude, use Anthropic's count_tokens API, since Anthropic does not publish a local tokenizer for its current models.

Why doesn't my application's count match this number?

Production apps include system prompts, function or tool definitions, image and audio tokens, and chat formatting overhead that this counter intentionally ignores. The counter measures one body of plain text. To debug a billing line item, paste only the raw user content and compare against your application's user-only token field.
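The gap has a mechanical source: chat APIs wrap every message in formatting tokens. A rough accounting sketch, using the roughly-3-tokens-per-message overhead from OpenAI's published counting guidance as an estimate, not a billing-exact constant for every model:

```python
def chat_prompt_tokens(message_token_counts,
                       per_message_overhead=3,
                       reply_primer=3):
    """Estimate total chat prompt tokens from per-message content counts.

    Each message carries formatting overhead, and a few extra tokens
    prime the assistant's reply. Both constants are approximations.
    """
    return reply_primer + sum(per_message_overhead + n
                              for n in message_token_counts)

# A system prompt of 12 tokens plus a user message of 40 tokens
# bills as more than 52 tokens once formatting is added.
print(chat_prompt_tokens([12, 40]))  # -> 61
```

This is why pasting only the raw user content into the counter, then comparing against the user-only field in your application's usage breakdown, gives a like-for-like check.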