This site is not affiliated with, endorsed by, or connected to Anthropic, PBC. Claude and the Claude logo are trademarks of Anthropic. All pricing shown is sourced from public Anthropic documentation. Verify current pricing at claude.com/pricing.
Updated April 2026

Claude Token Costs 2026: What Tokens Are, What They Cost, and How to Calculate Your Bill

Tokens are the unit of measurement for Claude API billing. Subscriptions (Pro, Max, Team) do not use token billing. Here is everything you need to understand costs.

What Is a Token?

A token is roughly 4 characters or 0.75 words of English text. Claude tokenizes text differently from how humans read words: common words are single tokens, while rare or long words may split into multiple tokens.

| Content Type | Approximate Size | Approximate Tokens |
| --- | --- | --- |
| Tweet / short message | 280 characters | ~70 tokens |
| Paragraph of text | 100 words | ~130 tokens |
| Standard essay | 800 words | ~1,060 tokens |
| Short story | 5,000 words | ~6,700 tokens |
| Academic paper | 8,000 words | ~10,700 tokens |
| Code file (500 lines) | ~15,000 characters | ~3,500 tokens |
| Large codebase file (2,000 lines) | ~60,000 characters | ~14,000 tokens |
| Novel chapter | 20,000 words | ~26,700 tokens |
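The rule-of-thumb ratios above (about 4 characters per token, about 0.75 words per token) can be turned into a quick estimator. This is a rough heuristic, not Claude's actual tokenizer, so real counts will differ, especially for code and rare words:

```python
# Rough token estimation for English text. These heuristics mirror
# the ratios used in this article; real tokenizer output will vary.

def estimate_tokens(text: str) -> int:
    """Estimate token count from character length (~4 chars/token)."""
    return max(1, round(len(text) / 4))

def estimate_tokens_from_words(word_count: int) -> int:
    """Estimate token count from word count (~0.75 words/token)."""
    return round(word_count / 0.75)

# An 800-word essay lands near the ~1,060 tokens shown in the table.
print(estimate_tokens_from_words(800))  # → 1067
```

For billing estimates, the word-based figure is usually close enough; for exact counts before a request, use the token counting endpoint described in the FAQ below.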

API Token Pricing by Model

| Model | Input (per 1M tokens) | Output (per 1M tokens) | Batch Input | Batch Output | Context Window |
| --- | --- | --- | --- | --- | --- |
| Claude Opus 4.6 | $5.00 | $25.00 | $2.50 | $12.50 | 1M (API) |
| Claude Sonnet 4.6 | $3.00 | $15.00 | $1.50 | $7.50 | 200K |
| Claude Haiku 4.5 | $1.00 | $5.00 | $0.50 | $2.50 | 200K |
| Claude Haiku 3.5 | $0.80 | $4.00 | $0.40 | $2.00 | 200K |
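The per-request arithmetic from this table is straightforward: tokens divided by one million, times the per-million rate, input and output summed separately. A minimal sketch (the model keys are informal labels for this example, not official API model IDs):

```python
# Cost of a single API request at the standard (non-batch) rates
# from the pricing table. Keys are informal labels, not model IDs.

PRICES_PER_MILLION = {            # (input, output) in USD per 1M tokens
    "opus-4.6":   (5.00, 25.00),
    "sonnet-4.6": (3.00, 15.00),
    "haiku-4.5":  (1.00, 5.00),
    "haiku-3.5":  (0.80, 4.00),
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request: tokens / 1M * per-million rate."""
    in_price, out_price = PRICES_PER_MILLION[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000

# A 10K-token prompt with a 1K-token reply on Sonnet 4.6:
print(request_cost("sonnet-4.6", 10_000, 1_000))  # → 0.045
```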

Token Cost Calculator

Estimate your Claude API costs based on word count. This is for API billing only - subscriptions (Pro, Max, Team) do not charge by token.

Worked example (Claude Sonnet 4.6 rates):

- Input tokens: 665
- Output tokens: 266
- Cost per request: $0.0060
- Monthly total: $5.99
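The monthly figure follows from the per-request cost multiplied by request volume. A sketch, assuming Sonnet 4.6 rates ($3 input / $15 output per 1M tokens) and a hypothetical volume of 1,000 requests per month:

```python
# Monthly API-spend projection from per-request token counts.
# The 1,000 requests/month volume is an assumption for illustration.

def monthly_cost(input_tokens: int, output_tokens: int, requests: int,
                 in_price: float = 3.00, out_price: float = 15.00) -> float:
    """Projected monthly API spend in USD."""
    per_request = (input_tokens * in_price + output_tokens * out_price) / 1_000_000
    return per_request * requests

# 665 input / 266 output tokens per request, 1,000 requests:
# roughly $5.99/month, matching the example above.
print(round(monthly_cost(665, 266, 1000), 2))
```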

At this usage level, a Claude Pro subscription at $20/month would give you approximately the same usage plus Claude Code and Cowork - potentially better value for individual use.

Subscriptions Do NOT Bill by Token

Common confusion clarified

Claude Pro, Max, Team, and Enterprise plans do NOT use token-based billing. The question "how many tokens do I get with Claude Pro?" does not have a direct answer because tokens are not the unit of account.

Subscriptions use a usage-window model. Anthropic measures compute usage (which is related to but not identical to token count) in a rolling 5-hour window. When you hit your window limit, you wait for it to reset - there is no per-token charge.

Frequently Asked Questions

Do Claude subscriptions charge by token?
No. Claude Pro, Max, Team, and Enterprise subscriptions do not bill by token. Subscriptions use a usage-window model based on compute time and the complexity of your requests. You never see a token count or a per-token charge on a subscription plan. Token-based billing only applies to the Anthropic API (accessed via API key through the Anthropic Console).
How do I count tokens before sending a request?
Anthropic provides a token counting API endpoint that returns the exact token count for a given message before you send it. You can also use rough estimates: approximately 750 words per 1,000 tokens, or 4 characters per token. The Anthropic Python and TypeScript SDKs expose the counting endpoint directly, so you can check counts without running a full request.
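A pre-flight count might look like the sketch below, using the Anthropic Python SDK's count-tokens method when an API key is available and falling back to the 4-characters-per-token estimate otherwise. The model ID string is an assumption for illustration:

```python
# Pre-flight token counting: exact via the Anthropic SDK when
# possible, otherwise the ~4 chars/token fallback from this article.

def rough_token_count(text: str) -> int:
    """Offline fallback: ~4 characters per token."""
    return max(1, len(text) // 4)

def count_tokens(text: str, model: str = "claude-sonnet-4-6") -> int:
    try:
        import anthropic  # needs ANTHROPIC_API_KEY in the environment
        client = anthropic.Anthropic()
        result = client.messages.count_tokens(
            model=model,
            messages=[{"role": "user", "content": text}],
        )
        return result.input_tokens
    except Exception:
        # SDK missing or no key: fall back to the rough estimate.
        return rough_token_count(text)

print(count_tokens("How many tokens is this sentence?"))
```

The exact count matters most near context-window limits or when budgeting large batch jobs; for everyday estimates the fallback is usually within a few percent.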
What is the context window and does it affect cost?
The context window is the maximum amount of text (measured in tokens) that Claude can process in a single request - including both your input and Claude's output. Larger context windows cost more because they require more memory and compute. Claude Sonnet 4.6 has a 200K token context window. At subscription level, you pay a flat rate and the context window is included. At API level, you pay per token for both input and output.
Does Claude count output tokens separately?
Yes. In the API, input tokens (what you send) and output tokens (what Claude generates) are priced separately. Output tokens are typically priced higher because they require more compute to generate. Output tokens also count toward the context window used by each request. In subscriptions, this distinction does not affect your billing.
How does the Batch API halve costs?
The Batch API processes requests asynchronously with results delivered within 24 hours. Because the compute is scheduled flexibly rather than in real-time, Anthropic can run it at 50% of the standard API price. All models support batch processing. The Batch API is accessible via the same API key - you simply use a different endpoint and accept the latency trade-off.
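Since batch rates are exactly half the standard rates, the trade-off is easy to quantify. A sketch using the Sonnet 4.6 rates from the pricing table above:

```python
# Standard vs. batch pricing at Sonnet 4.6 rates ($3/$15 per 1M).
# Batch jobs run at 50% of standard price, with results in ~24 hours.

def api_cost(input_tokens: int, output_tokens: int,
             in_price: float = 3.00, out_price: float = 15.00,
             batch: bool = False) -> float:
    """USD cost of one job; batch processing halves the price."""
    cost = (input_tokens * in_price + output_tokens * out_price) / 1_000_000
    return cost * 0.5 if batch else cost

# A 500K-input / 100K-output job:
print(api_cost(500_000, 100_000))              # → 3.0
print(api_cost(500_000, 100_000, batch=True))  # → 1.5
```

For latency-insensitive workloads such as evaluations, backfills, or bulk summarization, the 24-hour window is often an easy trade for a 50% discount.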
