Context window is a term that comes up constantly in AI buying decisions. Here's a plain-English explainer of what it is, why it matters, and what 'long context' actually unlocks.
A context window is the maximum amount of text (and other data) an LLM can read and reason about in a single conversation. Measured in tokens (roughly 4 characters or 0.75 words). Claude Sonnet 4.7 supports 200K tokens standard, with 1M available on some tiers; GPT-5 class is similar. 1M tokens is about 750,000 words — a 2000-page book.
Longer context window means the model can hold more in its 'attention' at once. Practical implications:
Most B2B workflows do not need extreme context. 200K tokens (Claude Sonnet standard) handles 99% of business tasks easily. Reach for 1M context only when you have a specific reason — analyzing very long documents, full codebase reasoning.
About 4 characters or 0.75 words on average. 1,000 tokens is roughly 750 words.
No. Bigger is better when needed; otherwise wastes cost and can slow output. Right-size for the task.
Claude Sonnet 4.7 supports 200K tokens standard; 1M tokens available on certain tiers.
Comparable to Claude — large, variable by tier.
Not by itself. Quality, integration, and workflow fit matter more for most teams.