What is a context window?
Short answer
A context window is the maximum number of tokens a language model can take into account at one time. It covers everything in the exchange: the system instructions, your prompt, any attached content, and the model's own reply.
What counts toward the window
The context window is a budget measured in tokens. Everything the model sees in a turn draws from it:
- System or developer instructions
- The conversation history so far
- Your current prompt and any pasted or attached content
- The response the model generates, which also consumes tokens
How big are they
Context windows have grown quickly. Different models support very different sizes, ranging from a few thousand tokens to well over a million. Larger windows let a model work with more material at once, but the relevant input still has to fit alongside the reply.