Why AI Forgets

Your conversation has a hard limit. Watch what happens when you hit it.

The Analogy

A Desk That Can Only Hold So Many Papers

Every new page you lay down pushes the oldest page off the edge. That's the context window -- a hard limit on how much AI can hold in its mind at once. Your system prompt, your messages, and every word the AI says back? All sharing the same desk.

Stage 1 -- The Window

Your AI's Entire Memory

0 / 8,192 tokens
Context Window: 8,192 tokens
Used
0%

This is your AI's entire memory. Everything it knows about this conversation has to fit in here. Your system prompt just took the first 35 tokens.

Stage 2 -- Growing

Every Message Fills the Window

0 / 8,192 tokens
Context Window: 8,192 tokens
Used
0%
Stage 3 -- Overflow

When the Desk Is Full

0 / 8,192 tokens
Context Window: 8,192 tokens
Used
0% FULL
Stage 4 -- The Practical View

Not All Windows Are Equal

The same conversation that overflows a small window might barely register in a larger one.

Takeaway

For long conversations: restart the chat when the AI starts ignoring your instructions -- it probably forgot them. Put your most important instructions at the beginning AND the end of your prompt. And remember, everything AI says back to you also fills the window.

The context window is finite, and the model's trained knowledge has gaps. So how do modern AI tools give answers grounded in your specific documents and data? Explore RAG →