There's a line Stephen King wrote in two separate books that I keep turning over, because of how it maps onto a common problem with AI chatbots.
In The Shining, depending on which version you've encountered, it goes something like this:
*The arguments against insanity fall through with a soft shirring sound. Layer on layer.*
He was writing about a man losing his grip on reality inside a hotel that had decided to help him along.
The line also appears in Lisey’s Story, in which Lisey likewise feels she is taking crazy pills:
“The arguments against insanity fall through with a soft shirring sound; these are the sounds of dead voices on dead records floating down the broken shaft of memory.”
I keep thinking about it every time I watch a long AI conversation slowly eat itself.
Context collapse in AI is quiet, which is why people so often miss it.
There's no jump scare, just a slow drift (a soft shirring…).
You start a long chat session with a clear set of instructions: write in this tone, remember this constraint, keep this persona, no em-dashes. The chatbot eagerly nods along, like a good little sycophant.
But thirty exchanges later, it has stopped following the rules and you find yourself cursing at an emotionless metal box. Earlier instructions have drifted outside its effective attention, and newer inputs have filled the space like blood rushing out of an elevator into a hallway.
Even more infuriating: the tool doesn't know it forgot.
Jack Torrance doesn't lose his mind all at once because that would be too clean, and it would be a boring story.
He loses it conversationally, incrementally.
The hotel feeds him inputs he can't verify, and he starts weighing the recent, vivid, emotionally charged ones over the older, quieter signal of reality.
His anchor (his family, his sobriety, his own prior self) falls outside his effective attention.
“The arguments against insanity fall through with a soft shirring sound. Layer on layer.”
An LLM (think GPT-5.4 or gemini-3.1) processing a 40,000-token conversation is doing something structurally analogous. (If you're not familiar with tokens: for a sense of scale, the entire seven-book Harry Potter series comes to roughly a million tokens.)
The model's attention has to work across everything you've exchanged. The earlier material (the rules you set, the context you carefully established, the constraints you thought were load-bearing) gets progressively demoted as newer tokens crowd the window. The model can't attend to everything equally, and something has to give.
What gives is always the old stuff, which is typically the foundational stuff you thought would stay constant.
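To make the failure mode concrete, here's a toy sketch (mine, not any vendor's actual implementation) of what a hard token budget does to a conversation when only the most recent turns are kept. Real models degrade more softly than this, but the shape is the same: the foundational instructions are exactly what falls out first. The word-count "tokenizer" and the message text are stand-ins.

```python
# Toy illustration only: not how any real model manages attention, just the
# shape of the problem. Tokens are approximated by word counts.

def count_tokens(text: str) -> int:
    return len(text.split())  # crude stand-in for a real tokenizer

def fit_to_budget(turns: list[str], budget: int) -> list[str]:
    """Keep the most recent turns that fit; the earliest ones fall out first."""
    kept, used = [], 0
    for turn in reversed(turns):      # walk backward from the newest turn
        cost = count_tokens(turn)
        if used + cost > budget:
            break                     # everything older than this point is gone
        kept.append(turn)
        used += cost
    return list(reversed(kept))

conversation = [
    "SYSTEM: write in this tone, remember this constraint, keep this persona, no em-dashes",
    *[f"exchange #{i}: newer, vivid, emotionally charged input" for i in range(1, 40)],
]

window = fit_to_budget(conversation, budget=200)
print("rules still in view:", any(t.startswith("SYSTEM") for t in window))  # False
```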
Bigger context window = solved problem, right?
The research says otherwise.
Models can become less reliable even with significantly larger context windows: attending properly to all that material is genuinely hard, and the model's confidence stays high while its grip quietly loosens (...soft shirring).
Every time you compress a long conversation into a new prompt, you lose nuance.
Do it enough times and it's like rewriting a document until the original notes have vanished: a polished version of something that has drifted a long way from where you started.
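Here's a toy version of that loop, with a deliberately dumb summarizer standing in for the model (it just keeps the first 60% of the words each pass). A real model is far smarter, but every pass is still lossy, and the losses compound.

```python
def lossy_summarize(text: str, keep: float = 0.6) -> str:
    # Stand-in for "compress this conversation into a new prompt".
    words = text.split()
    return " ".join(words[: max(1, int(len(words) * keep))])

transcript = " ".join(f"detail-{i}" for i in range(200))  # stand-in for a long chat
for session in range(1, 6):
    transcript = lossy_summarize(transcript)
    print(f"after compression #{session}: {len(transcript.split())} details survive")

# Roughly 15 of the original 200 details remain after five passes, and nothing
# in the output flags which 185 quietly vanished.
```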
The goal is to orchestrate your sessions so drift hurts less when it happens. And if you use these tools enough, it will happen.
King understood something that applies here: the scariest part of the collapse isn't the collapse itself…it's the overconfidence that things are going along just fine.
Jack doesn't spiral into insanity knowing he's spiraling. He arrives there feeling certain he is right.
A chatbot won’t flag its uncertainty in the middle of a context drift. It will keep answering you with confidence as it slowly pulls away from your goal.
That's on us (the humans) to catch.
So, step outside the hotel. Start a new session. Re-anchor. Reread the original notes before they're gone.
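In practice, re-anchoring can be that mechanical. A minimal sketch, assuming the role/content message shape many chat APIs share (the brief text and function name here are made up for illustration):

```python
ORIGINAL_BRIEF = """\
Write in this tone. Remember this constraint. Keep this persona. No em-dashes.
Treat these rules as standing for the whole session.
"""

def new_session(first_request: str) -> list[dict]:
    # Fresh context, with the untouched brief pinned at the top.
    return [
        {"role": "system", "content": ORIGINAL_BRIEF},
        {"role": "user", "content": first_request},
    ]

messages = new_session("Pick up the draft from section 3, same rules as before.")
print(messages[0]["content"])  # the rules exactly as written, not as remembered
```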
You wrote the thing. You remember what it meant.