SYNTHESIS NOTE
Model Architecture and Internals Agentic Systems and Tool Use Reasoning, Retrieval, and Evaluation

Can LLMs read long documents like humans do?

How might mimicking human reading strategies—storing gist memories and looking up details on demand—help language models handle documents beyond their effective context window?

Synthesis note · 2026-06-03 · sourced from Memory

LLMs are limited not only by an explicit context window but by degrading performance on long inputs well before that limit. ReadAgent's premise is that humans read differently: exact wording is forgotten quickly while gist — the substance irrespective of exact words — persists, and reading is interactive (we look back when we need a detail). It implements this as a simple prompting system that (1) decides what content to store together as a memory episode, (2) compresses each episode into a short gist memory, and (3) looks up the original passages only when a task requires the details. This extends effective context 3–20× and outperforms retrieval baselines on QuALITY, NarrativeQA, and QMSum.

The keeper is that the LLM can generate broadly useful gist memories before knowing the task — compression need not be query-conditioned to be useful — and can then reason interactively over those gists to decide what to retrieve. Gist-first-then-lookup is a different long-context strategy than either stuffing the window or pure retrieval.

This is the gist-compression member of the vault's long-context/memory cluster. It shares the compress-then-act move with Can agents compress their own memory without losing critical details? and the bounded-state philosophy of Can agents fail from weak memory control rather than missing knowledge? — but applied to reading documents rather than managing agent state, and beating retrieval rather than replacing it.

Inquiring lines that use this note as a source 11

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
13 direct connections · 85 in 2-hop network ·medium cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

a human-inspired reading agent compresses documents into gist memories and looks up details on demand extending effective context twentyfold