Can agents fail from weak memory control rather than missing knowledge?
As multi-turn agent workflows grow longer, performance degrades—but is this due to insufficient context or poor memory management? This explores whether memory *control* is the real bottleneck.
As multi-turn workflows grow, agents degrade — constraint focus is lost, errors accumulate, decisions drift. The instinct is to blame missing knowledge and add more context. This paper relocates the cause: the failures are driven by weak memory control, not knowledge gaps.
The two dominant patterns both fail in characteristic ways. Transcript replay appends prior interactions to the prompt, so context grows with turn count, attention selectivity drops, and early errors persist and reappear — hallucination carryover and constraint drift. Retrieval-based memory bounds prompt length but adds selection error: stale, conflicting, or injected artifacts perturb the current task state (the authors had to cap retrieval at three artifacts per turn to limit drift escalation).
The Agent Cognitive Compressor (ACC) replaces accumulation with a bounded, schema-governed internal state — the Compressed Cognitive State — updated by controlled replacement rather than growth. Crucially, it separates artifact recall from state commitment: the agent can consult artifacts without those artifacts automatically becoming persistent memory. This makes the write path explicit and auditable, and keeps the memory footprint bounded against poisoning and noisy recall.
The design echoes Is agent memory a storage problem or a connectivity problem? from the opposite direction: where that note argues retrieval quality is about access structure, ACC argues the more basic lever is gating what is allowed to commit at all. Both reject the "store everything, replay everything" default.
Inquiring lines that use this note as a source 8
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can the same compress-then-act pattern work for agent state memory?
- How do memory tools and planning each contribute to agent efficiency?
- How do memory hierarchies and compression reduce context management demands?
- Why is digital context more volatile than conventional software context?
- Why do weaker agents need more aggressive context compression than stronger ones?
- How does external context control compare to agents managing their own state internally?
- Should optimal context budgets scale with agent competence or task complexity?
- Can externalizing bookkeeping to a stateful harness replace internalized memory control?
Related concepts in this collection 3
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can agents compress their own memory without losing critical details?
Explores whether agents can autonomously consolidate interaction history into structured memory schemas that reduce token overhead while preserving information needed for long-horizon reasoning and strategic reflection.
convergent compression-over-accumulation move
-
Can externalizing bookkeeping improve search agent performance?
Does moving routine state management out of the policy and into a stateful environment harness free reinforcement learning to focus on genuine semantic decisions? This explores whether division of labor between environment and model improves search efficiency.
Harness-1 externalizes the same bookkeeping; ACC internalizes it as a committed state — two routes to bounded state
-
Is long-context bottleneck really about memory or compute?
Explores whether the challenge of handling long context windows stems from storage capacity limits or from the computational cost of transforming context into internal state. Understanding this distinction reshapes how we design language models.
the committed state is exactly that transformed internal representation
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- AI Agents Need Memory Control Over More Context
- From Model Scaling to System Scaling: Scaling the Harness in Agentic AI
- Why Do Multi-agent LLM Systems Fail?
- Useful Memories Become Faulty When Continuously Updated by LLMs
- Agent Workflow Memory
- Real-Time Procedural Learning From Experience for AI Agents
- LLMs Corrupt Your Documents When You Delegate
- Towards a Science of Scaling Agent Systems
Original note title
multi-turn agent failure is weak memory control not missing knowledge — a bounded committed state beats transcript replay