Can agents fail from weak memory control rather than missing knowledge?

As multi-turn agent workflows grow longer, performance degrades—but is this due to insufficient context or poor memory management? This explores whether memory *control* is the real bottleneck.

Synthesis note · 2026-06-03 · sourced from Memory

As multi-turn workflows grow, agents degrade — constraint focus is lost, errors accumulate, decisions drift. The instinct is to blame missing knowledge and add more context. This paper relocates the cause: the failures are driven by weak memory control, not knowledge gaps.

The two dominant patterns both fail in characteristic ways. Transcript replay appends prior interactions to the prompt, so context grows with turn count, attention selectivity drops, and early errors persist and reappear — hallucination carryover and constraint drift. Retrieval-based memory bounds prompt length but adds selection error: stale, conflicting, or injected artifacts perturb the current task state (the authors had to cap retrieval at three artifacts per turn to limit drift escalation).

The Agent Cognitive Compressor (ACC) replaces accumulation with a bounded, schema-governed internal state — the Compressed Cognitive State — updated by controlled replacement rather than growth. Crucially, it separates artifact recall from state commitment: the agent can consult artifacts without those artifacts automatically becoming persistent memory. This makes the write path explicit and auditable, and keeps the memory footprint bounded against poisoning and noisy recall.

The design echoes Is agent memory a storage problem or a connectivity problem? from the opposite direction: where that note argues retrieval quality is about access structure, ACC argues the more basic lever is gating what is allowed to commit at all. Both reject the "store everything, replay everything" default.

Inquiring lines that use this note as a source 8

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

14 direct connections · 95 in 2-hop network ·medium cluster Open in graph ↗

Can agents fail from weak memory control rather … Can agents compress their own memory without losin… Can externalizing bookkeeping improve search agent… Is long-context bottleneck really about memory or …

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Can agents compress their own memory without losing critical details? Explores whether agents can autonomously consolidate interaction history into structured memory schemas that reduce token overhead while preserving information needed for long-horizon reasoning and strategic reflection.
convergent compression-over-accumulation move
Can externalizing bookkeeping improve search agent performance? Does moving routine state management out of the policy and into a stateful environment harness free reinforcement learning to focus on genuine semantic decisions? This explores whether division of labor between environment and model improves search efficiency.
Harness-1 externalizes the same bookkeeping; ACC internalizes it as a committed state — two routes to bounded state
Is long-context bottleneck really about memory or compute? Explores whether the challenge of handling long context windows stems from storage capacity limits or from the computational cost of transforming context into internal state. Understanding this distinction reshapes how we design language models.
the committed state is exactly that transformed internal representation

Can agents fail from weak memory control rather than missing knowledge?

Related concepts in this collection 3

Related papers in this collection 8

Search by related questions 4