Why do CoALA and Letta disagree on what counts as working memory?

This reads as asking why two agent-memory frameworks draw the boundary of 'working memory' in different places — and the corpus doesn't name CoALA or Letta directly, but it does hold the conceptual fault line that produces such disagreements.

This explores why agent frameworks disagree on what 'working memory' means — and a candid note first: the collection doesn't contain notes naming CoALA or Letta specifically, so I can't tell you their exact framings. What it *does* have is the reason any two such systems will draw that line differently: 'working memory' isn't one thing, so where you put the boundary depends on which design axis you privilege. The most direct map here shows that agent working memory decomposes into four components along two axes — dialogue-level (running conversation history, scratchpad) versus turn-level (in-context examples, the current task trajectory) How should agent memory split across time scales?. Two frameworks can both say 'working memory' and mean opposite ends of that grid: one counts the persistent conversation buffer, the other counts only the volatile per-turn scratch state. The disagreement is definitional, not empirical.

A second source of divergence is whether you anchor the definition in *architecture* or in *cognitive analogy*. One line in the corpus maps agent memory onto the brain's complementary learning systems: transformer weights as a consolidated neocortex, retrieval (RAG) as hippocampal rapid encoding, and agentic state as prefrontal executive control Can brain memory systems explain how LLMs should store knowledge?. If a framework defines working memory by that prefrontal/executive-control analogy, it will scope it narrowly to active task state. If instead it defines it operationally — 'whatever currently sits in the context window' — the scope balloons to include retrieved documents and history. Same term, two reference frames, guaranteed disagreement.

There's also a functional axis: is working memory just the buffer, or is it a *workspace that does work*? Stateful narrative reasoning research shows the payoff of a persistent memory workspace that doesn't merely store but actively detects and resolves contradictions across retrieval cycles, beating stateless multi-step retrieval by up to 11% Can reasoning systems maintain memory across retrieval cycles?. A framework that treats working memory as this active reasoning workspace will include machinery (reflection, contradiction-checking) that a framework treating it as passive storage would file under something else entirely.

The thing worth carrying away: definitional fights about 'working memory' in agents are downstream of an unresolved design question — should memory tiers be carved by *time scale* (turn vs. session), by *cognitive analogy* (executive vs. index vs. consolidated store), or by *function* (passive buffer vs. active workspace)? Each carving is defensible, each predicts different failure modes and update policies, and none has won. So two frameworks 'disagreeing' is less a contradiction than two reasonable answers to a question the field hasn't settled. If you want to go deeper on why the tiers don't cleanly integrate, the complementary-learning-systems note flags exactly the missing consolidation mechanisms that keep these definitions from converging Can brain memory systems explain how LLMs should store knowledge?.

Sources 3 notes

How should agent memory split across time scales?

RAISE shows that agent memory consists of four components organized by two design axes: dialogue-level (conversation history, scratchpad) versus turn-level (examples, task trajectory). This granularity distinction predicts different failure modes and update policies for each component.

Can brain memory systems explain how LLMs should store knowledge?

Research shows transformer weights function as a distributed neocortex for consolidated knowledge, RAG stores as hippocampal indexing for rapid encoding, and agentic state as prefrontal executive control. The CLS framework predicts why hybrid systems outperform single-tier approaches and identifies missing consolidation mechanisms that prevent memory integration.

Can reasoning systems maintain memory across retrieval cycles?

ComoRAG demonstrates that iterative evidence acquisition with a persistent memory workspace outperforms stateless multi-step retrieval by detecting and resolving contradictions through deeper exploration, achieving up to 11% gains on complex queries.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a systems analyst evaluating whether agent framework definitions of 'working memory' have converged or diverged since mid-2024.

The durable question: Do CoALA, Letta, and similar agentic systems agree on which memory tier is 'working memory'—or does the disagreement reflect unsettled design choices in how memory ought to be carved?

What a curated library found — and when (findings span 2024–2026; treat as dated claims, not current truth):
• Agent working memory decomposes into four components along dialogue-level vs. turn-level axes; frameworks disagree on where to draw the boundary (2025–2026).
• Memory definitions diverge when anchored to cognitive analogy (prefrontal = narrow, executive-control scope) versus operational scope ('whatever is in context window'). Same term, opposite reference frames (2025–2026).
• Stateful narrative reasoning—treating working memory as an *active workspace* that detects and resolves contradictions—outperforms stateless retrieval by ~11%, suggesting function (passive buffer vs. active workspace) is a design axis that partitions frameworks (2025–2026).
• Complementary learning systems map transformer weights (neocortex) + RAG (hippocampus) + agentic state (prefrontal), but consolidation mechanisms between these tiers remain missing (2025–2026).

Anchor papers (verify; mind their dates):
• arXiv:2601.09113 *The AI Hippocampus: How Far are We From Human Memory?* (2026)
• arXiv:2508.10419 *ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning* (2025)
• arXiv:2603.29025 *The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning* (2026)
• arXiv:2605.12978 *Useful Memories Become Faulty When Continuously Updated by LLMs* (2026)

Your task:
(1) **RE-TEST each carving axis.** For time-scale partitioning (turn vs. session), operational scope (context-window footprint), and functional scope (passive vs. active workspace), has any framework *empirically* settled which axis dominates? Check whether newer model architectures (e.g., cross-attention patterns, streaming decoding, state-checkpointing in agents) or training methods (e.g., memory-aware fine-tuning, RL for reflection) have collapsed one axis into another. Separate the durable question ('what memory tier should handle contradiction-detection?') from perishable constraints ('working memory must fit in a single context window'). Cite what resolved it.
(2) **Surface the strongest work from ~6 months ago that *contradicts* the synthesis**—either by showing frameworks *do* converge on a definition, or by proposing a fourth carving axis the library missed.
(3) **Propose 2 research questions** that assume the regime *has* moved: e.g., 'If active workspace reasoning now scales beyond a single turn, does the dialogue/turn distinction still matter?' or 'Can consolidation mechanisms (e.g., online distillation into weights) unify the cognitive-analogy carving with the operational one?'

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Why do CoALA and Letta disagree on what counts as working memory?

Sources 3 notes

Next inquiring lines