Can agents learn new skills without forgetting old ones?
Explores whether externalized skill libraries—storing learned behaviors as retrievable code rather than parameter updates—can solve the catastrophic forgetting problem that plagues continual learning systems.
VOYAGER introduces an architecture for lifelong learning that solves the catastrophic forgetting problem through externalization rather than internal parameter management. Three components work together:
Automatic curriculum — proposes tasks based on the agent's current skill level and world state (finding yourself in a desert means harvesting sand before iron). Generated by GPT-4 with the overarching goal of "discovering as many diverse things as possible" — an in-context form of novelty search.
Ever-growing skill library — each successfully completed task produces an executable code program stored in the library, indexed by the embedding of its description. When similar situations arise, relevant skills are retrieved by semantic similarity. This externalizes learned behavior as retrievable artifacts rather than weight updates.
Iterative prompting with environment feedback — incorporates execution errors, environment feedback, and self-verification for program improvement. The agent refines skills based on real-world outcomes.
The compounding mechanism is the key insight: complex skills are synthesized by composing simpler programs. Fighting zombies builds on combat primitives; navigating a cave builds on movement and resource-gathering skills. This composition enables rapid capability growth without the forgetting that plagues weight-update-based continual learning methods.
Three lifelong learning requirements are met: (1) propose suitable tasks based on current capability and context, (2) refine skills from environmental feedback and commit to memory, (3) continually explore in a self-driven manner. These parallel the three requirements of the When should proactive agents push toward their goals versus accommodate users? framework — goal awareness, context adaptation, and initiative.
Because Can agents learn from failure without updating their weights?, VOYAGER's skill library is a more structured version of the same principle: externalize learning as retrievable artifacts. The embedding-indexed retrieval means skills are found by semantic similarity, not exact match — enabling transfer to novel but related situations.
Since Can communication pressure drive agents to learn shared abstractions?, the skill library pattern may generalize: agents under performance pressure naturally develop reusable, composable abstractions.
MUSE-Autoskill generalizes Voyager's compounding library into an explicit five-stage skill lifecycle — creation, memory, management, evaluation, refinement — turning skills from disposable generation outputs into "long-lived, experience-aware, testable assets." Two extensions matter for the catastrophic-forgetting claim. First, skills are validated through unit tests plus runtime feedback, so the library does not just grow but is continuously checked for reliability — addressing the gap where Voyager stores any successfully-executed program regardless of later robustness. Second, MUSE adds skill-level memory that accumulates per-skill experience across tasks, so reuse improves over time rather than staying static after first synthesis. On SkillsBench, generated skills reach 87.94% on their tasks and transfer to other agents with minimal accuracy loss, evidence that lifecycle management (not just synthesis) is what makes externalized skills durable infrastructure.
Inquiring lines that use this note as a source 103
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can granular sub-task training for function calling improve both open and proprietary models?
- How should GUI agents remember patterns across different software environments?
- Which AI interaction patterns preserve learning while which ones degrade skill formation?
- Why does persistent memory alone fail to create genuine position-holding in models?
- Can environmental scaffolding replace internal memory scaling in agent design?
- Do dynamic environments enable different kinds of agent-environment coevolution?
- Why does fine-tuning for continuous space cause catastrophic forgetting?
- Can tool adaptation work without freezing the agent in the loop?
- How does real tool integration change what agents learn compared to simulated tools?
- Can continuum memory systems prevent catastrophic forgetting in neural networks?
- What makes self-modifying architectures learn their own update rules?
- What memory and planning capabilities do AI companions need for evolving user needs?
- Does narrow reallocation to remaining tasks constitute genuine adaptation?
- Can self-distillation reduce catastrophic forgetting in continual learning?
- Can agentic reasoning outperform rigid rule-based systems for skill refinement?
- What distinguishes collective evolution from vertical self-improvement in agent systems?
- How do evolutionary archives enable diverse exploration in self-improving systems?
- How do agentic systems recover when specialized models operate outside their scope?
- What happens when agents interact with environments and learn from their own mistakes?
- Does outsourcing tasks to AI reduce opportunities for skill development?
- What training difficulty and curriculum settings prevent instability in empathetic agent RL?
- Can online RL and trainable agents maintain persona consistency better than fixed environments?
- Can combinational creativity alone drive open-ended learning in agents?
- Why do memory and feedback loops matter more than model size for agent reliability?
- Can episodic memory alone enable learning without parameter updates?
- What makes memory trajectories topologically stable under persistent reuse?
- How do retention gates regularize forgetting across different sequence model architectures?
- Can agents improve from deployment signals without explicit human annotation?
- Can episodic memory of UI traces improve open-world agent adaptation?
- What alternatives exist when required knowledge is absent from training?
- How does component-level self-evolution prevent information loss in multi-agent trajectories?
- Why does fine-tuning models for continuous reasoning cause catastrophic forgetting?
- Can state-indexed memory retrieval breadth predict gains in web agent robustness?
- How does PRAXIS differ architecturally from Agent Workflow Memory and causal rule learning?
- What details do high-level trajectory abstractions lose that state-grounded recall preserves?
- What role does self-learning play in improving agent reasoning without annotation?
- Can curriculum approaches teach agents when to stop exploring?
- Does training on granular tasks beat training on the full function calling problem?
- Does parameter isolation per task enable online updates without retraining?
- Why do completion-mode strengths not transfer to agentic settings?
- Can small numbers of curated demonstrations produce emergent agentic behavior?
- How do agents discover and construct new APIs from existing applications?
- What execution-layer design prevents agents from passively reacting to environments?
- Can agentic AI tools deliver productivity gains on learning tasks differently?
- How should humans specify deterministic abstractions of RL problems?
- How do self-evolving curricula help RL break beyond base model capability boundaries?
- Can curator modules trained on one executor transfer to entirely different agent backbones?
- When should agent-created code be promoted into permanent harness infrastructure?
- Can topology repair fix consolidation failures in agent memory?
- How does context budget create tradeoffs between memory and skills?
- Can pruning policies alone solve working memory bloat in agents?
- How do agents decide which created code should persist versus disappear?
- How should human oversight apply to persistent agent-authored code?
- Can one-off agent code be safely promoted to durable infrastructure?
- Can individual skills improve through reuse and accumulate experience across tasks?
- Do learned workflows transfer between different agents with minimal accuracy loss?
- How can memory shift from a passive datastore to an actively trained component?
- How do externalizing cognitive work and coordination infrastructure relate to agent reliability?
- When does memory consolidation help agents instead of hurting performance?
- Why does capability discovery become the bottleneck in large agent systems?
- Does workflow-level memory or state-action memory better capture reusable agent knowledge?
- Can applicability conditions be preserved automatically when agents reflect on trials?
- How does adversarial collapse threaten unsupervised self-play skill construction?
- Why does a replay mechanism prevent reasoner skills from over-specializing?
- Can AI models retain knowledge across changing environments without catastrophic forgetting?
- How do agents automatically generate suitable learning tasks based on current capability?
- Can skill validation through testing prevent unreliable programs from accumulating?
- Why do continuously consolidated agent memories eventually degrade below no-memory baseline?
- How do you prevent stale reward signals when skills evolve during deployment?
- What makes idle window detection valuable for continuous agent improvement?
- Can skill libraries prevent redundant narrow artifacts from proliferating?
- How do skills authored in-loop validate faster than offline generated skills?
- What lifecycle management prevents in-loop skill creation from bloating an agent?
- What training method supports dynamic tool discovery in long-horizon agents?
- Can offline recurrent passes replicate sleep-based memory consolidation in AI?
- How much can externalized skills improve models before hitting diminishing returns?
- How do transformers stitch together learned behaviors when adapting to new tasks?
- Why does specializing to one task make future task learning harder?
- How does KL regularization prevent both forgetting and adaptation loss?
- Can zero-weight drift through external memory replace parameter plasticity entirely?
- What distinguishes working memory from strategic memory in agent task execution?
- Why do current metacognitive training loops fail when agents encounter new domains?
- How does curriculum learning prevent instability in social-emotional RL training?
- What makes exploration and reflection rewards verifiable in agentic environments?
- How do external prompt artifacts improve agent behavior compared to inline instructions?
- Can AI systems improve themselves without external feedback?
- How does SDPO relate to agents learning from verbal reflection without parameter updates?
- How does deterministic feature engineering increase information for computationally bounded agents?
- What degradation patterns emerge as relay length increases in delegated tasks?
- How do fast and slow timescales enable continual agent adaptation?
- Can auxiliary modules preserve reasoning without catastrophic forgetting?
- Can models recover knowledge with completely unrelated retraining tasks?
- How does in-weights adaptation create spurious forgetting in models?
- How can a forgetting policy preserve rare knowledge while preventing over-generalization?
- What properties of agent systems only become visible across multiple sessions?
- How does durable memory quality shape agent performance over time?
- Why does memory consolidation degrade agent performance below baseline?
- What can agents learn from the brain's complementary learning systems?
- How does external context control compare to agents managing their own state internally?
- Can externalizing bookkeeping to a stateful harness replace internalized memory control?
- Why does externalized state beat parameter scaling for agent reliability?
- Should we train the evolver or the executor when building self-improving agents?
- How does externalizing reasoning into harness artifacts improve agent reliability?
Related concepts in this collection 10
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can agents learn from failure without updating their weights?
Explores whether language models can improve through trial and error by storing reflections in episodic memory rather than fine-tuning. This matters because it suggests a fundamentally different path to agent adaptation.
related architecture: episodic memory as external learning
-
Can communication pressure drive agents to learn shared abstractions?
Under what conditions do AI agents develop compact, efficient shared languages? This explores whether cooperative task pressure—rather than explicit optimization—naturally drives abstraction formation, mirroring human collaborative communication.
same pattern: reusable abstractions under optimization pressure
-
When should proactive agents push toward their goals versus accommodate users?
Proactive dialogue agents face a tension between reaching their objectives efficiently and keeping users satisfied. This question explores whether these two aims can coexist or require constant negotiation.
parallel requirements for autonomous goal setting
-
Does self-generated training data improve model learning?
Can models learn more effectively from training data they generate themselves rather than data created by external sources? This explores whether a learner's own restructuring process produces better learning outcomes.
SEAL: model-specific data as capability building blocks
-
Can agents learn continuously from experience without updating weights?
This explores whether LLM agents can adapt to new tasks and failures by retrieving past experiences from memory alone, rather than requiring expensive parameter fine-tuning or rigid hardcoded rules.
AgentFly composes cases where VOYAGER composes skills; both achieve continual learning without parameter updates, but AgentFly adds a Q-function for principled case retrieval beyond static similarity
-
Can neural networks learn compositional skills without symbolic mechanisms?
Do neural networks need explicit symbolic architecture to compose learned concepts, or can scaling alone enable compositional generalization? This asks whether compositionality is an architectural feature or an emergent property of scale.
VOYAGER's skill library implements compositional generalization externally: complex skills are synthesized from simpler skill programs, achieving the linear-scaling efficiency the MLP proof demonstrates; the embedding-indexed retrieval ensures the training distribution covers the compositional space
-
Can we teach LLMs to form linguistic conventions in context?
Humans naturally shorten references as conversations progress, but LLMs don't adapt their language for efficiency even when they understand their partners do. Can training on coreference patterns teach this convention-forming behavior?
both VOYAGER and convention formation involve agents developing compact reusable abstractions through interaction: skills are behavioral conventions for task completion, and linguistic conventions are communicative skills for efficient reference; the shared mechanism is that repeated interaction under performance pressure drives abstraction
-
What makes agent-created code artifacts so hard to manage?
Agent-authored code that persists and is shared across systems raises difficult questions about what should be kept versus discarded, and how to maintain consistent state when multiple agents collaborate on the same artifacts.
exemplifies: a compounding skill library is a concrete case of persistent agent-authored artifacts the frontier asks about
-
Does creating skills inside the agent loop eliminate mismatches?
Can coupling skill creation directly to the runtime reasoning loop—rather than authoring skills offline—close the gap between when skills are made and when they're used? This matters for whether agents can ground new capabilities in their actual situated context.
extends: Voyager builds the library by synthesis; MUSE specifies that creation happens in-loop where consumed, preventing creation-usage mismatch
-
Can frozen models learn better by extracting context into skills?
When a model encounters unfamiliar material in its context, can we help it reason more effectively by explicitly extracting rules and procedures from that material rather than changing the model itself?
grounds the accumulating store in a primitive: single-context skill extraction is the unit the compositional library scales and compounds
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- SkillClaw: Let Skills Evolve Collectively with Agentic Evolver
- SkillOS: Learning Skill Curation for Self-Evolving Agents
- MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
- MetaClaw: Just Talk — An Agent That Meta-Learns and Evolves in the Wild
- Voyager: An Open-Ended Embodied Agent with Large Language Models
- Self-distillation Enables Continual Learning
- A Survey of Continual Reinforcement Learning
Original note title
compositional skill libraries that compound through synthesis enable lifelong learning without catastrophic forgetting