Can agents learn new skills without forgetting old ones?

Explores whether externalized skill libraries—storing learned behaviors as retrievable code rather than parameter updates—can solve the catastrophic forgetting problem that plagues continual learning systems.

Synthesis note · 2026-02-23 · sourced from Agents

VOYAGER introduces an architecture for lifelong learning that solves the catastrophic forgetting problem through externalization rather than internal parameter management. Three components work together:

Automatic curriculum: proposes tasks based on the agent's current skill level and world state (an agent that finds itself in a desert should learn to harvest sand before iron). GPT-4 generates the curriculum with the overarching goal of "discovering as many diverse things as possible", an in-context form of novelty search.
Ever-growing skill library: each successfully completed task produces an executable program that is stored in the library and indexed by the embedding of its description. When similar situations arise, relevant skills are retrieved by semantic similarity. This externalizes learned behavior as retrievable artifacts rather than weight updates.
Iterative prompting with environment feedback: incorporates execution errors, environment feedback, and self-verification to improve each program. The agent refines skills based on observed outcomes in the environment.
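The embedding-indexed library described above can be sketched in a few lines. This is a minimal illustration, not VOYAGER's implementation: the `SkillLibrary` class, its method names, and the skill strings are hypothetical, and the bag-of-words `embed` function is a toy stand-in for a learned text-embedding model.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a learned text embedding: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SkillLibrary:
    """Hypothetical store: skill programs indexed by their description's embedding."""

    def __init__(self):
        self._skills = []  # list of (description, embedding, code)

    def add(self, description, code):
        self._skills.append((description, embed(description), code))

    def retrieve(self, query, k=1):
        """Return the k (description, code) pairs most similar to the query."""
        q = embed(query)
        ranked = sorted(self._skills, key=lambda s: cosine(q, s[1]), reverse=True)
        return [(desc, code) for desc, _, code in ranked[:k]]


library = SkillLibrary()
library.add("mine iron ore with a stone pickaxe", "def mine_iron(bot): ...")
library.add("craft a wooden sword", "def craft_sword(bot): ...")
library.add("harvest sand in the desert", "def harvest_sand(bot): ...")

# The query does not match any description exactly; similarity still finds it.
best = library.retrieve("mine some iron ore", k=1)
```

Adding a skill never touches previously stored entries, which is the sense in which an external library avoids forgetting: old skills remain retrievable no matter how many new ones are added.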

The compounding mechanism is the key insight: complex skills are synthesized by composing simpler programs. Fighting zombies builds on combat primitives; navigating a cave builds on movement and resource-gathering skills. This composition enables rapid capability growth without the forgetting that plagues weight-update-based continual learning methods.
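A small sketch of composition, assuming a toy dictionary-based world rather than a real game API. All function names and the crafting rule are hypothetical; the point is only that the higher-level skill is written entirely in terms of skills that already exist.

```python
def move_to(world, place):
    """Primitive skill: update the agent's position."""
    world["position"] = place
    return world


def gather(world, item, count=1):
    """Primitive skill: add items to the inventory."""
    world["inventory"][item] = world["inventory"].get(item, 0) + count
    return world


def craft_torch(world):
    """Mid-level skill (toy rule): one coal plus one stick yields one torch."""
    inv = world["inventory"]
    if inv.get("coal", 0) < 1 or inv.get("stick", 0) < 1:
        raise ValueError("missing ingredients for torch")
    inv["coal"] -= 1
    inv["stick"] -= 1
    inv["torch"] = inv.get("torch", 0) + 1
    return world


def explore_cave(world):
    """Composite skill built only from skills already in the library."""
    gather(world, "coal")
    gather(world, "stick")
    craft_torch(world)
    move_to(world, "cave")
    return world


world = explore_cave({"position": "surface", "inventory": {}})
```

Because `explore_cave` calls the earlier skills instead of re-deriving them, each new skill costs only the code for what is new, which is one plausible reading of why capability compounds.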

Three lifelong learning requirements are met: (1) propose suitable tasks based on current capability and context, (2) refine skills from environmental feedback and commit them to memory, (3) continually explore in a self-driven manner. These parallel the three requirements identified in "When should proactive agents push toward their goals versus accommodate users?": goal awareness, context adaptation, and initiative.
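The three requirements can be read as one loop: propose, refine, commit, then propose again. The sketch below uses deterministic stand-ins for every component, so it runs without a language model or a game. All names (`propose_task`, `attempt`, `learn`) and the task data are hypothetical; in VOYAGER the proposing and refining steps are done by prompting GPT-4.

```python
def propose_task(known_skills, candidates):
    """Curriculum stand-in: pick the first candidate task not yet learned."""
    for task in candidates:
        if task not in known_skills:
            return task
    return None


def attempt(task, draft, required_steps):
    """Environment stand-in: succeed only if the draft contains every required step."""
    missing = [s for s in required_steps[task] if s not in draft]
    return (not missing), missing


def learn(candidates, required_steps, max_rounds=4):
    """Propose tasks, refine drafts from feedback, commit verified skills."""
    library = {}
    while (task := propose_task(library, candidates)) is not None:
        draft = []
        for _ in range(max_rounds):
            ok, missing = attempt(task, draft, required_steps)
            if ok:
                library[task] = list(draft)  # commit only verified skills
                break
            draft.append(missing[0])  # refinement stand-in: fix one reported problem
        else:
            # Out of rounds: drop the task so the curriculum moves on.
            candidates = [t for t in candidates if t != task]
    return library


skills = learn(
    candidates=["collect wood", "craft table", "mine diamond"],
    required_steps={
        "collect wood": ["find tree", "punch tree"],
        "craft table": ["open inventory", "place planks"],
        "mine diamond": ["a", "b", "c", "d", "e"],  # too long for 4 rounds
    },
)
```

Here the two short tasks are verified and stored, while the task that cannot be completed within the round budget is never committed, so the library contains only skills that passed the check.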

VOYAGER's skill library is a more structured version of the principle explored in "Can agents learn from failure without updating their weights?": externalize learning as retrievable artifacts. Because retrieval is indexed by embedding, skills are found by semantic similarity rather than exact match, which enables transfer to novel but related situations.

In light of "Can communication pressure drive agents to learn shared abstractions?", the skill library pattern may generalize: agents under performance pressure may naturally develop reusable, composable abstractions.

MUSE-Autoskill generalizes VOYAGER's compounding library into an explicit five-stage skill lifecycle (creation, memory, management, evaluation, refinement), turning skills from disposable generation outputs into "long-lived, experience-aware, testable assets." Two extensions matter for the catastrophic-forgetting claim. First, skills are validated through unit tests plus runtime feedback, so the library does not just grow but is continuously checked for reliability. This addresses a gap in VOYAGER, which stores any successfully executed program regardless of later robustness. Second, MUSE adds skill-level memory that accumulates per-skill experience across tasks, so reuse improves over time rather than staying static after first synthesis. On SkillsBench, generated skills reach 87.94% on their tasks and transfer to other agents with minimal accuracy loss, which suggests that lifecycle management, not just synthesis, is what makes externalized skills durable infrastructure.
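The two extensions, test-gated admission and per-skill memory, can be illustrated with a small sketch. This is not MUSE-Autoskill's API: the `Skill` and `ManagedLibrary` classes, their fields, and the integer-valued toy skills are all hypothetical, chosen only to make the lifecycle idea concrete.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Skill:
    name: str
    fn: Callable[[int], int]
    tests: List[Tuple[int, int]]  # (input, expected output) pairs
    experience: List[str] = field(default_factory=list)  # per-skill memory

    def passes_tests(self) -> bool:
        """Evaluation stage: every unit test must pass."""
        return all(self.fn(x) == expected for x, expected in self.tests)


class ManagedLibrary:
    """Admits a skill only if its tests pass; logs runtime outcomes per skill."""

    def __init__(self):
        self.skills = {}

    def register(self, skill: Skill) -> bool:
        if not skill.passes_tests():
            return False
        self.skills[skill.name] = skill
        return True

    def run(self, name: str, x: int) -> int:
        skill = self.skills[name]
        try:
            result = skill.fn(x)
            skill.experience.append(f"ok: input={x}")
            return result
        except Exception as exc:
            # Record the failure so a later refinement step can use it.
            skill.experience.append(f"failed: input={x} ({type(exc).__name__})")
            raise


lib = ManagedLibrary()
accepted = lib.register(Skill("double", lambda x: 2 * x, tests=[(1, 2), (3, 6)]))
rejected = lib.register(Skill("broken_double", lambda x: x + 2, tests=[(1, 2), (3, 6)]))
value = lib.run("double", 5)
```

The contrast with a store-on-first-success library is that `broken_double` never enters the library, and each admitted skill carries its own usage history that a refinement step could consult.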

Related concepts in this collection 10

Can agents learn from failure without updating their weights? Explores whether language models can improve through trial and error by storing reflections in episodic memory rather than fine-tuning. This matters because it suggests a fundamentally different path to agent adaptation.
related architecture: episodic memory as external learning
Can communication pressure drive agents to learn shared abstractions? Under what conditions do AI agents develop compact, efficient shared languages? This explores whether cooperative task pressure—rather than explicit optimization—naturally drives abstraction formation, mirroring human collaborative communication.
same pattern: reusable abstractions under optimization pressure
When should proactive agents push toward their goals versus accommodate users? Proactive dialogue agents face a tension between reaching their objectives efficiently and keeping users satisfied. This question explores whether these two aims can coexist or require constant negotiation.
parallel requirements for autonomous goal setting
Does self-generated training data improve model learning? Can models learn more effectively from training data they generate themselves rather than data created by external sources? This explores whether a learner's own restructuring process produces better learning outcomes.
SEAL: model-specific data as capability building blocks
Can agents learn continuously from experience without updating weights? This explores whether LLM agents can adapt to new tasks and failures by retrieving past experiences from memory alone, rather than requiring expensive parameter fine-tuning or rigid hardcoded rules.
AgentFly composes cases where VOYAGER composes skills; both achieve continual learning without parameter updates, but AgentFly adds a Q-function for principled case retrieval beyond static similarity
Can neural networks learn compositional skills without symbolic mechanisms? Do neural networks need explicit symbolic architecture to compose learned concepts, or can scaling alone enable compositional generalization? This asks whether compositionality is an architectural feature or an emergent property of scale.
VOYAGER's skill library implements compositional generalization externally: complex skills are synthesized from simpler skill programs, achieving the linear-scaling efficiency the MLP proof demonstrates; the embedding-indexed retrieval ensures the training distribution covers the compositional space
Can we teach LLMs to form linguistic conventions in context? Humans naturally shorten references as conversations progress, but LLMs don't adapt their language for efficiency even when they understand their partners do. Can training on coreference patterns teach this convention-forming behavior?
both VOYAGER and convention formation involve agents developing compact reusable abstractions through interaction: skills are behavioral conventions for task completion, and linguistic conventions are communicative skills for efficient reference; the shared mechanism is that repeated interaction under performance pressure drives abstraction
What makes agent-created code artifacts so hard to manage? Agent-authored code that persists and is shared across systems raises difficult questions about what should be kept versus discarded, and how to maintain consistent state when multiple agents collaborate on the same artifacts.
exemplifies: a compounding skill library is a concrete case of persistent agent-authored artifacts the frontier asks about
Does creating skills inside the agent loop eliminate mismatches? Can coupling skill creation directly to the runtime reasoning loop—rather than authoring skills offline—close the gap between when skills are made and when they're used? This matters for whether agents can ground new capabilities in their actual situated context.
extends: VOYAGER builds the library by synthesis; MUSE specifies that creation happens in-loop where consumed, preventing creation-usage mismatch
Can frozen models learn better by extracting context into skills? When a model encounters unfamiliar material in its context, can we help it reason more effectively by explicitly extracting rules and procedures from that material rather than changing the model itself?
grounds the accumulating store in a primitive: single-context skill extraction is the unit the compositional library scales and compounds
