SYNTHESIS NOTE
Model Architecture and Internals

Can neural networks actually achieve compositional generalization?

For decades, theorists argued connectionist models fundamentally lack the structure needed for compositionality. But modern LLMs exhibit sophisticated compositional behaviors despite sharing the same design principles. What changed?

Synthesis note · 2026-06-03 · sourced from MechInterp

Compositionality — composing arbitrary concepts into novel combinations for open-ended expressive capacity — has long been held up as a property of human intelligence that neural networks cannot explain, an argument (Fodor & Pylyshyn) that led many to dismiss them as models of cognition. This review traces the debate from Frege through ChatGPT and presses the tension: modern DNNs, sharing the same fundamental design principles as their dismissed predecessors, now dominate AI and exhibit behaviors thought to require compositional processing — syntactically complex error-free sentences, cogent chains of reasoning, original programs. The classical confidence that connectionism lacks the constituent structure for compositionality sits uneasily with this empirical record.

The keeper is the philosophical reframing: the question is no longer "can neural nets be compositional?" but why models without explicit symbolic constituent structure nonetheless produce compositional behavior — and what that implies for the symbolist/connectionist divide and for theories of human cognition.

This is a theory-anchor for Adrian's philosophy-of-mind thread. It complements the empirical Can neural networks learn compositional skills without symbolic mechanisms? (the mechanism: scaling + training coverage) and the limit case in Why do neural networks fail at compositional generalization? (where compositionality still fails), framing the debate the vault's What happens to social order when AI removes ritual constraints? map tracks.

Inquiring lines that use this note as a source 5

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
13 direct connections · 105 in 2-hop network ·medium cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

modern neural networks challenge the classical argument that connectionist models cannot achieve compositional generalization