Do language models really understand meaning or just surface frequency?

Explores whether LLMs comprehend semantic meaning independently of textual frequency, or whether high-frequency paraphrases systematically outperform rare ones even when meaning is identical across math, translation, and reasoning tasks.

Synthesis note · 2026-05-02 · sourced from Natural Language Inference

Adam's Law (TFL) generalizes a previously local finding into a global property of LLM computation. The earlier NLI work showed predicates in entailment hypotheses skew higher-frequency than premises, and that fine-tuning amplifies rather than dilutes this bias — see Does fine-tuning on NLI teach inference or amplify shortcuts?. Adam's Law extends this across four task families: math reasoning, machine translation across hundreds of language pairs, commonsense reasoning, and agentic tool calling. The constant: when meaning is held fixed and only surface form varies, the higher-frequency paraphrase outperforms the lower-frequency one.

The mechanism is straightforward but uncomfortable. Higher-frequency text occurred more often during pre-training, so it sits in a denser, better-modeled region of the distribution. The model's "comprehension" is therefore not meaning-recognition first and surface-decoding second — it is statistical-mass recognition first, with meaning emerging downstream of that recognition. This converges with Can models pass tests while missing the actual grammar?: correct outputs do not certify that meaning is what the model is tracking.

The pattern matters because paraphrase invariance is a load-bearing assumption almost everywhere LLMs are deployed. We assume the same prompt, said two ways, will yield the same answer. Adam's Law says no: it will yield the frequency-weighted answer, and the surface form is a covariate of accuracy, not a transparent vehicle for the request. This also shadows the output side. Do different AI models actually produce diverse outputs? documents convergence in what models say; Adam's Law documents the same convergence in how models comprehend what is said to them. Both endpoints of the prompt-response loop pull toward the corpus mean. Frequency is not noise around meaning. Frequency is a substantial fraction of what comprehension means inside a transformer.

Inquiring lines that use this note as a source 79

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

13 direct connections · 116 in 2-hop network ·medium cluster Open in graph ↗

Do language models really understand meaning or … Does fine-tuning on NLI teach inference or amplify… Can models pass tests while missing the actual gra… Do different AI models actually produce diverse ou…

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Do language models really understand meaning or just surface frequency?

Related concepts in this collection 3

Related papers in this collection 8

Search by related questions 4