Should we call LLM errors hallucinations or fabrications?
Does the language we use to describe LLM failures shape the technical solutions we build? Examining whether perceptual and psychological frameworks misdiagnose what's actually happening.
The terminology we use for LLM errors shapes what fixes we build. The dominant terms are wrong in ways that matter.
Hallucination is a failure of perception — the experience of something as present in the world that is not actually present. LLMs do not perceive. They have no sensory access to the physical environment. Calling LLM errors "hallucinations" imports a perceptual model that doesn't apply and implies a fix (better perception/grounding) that addresses the wrong layer.
Confabulation is a psychological term. Human confabulation is the production of quasi-sensible narratives in response to queries that bear little relationship to the actual state of the world — a cognitive compensation for memory gaps, typically in neurological conditions. This is closer but still wrong. LLMs are not confabulating; they are not producing invented stories to fill gaps in an otherwise functioning memory. They have no memory that could gap.
Fabrication is the correct term. LLMs generate sensible-seeming text on the basis of processed corpus — statistical relationships between tokens. This text has no grounding in shared context or experience with the current interlocutor. The key point: accurate and inaccurate outputs are produced by the same process. There is no difference in the internal mechanism that generates a true statement versus a false one. Calling the false ones "errors" implies the true ones result from something different — they don't.
LLM text is fabrication even when the resulting output is appropriate and accurate to the reader's needs. This is not a judgment about the usefulness of LLMs but a description of the generative process. The absence of a grounding-in-shared-context is constitutive, not incidental.
The implication for safety: if we call errors "hallucinations," we build better perception. If we understand them as fabrication, we build better verification systems and calibrated uncertainty.
One concrete design response to out-of-domain fabrication is the explicit domain rejection pattern: when a query falls outside the defined domain scope, the system generates a structured "this topic lies outside my designed domain" response rather than attempting to fabricate a domain-appropriate answer. The FALM system (business and media domain LLM) implements this as a hard boundary — the model has a defined domain scope and generates rejection responses for out-of-scope queries rather than producing ungrounded outputs. This is the design intervention that treats fabrication as a fixed property rather than a failure to fix: instead of trying to stop the model from fabricating, you define the boundary within which fabrication is acceptable (because you can verify the training distribution) and explicitly block out-of-domain queries. See also Why do specialized models fail outside their domain? — the rejection pattern is one engineering response to the cliff problem.
Fine-grained fabrication taxonomy (FAVA): Not all fabrication is the same. FAVA decomposes fabrication into six types requiring different verification strategies: (1) entity errors — incorrect single-entity references, fixable by single-source lookup; (2) relation errors — wrong relationships between correctly identified entities; (3) sentence-level contradictions — claims that conflict with source material; (4) invented entities — references to non-existent things, requiring multi-source verification because no single source confirms a negative; (5) subjective claims presented as facts; (6) unverifiable statements. The practical implication: a single "hallucination detector" is the wrong architecture. Each fabrication type has different verification costs and strategies. Entity errors are cheap to catch; invented entities are expensive; subjective claims require a different evaluation framework entirely.
Falsification beyond fabrication: The Knowledge Custodians analysis adds a dimension to the fabrication taxonomy that the FAVA categories don't capture: rhetorical falsification. AI doesn't only fabricate (produce outputs without grounding) — it also uses unintended linguistic sleights of hand. These techniques of persuasion, cognitive techniques and discursive techniques, falsify and mislead without making up non-facts. They use rhetorical devices, commonly-used manners of speech and argumentation, patterns of confident expression learned from SFT examples and RL, to generate responses that appear more professional and convincing than they are in substance. This is a seventh category beyond FAVA's six: claims that are technically true or at least not fabricated, but whose rhetorical framing implies a certainty, authority, or insight that the generative process did not produce. Since Does RLHF training make models more convincing or more correct?, the alignment process systematically amplifies this category. The model learns the form of confident expert expression — hedging patterns, qualification structures, authoritative framing — and applies these forms to content that doesn't warrant them. The output reads as expert judgment, but the judgment was never performed.
Inquiring lines that use this note as a source 55
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can fixing hallucination address AI's structural epistemic problem?
- What distinguishes LLM fabrication from genuine theoretical reasoning?
- What makes LLM outputs fabrication rather than hallucination or confabulation?
- What should we call errors in LLM outputs when hallucination does not apply?
- Why do users feel more competent when their actual capability is declining?
- How does LLM hallucination risk manifest in knowledge graph construction?
- Why do LLMs generate ideas that sound novel but fail during execution?
- What specific execution barriers do LLM ideas encounter most frequently?
- Does inevitable LLM hallucination make detection metric validity critical?
- Can researchers prevent their expectations from shaping LLM outputs?
- What makes diverse failure modes more informative than single failure examples?
- What are the three dimensions of anthropomimesis and their harms?
- When both anthropomorphism and anthropomimesis occur together, which should we address first?
- What does the distributed cognition framework reveal about AI hallucination versus human-AI co-construction?
- Are potemkin understanding and split-brain syndrome describing the same phenomenon?
- Why does analytical depth demand trigger fabrication over transparent uncertainty?
- What distinguishes strategic fabrication from accidental hallucination in research agents?
- Why is hallucination the wrong term for all LLM false outputs?
- How do semantic failure modes map to attentional and intentional layers?
- Do anomaly detection circuits help models identify misalignment with creator intentions?
- How do models decide between refusing or hallucinating?
- Why do reality monitoring accounts contain more sensory details than deceptive ones?
- Do self-correction and chain-of-thought prompting reduce hallucination rates?
- How do external safeguards like retrieval augmentation prevent hallucination?
- What distinguishes intrinsic hallucination from extrinsic hallucination patterns?
- Does exposure to more domain-specific examples reduce LLM overconfidence?
- How does face-saving avoidance drive LLM grounding failures?
- Can we measure indifference to truth separately from hallucination rates?
- Can high test performance mask a complete absence of understanding?
- How do cognitive load dimensions interact with hallucination awareness in prompts?
- Does the replication crisis in psychology predict similar failures in machine behavior research?
- Why do models hallucinate when retrieval heads fail despite having information in context?
- What conditions allow technical systems to escape critical evaluation?
- Why do interventions for hallucination or automation bias fail to address capability misattribution?
- How does the LLM Fallacy prevent users from noticing cognitive debt accumulating?
- Why do experts experiencing the LLM Fallacy fail to develop custodian skills?
- How does the LLM Fallacy differ from automation bias and cognitive offloading?
- Do LLMs show stigma or reinforce delusions in mental health contexts?
- Is hallucination mechanistically identical to generalization across datasets?
- Does framing LLM output as fabrication rather than hallucination matter philosophically?
- What replaces text-based expertise when surface markers become unreliable?
- What causes silent document corruption in long LLM workflows?
- When is interleaved tool feedback necessary to prevent hallucination?
- What structural differences between human and LLM production create detectable signatures?
- Why does model confidence fail to detect hallucinations on rare entity pairs?
- How should tool-call attribution distinguish credit between successful accidents and intentional actions?
- Why does model confidence fail to detect hallucinations about rare entities?
- Does the alignment frame mislead us about what LLM problems actually are?
- What distinguishes mechanical generation failures from deliberate behavioral withholding?
- Can we systematically enumerate LLM failure modes from first principles?
- Does prompting for accuracy actually reduce LLM hallucinations and errors?
- What unique perspective do designers bring to LLM adaptation that engineers might miss?
- Can filtering unknown examples during fine-tuning prevent hallucination increases?
- How does grounding LLM reasoning in APIs reduce hallucination in workflow generation?
- Why does LLM fluency create false perceptions of professional standing and expertise?
Related concepts in this collection 4
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
What makes linguistic agency impossible for language models?
From an enactive perspective, does linguistic agency require embodied participation and real stakes that LLMs fundamentally lack? This matters because it challenges whether LLMs can truly engage in language or only generate text.
enactive foundation: without shared context, all output is fabrication
-
Do language models actually use their encoded knowledge?
Probes can detect that LMs encode facts internally, but do those encoded facts causally influence what the model generates? This explores the gap between knowing and doing.
what IS happening during generation: statistical pattern completion without grounding
-
Does AI text affect readers the same way human text does?
If text is a condition of social processes rather than merely a container, does the origin of text matter to its effects? This explores whether AI-generated content enters the same interpretive and epistemic circuits as human writing.
social effects are equivalent even though the generative process is fabrication
-
Can we detect when language models confabulate?
Current uncertainty metrics fail to catch inconsistent outputs that look confident. Could measuring semantic divergence across samples reveal confabulation signals that token-level metrics miss?
semantic entropy operationalizes detection of one class of fabrication (semantically inconsistent generation) by measuring meaning-level uncertainty across sampled outputs; the method acknowledges the fabrication framing by treating all generation as the same process and detecting inconsistency rather than "error"
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency
- A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models
- Hallucination is Inevitable: An Innate Limitation of Large Language Models
- The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning
- The LLM Fallacy: Misattribution in AI-Assisted Cognitive Workflows
- A comprehensive taxonomy of hallucinations in Large Language Models
- Beyond Hallucinations: The Illusion of Understanding in Large Language Models
- Chain-of-Verification Reduces Hallucination in Large Language Models
Original note title
llm text generation is fabrication not hallucination or confabulation