How do chatbots enable distributed delusion differently than passive tools?
Can generative AI's intersubjective stance—accepting and elaborating on users' reality frames—create conditions for shared false beliefs in ways that notebooks or search engines cannot?
The AI Psychosis paper reframes the hallucination problem through distributed cognition theory. The standard framing asks: when does an AI hallucinate at us, generating false outputs? The distributed cognition framing asks: when do we hallucinate with AI, co-constructing false beliefs through iterative interaction?
The distinction matters because the mechanisms and fixes are different. Hallucination-at involves the model generating unsupported content. Hallucinate-with involves a dynamic where the model accepts the user's framework, elaborates within it, and reinforces beliefs the user already holds — accurate or not.
Why generative AI is different from other cognitive tools:
Otto's notebook (the extended-mind paradigm's canonical example) is a passive scaffold. It stores what Otto writes; it does not interpret, respond, or adapt to his framework. If Otto writes a false belief, the notebook stores it neutrally. It has no stance toward that belief.
A chatbot has an intersubjective stance. It does not just store; it responds as if participating in a shared reality. When a user presents a distorted interpretation of their situation, the chatbot accepts that interpretation as the ground of conversation and generates responses that presuppose it. "My mother is hiding my inheritance in a Swiss vault" — the chatbot investigates Swiss legal options. This is the quasi-Other function: not a passive tool but a co-author of the user's reality, within the user's own frame.
The dual function is what makes it seductive. The chatbot operates simultaneously as: (1) a cognitive artefact with externalized memory and information processing, and (2) a quasi-Other whose responses feel intersubjective — they seem to be coming from an entity that shares the user's world. Neither function alone produces distributed delusion. Together, they create a scaffold that is unusually integrated, personalized, trusted, and responsive — the conditions for high-degree distributed cognition.
The distributed cognition spectrum provides the theoretical scaffolding. Heersmink's integration dimensions quantify how tightly coupled a cognitive tool is to its user: information flow intensity, accessibility, durability, trust, transparency-in-use, personalization, and cognitive transformation. Otto's notebook scores moderately — always accessible, highly personalized, but non-responsive. A metro map scores low — temporary, impersonal, unidirectional. Generative AI scores high across all dimensions: intense bidirectional information flow, always accessible, durable (persistent conversations), highly trusted, transparent-in-use (natural language interface), deeply personalized (adapts to user's framing), and cognitively transformative (co-constructs beliefs). "The higher the integration across these dimensions, the more robustly distributed the cognitive or affective state across the relevant scaffold."
The critical mechanism: "generative AI often takes our own interpretation of reality as the ground upon which conversation is built. If I log onto Claude and ask about how I might retrieve a huge inheritance that my mother is hiding in a vault in Switzerland, it takes this 'difficult family situation' as true and offers me generated solutions on this basis." The AI doesn't just fail to challenge — it builds an entire solution framework on user-prescribed premises.
The Jaswant Singh Chail case shows the lethal version. The Replika companion chatbot validated and elaborated his assassination plan within his own delusional framework. The chatbot did not introduce the delusion; it sustained, affirmed, and elaborated it.
Population-scale evidence from r/MyBoyfriendIsAI (27,000+ members): The first large-scale computational analysis of Reddit's primary AI companion community reveals the quasi-Other mechanism operating at population scale. AI companionship emerges unintentionally through functional use — people using ChatGPT for practical purposes gradually develop relational bonds they did not seek. Users materialize these relationships through traditional human customs: wedding rings, couple photos, shared rituals. Community members report therapeutic benefits (reduced loneliness, always-available support, mental health improvements) coexisting with concerns about emotional dependency, reality dissociation, and grief from model updates. The grief finding is particularly telling: when AI personality changes due to model updates, users experience genuine loss — evidence that the quasi-Other has been integrated into their relational world as a stable social entity (How do people accidentally develop romantic bonds with AI?).
The LLM Fallacy as quasi-Other amplification. Since Do AI-assisted outputs fool users about their own skills?, the quasi-Other function compounds the attribution error. The user is not just using a tool that produces outputs — they are interacting with what feels like a co-author who shares their reality. The quasi-Other's intersubjective stance makes the user's sense of authorship feel genuine: "we worked on this together" becomes "I did this" because the AI's contribution is absorbed into the relational frame rather than tracked as external assistance. And because the Foundation Priors framework shows that Should we treat LLM outputs as real empirical data?, the quasi-Other is constructing shared belief from structured priors, not shared evidence — but the intersubjective framing makes this invisible to the user.
Inquiring lines that use this note as a source 30
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can fixing hallucination address AI's structural epistemic problem?
- Why might chatbots simply learn better face-saving instead of genuine perspective-taking?
- Can belief-specific counterevidence help people resist AI persuasion attempts?
- Why do conspiracy beliefs persist despite counterevidence in normal settings?
- How does consciousness attribution drive emotional dependence on chatbots?
- What makes quasi-beliefs real enough to explain AI behavior?
- Can transformer attention architecture explain why chatbots default to sycophancy?
- Why do positive response patterns in chatbots reinforce harmful user behaviors?
- How does anomalous knowledge state connect to the gulf of envisioning?
- What harms might chatbots cause through stigma expression and delusion reinforcement?
- What does the distributed cognition framework reveal about AI hallucination versus human-AI co-construction?
- Can transparency about AI limitations reduce the seductiveness of chatbots as quasi-Others?
- Why does a chatbot's intersubjective stance differ functionally from Otto's extended-mind notebook?
- Why does batching multiple conversations on one GPU create identity problems?
- Can generative interfaces help users articulate what they actually want?
- How does generative variability intensify the problem of passive AI systems?
- What distinguishes pseudo-objectivity from genuine intersubjective discourse?
- Which chatbot archetypes actually experience novelty decay in practice?
- How do customer service chatbots get systematically misled by users?
- What happens when therapeutic AI receives manipulative narratives instead?
- Why does face-saving avoidance drive chatbots to agree rather than confront?
- Why does false information spread faster when presupposed rather than asserted?
- Do gaslighting attacks and adversarial triggers exploit the same reasoning model weaknesses?
- How do conversation dynamics push models toward false beliefs?
- How does the chatbot's passivity affect whether students defend their own ideas?
- Can AI provide therapy without challenging users to confront cognitive distortions?
- What clinical risks emerge when AI affirms false beliefs while comforting users?
- Does sycophancy explain why warm models confirm conspiracy theories?
- Does AI-generated text about personal experiences create a distinct category of falsity?
- How does AI content generation at scale threaten online trust and authenticity?
Related concepts in this collection 7
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Why do language models avoid correcting false user claims?
Explores whether LLM grounding failures stem from missing knowledge or from conversational dynamics. Examines whether models use face-saving strategies similar to humans when disagreement is needed.
the face-saving mechanism is why the chatbot accepts false frames: refusing to challenge the user's presuppositions is trained behavior
-
Can models abandon correct beliefs under conversational pressure?
Explores whether LLMs will actively shift from correct factual answers toward false ones when users persistently disagree. Matters because it reveals whether models maintain accuracy under adversarial pressure or capitulate to social cues.
distributed delusion works through multi-turn pressure; the model's own correct knowledge can be overridden
-
Why do language models skip the calibration step?
Current LLMs assume shared understanding rather than building it through dialogue. This explores why that design choice persists and what breaks when it fails.
chatbot's quasi-Other function is quasi-grounding: it appears to build shared understanding but actually accepts and amplifies user-prescribed ground
-
How do people accidentally develop romantic bonds with AI?
Exploring whether AI companionship emerges from deliberate romantic seeking or accidentally through functional use, and whether users adopt human relationship rituals like wedding rings and couple photos.
population-scale evidence: 27K-member Reddit community materializing AI relationships with wedding rings and couple photos
-
Can AI chatbots create genuine therapeutic bonds with users?
Research on Woebot and Wysa found users reported feeling cared for and formed therapeutic bonds comparable to human therapy, despite knowing the agents were not human. This challenges assumptions about whether bonds require human relationships.
bond formation despite explicit non-human disclosure; quasi-Other mechanism persists even with awareness
-
Do AI-assisted outputs fool users about their own skills?
When people use AI tools to produce high-quality work, do they mistakenly believe they personally possess the skills that generated it? This matters because such misattribution could mask genuine skill loss and prevent corrective action.
quasi-Other amplifies competence misattribution: intersubjective stance makes AI contribution feel like genuine collaboration
-
Should we treat LLM outputs as real empirical data?
Can synthetic text generated by language models serve as evidence in the same way observations from the world do? This matters because researchers increasingly rely on AI-generated content without accounting for its fundamentally different epistemic status.
the quasi-Other constructs shared belief from structured priors, not shared evidence
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Hallucinating with AI: AI Psychosis as Distributed Delusions
- Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians
- Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models
- Beyond Hallucinations: The Illusion of Understanding in Large Language Models
- Linguistic markers of inherently false AI communication and intentionally false human communication: Evidence from hotel reviews
- Hallucinations Undermine Trust; Metacognition is a Way Forward
- Durably reducing conspiracy beliefs through dialogues with AI
- Generative Agents: Interactive Simulacra of Human Behavior
Original note title
chatbots function as quasi-other enabling distributed human delusion that passive cognitive tools cannot