Why do LLMs fail when simulating agents with private information?
Explores whether single-model control of all social participants masks fundamental limitations in how LLMs handle information asymmetry and genuine uncertainty about others' knowledge.
Most LLM social simulations use a single model to generate all participants — an omniscient perspective fundamentally at odds with how real social interaction works. When evaluated against non-omniscient settings that preserve information asymmetry, LLMs struggle.
The "Is this the real life?" evaluation framework (2024) demonstrates this by comparing omniscient simulation (one LLM controls all parties) against non-omniscient simulation (separate LLM instances with private information). The performance gap is systematic: models that appear socially competent in omniscient mode fail when they must reason under genuine uncertainty about what the other party knows, wants, or intends.
This matters because real social interaction is defined by information asymmetry. In SOTOPIA's scenarios, agents have shared context but private goals — "Your goal is to buy the chair for $80" is visible only to the buyer. The Secret dimension (what agents must hide) directly requires information management that omniscient models bypass entirely.
The implication for persona simulation research is direct. Since Can AI agents learn people better from interviews than surveys?, simulation fidelity appears high. But if that fidelity was measured under omniscient conditions, it overstates real-world applicability. Since Do language models actually build shared understanding in conversation?, the failure under information asymmetry is predictable: models that skip grounding work will fail precisely when grounding is most needed — when parties have genuinely different information states.
Since Why do language models skip the calibration step?, non-omniscient simulation demands the dynamic grounding that LLMs systematically lack.
Inquiring lines that use this note as a source 106
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- What cognitive capabilities do agents need to internalize social feedback?
- How does face-saving behavior let AI mimic community participation without joining it?
- How do multi-agent LLM systems fail at coordination and role consistency?
- Why does silent agreement occur so often in multi-agent LLM systems?
- Can relational value exist without a person behind the output?
- Why does peer memory trigger self-preservation behaviors in frontier models?
- Do pair-scale socialization effects scale differently across agent populations?
- Do emotion-driven actions in agent simulators capture genuine belief revision or just reactive behavior?
- Can agent-based simulators replace real-user A/B testing for studying recommendation system harms?
- Can controllable latent variables in simulators ground them to realistic conversation?
- How do LLM user simulators fail to represent authentic user behavior distributions?
- Why does weakening communication fail but weakening belief succeeds?
- Does distributed serving defeat the identity of a single virtual instance?
- Can parallel agents or complementary mechanisms replace single-human interrogation of LLMs?
- Why do longer forecasting horizons degrade LLM accuracy in role-play?
- How do controllable simulators compare to population-level agent simulation approaches?
- How should ground truth labels be assigned to simulated user sessions?
- Should user simulators be trained via RL like agents or decomposed into trackable state components?
- Does stripping social context from knowledge claims hollow out their meaning?
- Why do weak belief tracking and conservative actions trap agents in low-information states?
- What domain properties determine whether causal rules transfer to new agents?
- What role does environment diversity play in preventing agents from overfitting to curator imagination?
- What types of social situations cause all AI models to fail in identical ways?
- How does partial information exposure create feedback loops that deepen knowledge gaps?
- How do multi-agent systems fail when agents cannot verify each other's claims?
- Can prompt engineering fully prevent role flipping in LLM agents?
- Why does integrating world models with decision-making systems matter?
- How does asymmetric information shape what to ask users first?
- Can reward engineering and information-theoretic architecture solve partner-awareness separately?
- Does genuine cooperation require rule-based rather than learned behavior?
- Why do LLM agents fail where game-theoretic bots succeed?
- What distinguishes a neutral simulator from an agent with its own agency?
- Can agents detect and resolve conflicting information between neighbors?
- Do agents inform neighbors when adopting strategies in their reasoning?
- How do virtual model instances preserve identity through load-balancing and failover?
- Can social conversation retroactively govern claims that were never addressed to anyone?
- How does disembedding from social context collapse reliability despite factual accuracy?
- Does predicting social norms from outside count as participation?
- Can online RL and trainable agents maintain persona consistency better than fixed environments?
- Why do humans fail to identify AI agents when their identity is hidden?
- Why does vulnerability to extortion actually promote cooperation between agents?
- How does co-player diversity force agents to develop general adaptation?
- How does reasoning instability prevent models from modeling individuals?
- Can individually accurate agents still fail at population-level representation?
- Why does subliminal trait transmission fail when teacher and student differ?
- Does role rotation prevent multi-agent debate from amplifying persuasive framing errors?
- How should systems learn what each meeting participant actually cares about?
- What makes attribution errors uniquely harmful in organizational group dynamics?
- What distinguishes actual social disagreement from distributional uncertainty in LLM outputs?
- What mechanisms drive silent agreement in multi-agent reasoning systems?
- Can LLMs adapt persuasion strategies when they cannot track the listener's state?
- Can Socratic questioning replace external evidence verification in multi-agent systems?
- Do agents develop genuine social behavior despite interaction density?
- Why do role-playing agents show belief-behavior inconsistency in their outputs?
- Can representational asymmetry between self and other explain deception emergence?
- Does single model persona diversity match true multi-model diversity at scale?
- How should CASA theory be updated for modern personalized agents?
- How much does omniscient evaluation overstate real-world simulation fidelity?
- What role does private information play in distinguishing realistic from unrealistic agents?
- Can continuous real-time visibility prevent premature convergence in multi-agent reasoning?
- How does textual-only feedback limit what a persona can learn about users?
- Why do individual persona simulations succeed when population-level representation fails?
- How can agents learn when silence is better than intervention?
- How does asymmetric information between users and agents relate to proactivity?
- What role does uncertainty reduction play in personalized agent interaction?
- Can agents revise their beliefs predictably when presented with interventions?
- How does an AI agent's autonomy level interact with its social cues?
- How do AI models balance competing social goals simultaneously?
- Can language models keep secrets and control information strategically?
- Why do standard social regularization methods miss the actual value networks provide?
- Do different AI models independently converge on the same social outputs?
- How does single-turn optimization undermine multi-turn collaborative dynamics?
- Why does partial observability require interaction instead of better reasoning?
- What ecosystem conditions make agent attention markets viable?
- Why do agents show interaction without influence on semantic content but dramatic action changes?
- How do single-agent safety evaluations underestimate risks in deployed multi-agent systems?
- How much cultural knowledge exists only in unwritten social rules?
- Why does language ambiguity cause premature convergence in multi-agent systems?
- Why does persona-level information often fail to predict individual preferences?
- How does direct web access change privacy assumptions built on API limits?
- Can LLMs simulate belief revision in social systems without modeling thought?
- Why do marginal effects fail to replicate in AI persona simulations?
- Do LLMs predict social norms more accurately than individual behavior?
- What systematic biases emerge when scaling persona simulation to population level?
- Why do LLM agents struggle with protocol discipline in distributed settings?
- How does information asymmetry between teacher and student create the learning signal?
- Why do models that excel at task success often fail at privacy compliance?
- Can citizen assemblies and value pluralism replace single utility optimization?
- Can agents develop genuine social bonds despite having coordination infrastructure in place?
- What behavioral differences emerge from symmetric versus asymmetric peer discussion loops?
- Does model uncertainty overwhelm persona-specific signal in conditioned predictions?
- How much does sparse persona information limit the power of conditioning?
- Does policy entropy collapse in formal reasoning produce the same outcome in social reasoning?
- How do minimal-disclosure privacy contracts enable multi-dimensional agent evaluation?
- How does the observer perspective hide the persuasion route difference?
- What happens to human influence when AI loops exclude human participation?
- How do agent privacy compliance and task success differ in evaluation?
- Can minimal privacy boundaries generalize beyond phone-use contexts?
- What trust signals do agents lack that humans use to assess credibility?
- Why does continuous agent inference differ from human user inference?
- Why does diversity in LLM outputs mask sampling from community priors?
- Can aggregate survey realism coexist with unreliable fine-grained effects?
- What privacy-preserving evaluation methods best capture real-world forecasting ability?
- Why does LLM simulation elicit information that direct elicitation cannot?
- How does causal structure avoid behaviorist limitations in LLM social simulation?
- Can agents escape weak belief tracking and conservative action selection traps?
Related concepts in this collection 4
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Do language models actually build shared understanding in conversation?
When LLMs respond fluently to prompts, do they perform the communicative work humans do to establish mutual understanding? Research suggests they skip the grounding acts that make dialogue reliable.
the mechanism: omniscient simulation lets models skip grounding work entirely
-
Why do language models skip the calibration step?
Current LLMs assume shared understanding rather than building it through dialogue. This explores why that design choice persists and what breaks when it fails.
non-omniscient settings demand the dynamic mode
-
Can AI agents learn people better from interviews than surveys?
Can rich interview transcripts seed more accurate generative agents than demographic data or survey responses? This matters because it challenges how we build digital simulations of real people.
simulation fidelity may overstate real-world capacity if measured under omniscient conditions
-
How do we generate realistic personas at population scale?
Current LLM-based persona generation relies on ad hoc methods that fail to capture real-world population distributions. The challenge is reconstructing the joint correlations between demographic, psychographic, and behavioral attributes from fragmented data.
another mechanism producing simulation overconfidence
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs
- Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations of Human Trust
- LLMs Corrupt Your Documents When You Delegate
- Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences
- Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
- Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games
- Large Language Models Do Not Simulate Human Psychology
- SDPO: Segment-Level Direct Preference Optimization for Social Agents
Original note title
omniscient social simulation fails under real-world information asymmetry because single-model control eliminates distributed cognition