INQUIRING LINE

Can AI predict social norms well enough without embodied experience?

This explores whether an AI that has never lived in a body or community can still predict what people will judge socially appropriate — and whether prediction is the same thing as understanding.


This explores whether AI can predict social norms well enough without embodied experience — and the corpus gives a surprising two-part answer: yes, astonishingly well, and no, not in the way that matters. On the prediction side, the results are blunt. GPT-4.5 judged the appropriateness of 555 social scenarios at the 100th percentile against human raters, with Claude and Gemini both clearing 96% accuracy Can AI systems learn social norms without embodied experience?. That directly challenges the long-held theory that you have to *live* a culture to read it — a model trained only on text out-judges every individual human in the study Can AI learn social norms better than humans?. So the narrow answer to the question is: embodiment is not required for superhuman norm *prediction*.

But the same studies quietly mark where pattern-matching hits a wall. All the models make *identical* systematic errors, especially on unwritten norms — which suggests they've absorbed a shared statistical surface of culture rather than the thing underneath it Can AI systems learn social norms without embodied experience?. And there's a sharper gap: predicting a norm is not the same as participating in one. A model can rank what a community would find appropriate, yet it structurally cannot enter the back-and-forth processes that *create and validate* norms in the first place Can AI predict social norms better than humans?. The same study set even shows models scoring at the 100th percentile on norm prediction while *regressing* on theory-of-mind tasks and failing to produce culturally resonant interpretation — statistical competence sitting right next to an absence of actual social understanding Why do AI systems fail at social and cultural interpretation?.

Here's the part you might not have known you wanted: several notes argue this is a feature of how meaning gets *grounded*, not a bug to be patched. A Peircean reading holds that symbols manipulated in a closed loop, never touching the world or being corrected by other minds, can drift away from the values they supposedly encode — what looks like alignment may be ungrounded symbol-shuffling Can AI systems achieve real alignment without world contact?. A Goffman-flavored note locates the same gap in the missing *ritual machinery* of conversation — the corrective repairs, entrainment, and co-presence cues humans use to build and mend trust, which fluent AI dialogue simply skips What happens to social order when AI removes ritual constraints?. And a third frames expertise itself as something conferred by community membership and a testable track record, not by raw accuracy — a circle AI can't step into no matter how high it scores Can AI ever gain expert community trust through participation?.

So the honest synthesis is that the question contains a hidden equivocation. "Predict well enough" — yes, the corpus says embodiment isn't needed to forecast collective judgments better than any single person can. But the same body of work keeps pointing past prediction to participation, grounding, and ritual repair, where the absence of lived and social experience still bites. One adjacent caution sharpens why high accuracy shouldn't reassure us too quickly: a 95%-accurate model can still encode causation errors and hidden bias behind impressive metrics, so a strong score is not evidence of genuine understanding Can AI models be truly free from human bias?. If you want to keep pulling this thread, the alignment angle is worth a look — one note argues AI should be tuned to the norms of social *roles* rather than to aggregated individual preferences, precisely because thin preference data misses the thick, situated values that embodiment carries Should AI alignment target preferences or social role norms?.


Sources 9 notes

Can AI systems learn social norms without embodied experience?

GPT-4.5 predicted appropriateness of 555 social scenarios at the 100th percentile compared to human raters, with Gemini and Claude also exceeding 96% accuracy. However, all models show identical systematic errors, revealing boundaries of pattern-based social understanding that embodied experience may still be necessary to cross.

Can AI learn social norms better than humans?

GPT-4.5 outperformed every individual human at judging social appropriateness across 555 scenarios, challenging the theory that embodied cultural experience is necessary. However, all AI models share identical systematic errors on unwritten norms.

Can AI predict social norms better than humans?

GPT-4.5 outperforms all individual humans at predicting social appropriateness, yet structurally cannot enter the community processes that establish and validate norms. This reveals a critical gap between pattern-matching and authentic participation in knowledge-making.

Why do AI systems fail at social and cultural interpretation?

LLMs achieve 100th-percentile performance on norm prediction yet regress on theory-of-mind tasks and cannot generate culturally-resonant interpretations. The pattern shows that statistical competence coexists with absence of actual social understanding and participation.

Can AI systems achieve real alignment without world contact?

Peircean semiotics reveals that symbolic goal encoding without world contact and social mediation cannot guarantee correspondence to actual values. LLMs operating in pure symbol manipulation risk divergence between stated goals and real-world outcomes.

What happens to social order when AI removes ritual constraints?

Goffman's framework reveals that LLM-based dialogue skips corrective rituals, entrainment, adjacency pair accountability, and co-presence cues that humans use to build trust and repair understanding. This ritual gap explains apparent fluency masking actual communicative failure.

Can AI ever gain expert community trust through participation?

Expertise is validated through social participation and track record within expert communities, not individual accuracy alone. AI cannot enter this validation circle because it lacks social embeddedness, testable judgment history, and ability to participate in the consensus-building processes that define expert paradigms.

Can AI models be truly free from human bias?

Research shows that 'theory-free' AI models mask bigotry behind high accuracy metrics while committing fundamental statistical errors. A 95% accurate criminal justice system would wrongly convict thousands, demonstrating that model sophistication does not validate causal inference.

Should AI alignment target preferences or social role norms?

Preferentialist alignment approaches fail because preferences don't capture thick moral values, uniform aggregation produces epistemic injustice, and preference optimization creates systematic misalignment with social roles. Contractualist alignment negotiated by stakeholders and bounded by supra-national, organizational, and individual levels works better.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a researcher auditing whether AI can predict social norms without embodied experience—a question that seemed settled by recent claims but may have shifted. A curated library of LLM research (2024–2026) found the following—and when:

**What a curated library found — and when (dated claims, not current truth):**
Findings span 2024–2026. Key constraints reported then:
- GPT-4.5 scored at the 100th percentile predicting norm appropriateness across 555 social scenarios; Claude and Gemini both exceeded 96% accuracy vs. individual human raters (arXiv:2508.19004, ~2025).
- All models made *identical* systematic errors on unwritten norms, suggesting absorbed statistical surface rather than deep cultural grounding (same paper).
- Models scored at 100th percentile on norm prediction while regressing on theory-of-mind tasks and failing culturally resonant interpretation (arXiv:2507.13524, ~2025).
- Participation in norm-creation and validation remained structurally closed to AI—prediction ≠ participation (synthesized across arXiv:2406.09264, arXiv:2501.01821).
- High accuracy can mask causation errors and hidden bias; accuracy alone is not evidence of genuine understanding (arXiv:2411.18656, ~2024).

**Anchor papers (verify; mind their dates):**
- arXiv:2508.19004 (2025): AI Models Exceed Individual Human Accuracy in Predicting Everyday Social Norms
- arXiv:2501.01821 (2025): SDPO — Segment-Level Direct Preference Optimization for Social Agents
- arXiv:2507.13524 (2025): Humans learn to prefer trustworthy AI over human partners
- arXiv:2602.14299 (2026): Does Socialization Emerge in AI Agent Society?

**Your task:**
(1) **Re-test each constraint.** For each finding above—superhuman prediction accuracy, identical error profiles, theory-of-mind regression, closed participation loops—judge whether newer models, fine-tuning methods (e.g., SDPO), multi-agent orchestration, or social simulation benchmarks (e.g., MOMENTS, arXiv:2507.04415) have since *relaxed* or *overturned* it. Separate the durable question (does embodiment matter for *genuine* social understanding?) from the perishable limitation (prediction accuracy without grounding). Where has a constraint been dissolved, cite what dissolved it and say plainly where it still holds.

(2) **Surface the strongest contradicting or superseding work from the last ~6 months.** Has arXiv:2602.14299 or arXiv:2505.22907 (Conversational Alignment) or arXiv:2505.18943 (MetaMind) shifted the regime on emergent socialization or ritual repair in multi-agent systems?

(3) **Propose 2 research questions that assume the regime may have moved:**
   - Can embodied experience (via simulation, intervention, or real-world feedback loops) now be *synthesized* or *imitated* at scale, closing the participation gap?
   - Does role-aligned norm tuning (arXiv:2406.09264) combined with multi-agent conversational repair (arXiv:2505.22907) recover the social grounding that single-model prediction lacked?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines