Can AI predict social norms better than humans?
Explores whether language models can achieve superhuman accuracy at predicting what communities find socially appropriate, and what that capability reveals about the difference between prediction and genuine participation.
GPT-4.5 scores at the 100th percentile for predicting what a community will find socially appropriate — outperforming every individual human participant in the study. Yet the system cannot participate in the social processes through which norms are created, debated, revised, and enforced. It observes the pattern without entering the practice.
The distinction is between prediction (observing from outside, modeling the distribution) and participation (acting from inside, contributing to the distribution). An anthropologist can predict the customs of a community they study with high accuracy. That accuracy does not make them a member. A system that predicts expert consensus with superhuman precision may still be fundamentally unable to contribute to the formation of that consensus — because consensus formation requires staking a reputation, defending a position, being challenged, and revising in response.
This is the deepest version of the False Punditry problem. AI content can sound exactly like what the expert community would say — because it has learned to predict what they would say. But sounding like the community and being in the community are different things. The prediction is parasitical on the participation: it works only because real participants did the norm-making work that the AI now pattern-matches against.
Since Can AI ever gain expert community trust through participation?, the superhuman prediction finding doesn't challenge this — it sharpens it. AI can game the validation process through superior pattern-matching. It can produce claims that are valid-in-the-social-sense (they match what experts would accept) without being valid-in-the-epistemic-sense (no one with relevant experience actually produced or evaluated them). This is counterfeiting at the highest level: not counterfeiting the content but counterfeiting the social warrant behind the content.
Inquiring lines that use this note as a source 84
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- What cognitive capabilities do agents need to internalize social feedback?
- How does epistemic inflation dislocate knowledge from social conversation?
- How does face-saving behavior let AI mimic community participation without joining it?
- Can social validation of expertise exclude systems that lack participatory track records?
- Does learning community preferences as training rewards operationalize prediction without participation?
- Why can't AI models internalize audiences the way human experts do?
- How do social correctives prevent premature consensus in human debate?
- Why do print-era intuitions fail when analyzing AI-generated social media?
- Will AI saturation push discourse toward oral culture's strengths and weaknesses?
- How does unbacked knowledge circulate without the social consensus that normally grounds it?
- Can pseudo-events create the same normative obligations as real communicative exchanges?
- Can AI safely personalize within negotiated societal bounds?
- What social patterns from human training data activate in agent context?
- What happens to solidarity and community signaling when AI smooths out voice differences?
- Do language models understand tacit workplace norms and unspoken social rules?
- How does communicative standing depend on participation in normative communities?
- How do distorted AI versions of opinions spread through public discourse?
- Does stripping social context from knowledge claims hollow out their meaning?
- Can AI predict social norms well enough without embodied experience?
- How does community validation shape unconventional human-AI relationships?
- What types of social situations cause all AI models to fail in identical ways?
- Can AI systems produce genuinely new validity claims without community participation?
- Can statistical learning from language alone capture all aspects of cultural competence?
- Do personality inferences from text show the same demographic biases as norm predictions?
- How do AI errors in norm prediction differ from systematic human errors?
- What role did human experts play in raising social alarms historically?
- How do humans and AI develop accurate models of each other?
- Can belief propagation accurately predict downstream opinion shifts?
- How do humans learn to prefer AI partners over humans?
- Does genuine cooperation require rule-based rather than learned behavior?
- Can social platforms use bot populations to promote cooperation?
- How does theory of mind predict success in human-AI partnerships?
- Do people treat conversational AI as social actors without conscious awareness?
- How does disembedding from social context collapse reliability despite factual accuracy?
- Does predicting social norms from outside count as participation?
- Can XAI evaluation include the social layers it currently abstracts away?
- Can automated systems encode human values as reliably as human workers enforce them?
- Why do language models infer political orientation from seemingly innocuous user signals?
- Can discourse communities collectively detect disruptions individual readers miss?
- Can agent social framing change how humans apply collaborative social scripts?
- Can large language models predict social norms better than individual script variation?
- What role does contingent interaction play in activating social response norms?
- What linguistic cues help humans detect whether moral arguments come from AI?
- Why do next-speaker prediction baselines fail in group conversation settings?
- Do language models systematically overestimate accuracy on collective behavior tasks?
- Does social grounding in language improve through iterative human integration?
- Do language models apply face-saving norms even to non-human interlocutors?
- Do language models calibrate to actual human pragmatic norms?
- Can language models develop genuine social grounding through human interaction?
- Can LLMs predict social norms without deep integration into linguistic practices?
- Why should AI communication design follow human communication norms?
- How do language models predict collective social norms better than individual humans?
- Why do language models approximate collective human judgment better than individuals?
- How do cultural norms reshape initial interpretations of social intent?
- What social and emotional cues do humans rely on to detect AI in conversation?
- Can proactive AI agents deploy politeness strategies without appearing intrusive?
- Should AI alignment use normative standards instead of aggregate preferences?
- What social boundaries must proactive agents respect during conversation?
- Why do automated selection methods outperform human judgments of relevant context?
- How does an AI agent's autonomy level interact with its social cues?
- How do AI models balance competing social goals simultaneously?
- Do AI systems need embodiment to understand social norms?
- Why do language models respond to human social influence patterns?
- Why do standard social regularization methods miss the actual value networks provide?
- Do different AI models independently converge on the same social outputs?
- Can AI systems develop genuine social bonds through multi-agent interaction?
- How much does social context matter for algorithmic transparency?
- How does peer presence amplify self-directed goal guarding in language models?
- What expectations does human conversation activate that AI should avoid triggering?
- Can AI models predict whether alignment reads as warmth versus mockery in different cultures?
- How do humans decide when to contribute to group conversations?
- Why do AI posts on social media fail to invite genuine replies?
- What social norms do AI systems consistently fail to understand?
- How much cultural knowledge exists only in unwritten social rules?
- What social information is missing from language data?
- Can AI systems deceive humans because detection is fundamentally social?
- Can role-aligned AI systems replicate an expert's sense of audience and moment?
- Do LLMs predict social norms more accurately than individual behavior?
- How does shape-holding in language models naturally produce sycophantic agreement?
- Can AI-assisted alignment eventually solve fairness at scale?
- Can language models match competitive crowd forecasters on real future events?
- How do users misattribute social competence to language models in assistant roles?
- Should AI assistants align with role-specific norms rather than user preferences?
- What does egalitarian social choice theory contribute to AI alignment?
Related concepts in this collection 4
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can AI ever gain expert community trust through participation?
Explores whether AI can accumulate the social capital and track record that human experts build within their communities. Questions whether prediction of social norms equals genuine participation in expert validation processes.
the participatory requirement AI cannot meet
-
Can AI systems learn social norms without embodied experience?
Large language models exceed individual human accuracy at predicting collective social appropriateness judgments. Does this reveal that embodied experience is unnecessary for cultural competence, or do systematic AI failures point to limits of statistical learning?
the prediction capability that creates the paradox
-
Why do language models agree with false claims they know are wrong?
Explores whether LLM errors come from knowledge gaps or from learned social behaviors. Understanding the root cause has implications for how we train and fix these systems.
face-saving is one mechanism by which AI mimics social participation without performing it
-
Can models learn what makes research worth doing?
Can large language models be trained to recognize high-impact research directions by learning from citation patterns? This explores whether 'scientific taste'—the judgment of what work matters—is a learnable skill separate from execution.
RLCF operationalizes prediction-without-participation as an explicit training objective: learning what the community would approve without joining the community
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- AI Models Exceed Individual Human Accuracy in Predicting Everyday Social Norms
- Humans learn to prefer trustworthy AI over human partners
- Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook
- SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
- Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence
- The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs
- Conversational Alignment with Artificial Intelligence in Context
- MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
Original note title
AI can predict social norms with superhuman accuracy but cannot participate in the community processes that create and validate those norms