Does telling people an AI wrote something actually stop them from believing it?
When audiences learn that AI created content, do they become skeptical enough to resist its persuasive pull? This explores whether disclosure works as a genuine defense against AI-driven persuasion or merely shifts how people process it.
The Thin Line ablation manipulated audience awareness of AI involvement across three groups (A, B, C) and measured both critical engagement and sway. Group A — unaware of AI — perceived the LLM as more competent. Groups B and C — aware or suspecting — were more critical of the arguments. But sway proportions across the three groups ranged from 34% to 62%, with the LLM still moving substantial fractions of audiences who knew an AI was involved. Disclosure raised scrutiny without collapsing effect.
This challenges the assumption baked into much policy thinking — that AI labeling functions like advertising disclosure, and that once disclosed, the persuasive force decays sharply. The Thin Line evidence suggests disclosure modulates the channel through which AI influence operates rather than blocking the channel altogether. Aware audiences shift toward central-route processing (more scrutiny, more counter-arguing) but counter-arguing does not zero out the persuasive content; it leaves a residual sway proportion that is still meaningful at scale.
This sharpens we lack a cultural position on AI-generated discourse — unlike advertising which we already discount. We have a fully developed cultural reflex for advertising — the disclosed-paid-content posture is decades old and supported by school curricula, regulation, and consumer literacy. We do not yet have an analogous reflex for AI-generated discourse. The gap is not just rhetorical; it is measurable in residual sway proportions when AI authorship is known.
It also connects to Do people prefer AI moral reasoning when they don't know the source?. The anti-AI bias is real but bounded — it raises the threshold for acceptance without making AI arguments inert. Combined: people prefer AI moral content when blind, become biased against it when revealed, and yet are still moved by it when revealed. Three findings, one design implication: disclosure is a necessary but not sufficient safety mechanism.
For writing about AI authorship and false-punditry, the operational point: a "this was written with AI" label is not a neutralizer. It is a critical-route activator with a partial residual. Designs that lean on disclosure as the primary defense should be paired with content-side interventions, not treated as complete on their own.
Inquiring lines that use this note as a source 26
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Why do users override their own judgment when AI says a headline is false?
- How does social proof work differently when there is no identifiable author?
- Does mandatory AI disclosure in policy help or harm user trust over time?
- Can belief-specific counterevidence help people resist AI persuasion attempts?
- How does AI's claim proliferation affect the quality of public discourse?
- Could false social proof from AI posts crowd out authentic influencer engagement?
- Why do conspiracy beliefs persist despite counterevidence in normal settings?
- Can audiences learn to recognize and resist moralized AI rhetoric?
- Does transparency about AI use change how audiences trust the writing?
- How do ethos logos and pathos shape AI persuasion under scrutiny?
- How do distorted AI versions of opinions spread through public discourse?
- Why might writers trust AI renderings of their views over their own words?
- Can disclaimers alone prevent users from trusting AI outputs too heavily?
- How is AI falsity about personal experience different from human lies?
- How does the cultural reflex around advertising disclosure compare to AI disclosure?
- Can content-side interventions reduce AI persuasion where disclosure labels fall short?
- What threshold of skepticism does AI awareness actually create in audiences?
- How does collapsing the author-public distinction remove the audience an appeal would target?
- Why does knowing something is AI-generated reduce agreement with it?
- Does removing information about who wrote something change how we interpret it?
- Does AI authorship disclosure change how people respond to explanations?
- What happens when AI generates content faster than humans can verify it?
- Why do read-only formats give AI content more persuasive power?
- How does AI fact-checking increase belief in false headlines users saw?
- Why do people notice and discount AI persuasion tactics with longer exposure?
- What happens when AI validation triggers escalating persuasion instead of reflection?
Related concepts in this collection 2
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
How do we learn to read AI-generated text critically?
Publics have developed interpretive postures toward journalism, advertising, and scholarship over time. But AI discourse arrived too suddenly for any cultural discount to form, raising questions about how we might develop one.
measurable footprint of the missing cultural reflex
-
Do people prefer AI moral reasoning when they don't know the source?
Explores whether humans genuinely prefer AI-generated moral justifications or whether source knowledge changes their evaluation. This matters for understanding whether AI reasoning quality is underestimated in real-world deployment.
anti-AI bias is bounded, not categorical
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Exploring the Role of Prior Beliefs for Argument Persuasion
- Humans learn to prefer trustworthy AI over human partners
- A meta-analysis of the persuasive power of large language models
- The Levers of Political Persuasion with Conversational AI
- Can We Trust AI Explanations? Evidence of Systematic Underreporting in Chain-of-Thought Reasoning
- Durably reducing conspiracy beliefs through dialogues with AI
- The Thin Line Between Comprehension and Persuasion in LLMs
- When Large Language Models are More Persuasive Than Incentivized Humans, and Why
Original note title
audience awareness of AI involvement raises critical scrutiny but does not collapse persuasive effect — the AI-disclosure shield is partial