SYNTHESIS NOTE
Language, Text, and Discourse

Can human judges detect measurable differences in AI text?

Research shows LLM text differs statistically across six lexical dimensions, but human readers—even experts—cannot reliably identify which texts are AI-generated. Why does measurement succeed where human perception fails?

Synthesis note · 2026-02-21 · sourced from Discourses

The lexical diversity study compared ChatGPT-generated text with human writing across six dimensions:

  1. Volume: total word count
  2. Abundance: richness of vocabulary
  3. Variety-repetition: ratio of unique to total words
  4. Evenness: how evenly word occurrences are spread across the vocabulary
  5. Disparity: semantic distance between words used
  6. Dispersion: spread of vocabulary across text length
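
To make the dimensions above more concrete, here is a minimal sketch of how four of them could be approximated for a single text. The proxies are assumptions chosen for illustration (type count for abundance, type-token ratio for variety-repetition, normalized Shannon entropy for evenness); the note does not say which operationalizations the study used. The function name `lexical_profile` and the simple regex tokenizer are hypothetical. Disparity and dispersion are left out because they need word embeddings and positional information.

```python
import math
import re
from collections import Counter


def lexical_profile(text: str) -> dict:
    """Rough proxies for four lexical diversity dimensions (illustrative only)."""
    # Assumption: lowercase alphabetic tokens are a good-enough word unit here.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    n_tokens = len(tokens)
    n_types = len(counts)

    if n_tokens == 0:
        return {"volume": 0, "abundance": 0, "variety": 0.0, "evenness": 0.0}

    # Shannon entropy of the word-frequency distribution.
    entropy = -sum((c / n_tokens) * math.log(c / n_tokens) for c in counts.values())
    # Normalized entropy is 1.0 when every word type is used equally often.
    # Convention (an assumption): report 0.0 when there is only one word type.
    evenness = entropy / math.log(n_types) if n_types > 1 else 0.0

    return {
        "volume": n_tokens,              # total word count
        "abundance": n_types,            # number of distinct word types
        "variety": n_types / n_tokens,   # unique-to-total ratio
        "evenness": evenness,
    }


if __name__ == "__main__":
    print(lexical_profile("the cat sat on the mat"))
```

Raw type-token ratios shrink as texts get longer, so any comparison built on a sketch like this would need texts of similar length or a length-corrected measure.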

One-way MANOVAs confirm that LLM text differs significantly from human text on all six dimensions. The differences are statistically robust.

And yet human judges in multiple studies, including applied linguists and NLP researchers, cannot reliably distinguish AI-generated from human-written text. That finding is not new. What is new is pairing it with specific lexical diversity measurements: the differences are real and measurable, but they are not the kind that human perception picks up. Human judges are apparently not attending to lexical diversity patterns when making authorship judgments.

This paradox has implications in multiple directions:

Original note title

llm text differs measurably from human text on lexical diversity but human judges cannot detect the differences