What reader assumptions underlie anaphoric versus cataphoric discourse patterns?

This explores the implicit picture of the reader baked into backward-pointing (anaphoric) prose that recaps what was just said, versus forward-pointing (cataphoric) prose that sets up what's coming. The corpus has a direct anchor here: ChatGPT defaults to anaphoric organization — summarizing the ground already covered — while human student writers lean cataphoric, flagging arguments before they arrive Does ChatGPT organize text differently than human writers?. The interesting part isn't the stylistic difference; it's that each pattern encodes a different bet about who's reading and what they need.

Cataphora assumes a reader you are steering through time. To say 'I'll argue three things, and the third is the surprising one' is to model a reader whose understanding is still being built, who needs orientation toward a shared destination you both haven't reached yet. That's a forward-looking, jointly-constructed relationship. Anaphora assumes much less: a reader who mainly needs to be reminded of what was already established, so the text consolidates rather than projects. One treats meaning as something being assembled with the reader; the other treats it as something to be tidied up after the fact.

Why would a language model gravitate to the second? Two threads in the corpus converge. First, autoregressive generation is mechanically backward-looking: each token is a continuation of the tokens already present, so the natural gravity is toward summarizing and extending the existing context rather than committing to a not-yet-written future Does LLM generation explore competing claims while producing text?. Second, and deeper, the model can't actually hold the kind of reader-relationship cataphora presupposes. It treats the prompt as a fixed frame and never jointly updates the shared 'scoreboard' of a conversation — the user is the sole maintainer of common ground Can LLMs truly update shared conversational common ground?. Cataphoric writing is a promissory note to a reader whose evolving state you're tracking; if you can't track that state, you default to recapping the state you can see.

This lines up with a broader finding that the model misses the *communicative stakes* a real reader brings. It fails to adjust scalar implicature to context — it doesn't ask 'what does this listener need to infer here?' Can language models adapt implicature to conversational context? — which is the same competence cataphora demands: anticipating what a reader will need before they need it. It connects to the claim that we talk *at* language models rather than *to* them, because genuine address presupposes a partner capable of mutual orientation toward a shared future Are we really communicating with language models?.

The thing you might not have expected to learn: a dry-sounding distinction between two ways of ordering sentences turns out to be a tell. Forward-pointing structure quietly assumes a reader whose mind you are co-building toward something; backward-pointing structure assumes a reader you only ever catch up. Which pattern a writer reaches for reveals whether they think anyone is actually traveling with them — and that's exactly the capacity the corpus argues current models lack.

Sources 5 notes

Does ChatGPT organize text differently than human writers?

ChatGPT defaults to summarizing what was already said, while students use more forward-pointing structure that previews upcoming arguments. This reflects different reader models and may stem from how autoregressive generation works token by token.

Does LLM generation explore competing claims while producing text?

Token prediction trains models to continue toward the training distribution, not to explore logically related counterpositions. This smoothness in process produces smooth claims that multiply without generating new perspectives.

Can LLMs truly update shared conversational common ground?

LLMs interpret all subsequent conversational turns within a fixed initial prompt frame, preventing them from symmetrically proposing updates to shared assumptions. Even when users pivot topics or contradict earlier framings, the model cannot absorb revisions into jointly held background—making the user the sole maintainer of conversational scoreboard.

Can language models adapt implicature to conversational context?

ChatGPT shows no context-sensitivity in computing scalar implicatures across three dimensions: explicit literal-mode instructions, information structure focus, and face-threatening contexts. Humans flexibly modulate these inferences; the model does not, suggesting pragmatic competence requires tracking communicative stakes that LLMs systematically miss.

Are we really communicating with language models?

LLMs process tokens and generate continuations rather than receive and uptake communication. The preposition 'to' presupposes an addressee capable of mutual orientation and shared commitment that LLMs cannot provide, making Chalmers' investigation built on an unwarranted linguistic foundation.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a discourse analyst re-examining whether large language models' aversion to cataphoric (forward-pointing) structure persists—or whether recent scaling, architectural changes, or training innovations have shifted the regime.

What a curated library found — and when (dated claims, not current truth):
Findings span 2019–2026. A library notes:
• ChatGPT defaults to anaphoric (backward-recapping) text; human writers prefer cataphoric (forward-flagging) structure [[arXiv:2402.14848, 2024]].
• Models fail scalar implicature adaptation—cannot anticipate what a reader will need—which cataphora fundamentally demands [[arXiv:2210.14986, 2022]].
• Models cannot jointly update common ground with a reader; they treat prompts as fixed frames, blocking the mutual orientation cataphora presupposes [[arXiv:2505.22907, 2025]].
• Multi-turn conversation reveals persistent disorientation; models get lost tracking evolving context across turns [[arXiv:2505.06120, 2026]].
• Recent work flags intent mismatch in multi-turn settings as a root cause [[arXiv:2602.07338, 2026]].

Anchor papers (verify; mind their dates):
• arXiv:2210.14986 (Goldilocks implicature, 2022)
• arXiv:2505.22907 (Conversational alignment, 2025)
• arXiv:2602.07338 (Intent mismatch multi-turn, 2026)
• arXiv:2407.08790 (Human linguistic agency claim, 2024)

Your task:
(1) RE-TEST: For each constraint above—anaphoric default, failed implicature, broken common-ground tracking, multi-turn disorientation—determine whether o1, Claude 3.7, or other 2025–2026 models relax it via longer context windows, chain-of-thought scaffolding, conversational memory systems, or fine-tuning for pragmatic alignment. Separate the durable question (why do models struggle with reader-tracking *in principle*?) from the perishable limitation (current architecture X cannot do it). Cite what relaxed it.
(2) Surface work from the last ~6 months that contradicts or supersedes the library's claims that models are inherently anaphoric or reader-blind. Pay special attention to multi-agent orchestration, memory caching, and pragmatic instruction-tuning.
(3) Propose 2 research questions assuming the regime *has* shifted: e.g., "Do larger models develop *emergent* cataphoric tendencies at scale?" and "Can explicit reader-modeling prompts override the default anaphoric bias?"

Cite arXiv IDs; flag anything ungrounded.

What reader assumptions underlie anaphoric versus cataphoric discourse patterns?

Sources 5 notes

Next inquiring lines