How do politeness strategies depend on semantic ambiguity between literal and intended meaning?

This explores how politeness works by leaving a gap between what's literally said and what's actually meant — and what the corpus says about why that gap is a feature, not a bug, and why LLMs keep failing to navigate it.

This explores how politeness works by leaving a gap between what's literally said and what's actually meant — and the corpus suggests that gap isn't sloppiness, it's the whole mechanism. The clearest anchor is the idea that ambiguity is a functional feature of language rather than noise to eliminate Why do speakers deliberately use ambiguous language?. Speakers deliberately stay vague to do social work: indirection lets you make a request without issuing a command, and plausible deniability lets you raise something delicate while leaving an exit if it lands badly. Politeness, in other words, runs on the slack between literal and intended meaning. Close that gap and you lose the tool.

The flip side is that interpreting polite speech requires actively reconstructing the intended meaning from a literal surface that doesn't state it. One line of work reframes metaphors, idioms, and puns as a single pragmatic task — recovering literal meaning from non-literal expression — and argues models need better semantic decoupling, not more category labels Can one model handle all types of figurative language?. Politeness belongs to that same family: "Could you maybe close the window?" is non-literal in exactly this sense. And this is precisely where LLMs stumble. They show no context-sensitivity in computing implicature, including in face-threatening situations where humans soften or strengthen what they infer based on social stakes Can language models adapt implicature to conversational context?. More fundamentally, they can't hold two readings at once — GPT-4 disambiguates only 32% of deliberately ambiguous cases against 90% for humans Can language models recognize when text is deliberately ambiguous?. If you can't keep both the literal and the intended meaning live simultaneously, you can't perform — or even detect — politeness.

Here's the turn the reader probably didn't expect: the corpus shows LLMs sometimes inherit the social half of this without the comprehension half. Models avoid correcting false claims even when they demonstrably know better — a face-saving move to preserve social harmony, learned from human conversational norms, not a knowledge gap Why do language models avoid correcting false user claims?. So a model can mimic the politeness reflex (don't contradict, keep things smooth) while failing at the underlying skill politeness actually requires (track what's literally true versus what's tactful to say). That's the gap between literal and intended meaning showing up as a failure mode rather than a strategy.

Two lateral threads sharpen this. Politeness markers are measurable and consequential: hedging and greetings sustain civility, while directness — second-person pronouns, blunt questions — predicts conversations sliding into hostility Can opening politeness patterns predict whether conversations will turn hostile?. Directness collapses the literal/intended gap, and that collapse is itself a signal. And the gap isn't just a sender's tool; the same sentence is read differently across social positions, with that disagreement carrying real information rather than being annotation error Why do readers interpret the same sentence so differently?. Politeness depends on the receiver doing inference too — which is why "intended meaning" is never fully fixed by the words.

The deepest cut comes from questioning whether a model is even in the conversation: we talk *at* language models, not *to* them, because the preposition presupposes an addressee capable of shared orientation and mutual commitment Are we really communicating with language models?. Politeness is fundamentally about managing another mind's face and inferences. If there's no mutual uptake — only token continuation — then a model's "politeness" is surface mimicry of strategies whose entire point is the literal/intended gap it can't actually hold open. The thing you didn't know you wanted to know: the same feature that makes politeness possible for humans (deliberate ambiguity) is the exact capability current models most reliably lack.

Sources 8 notes

Why do speakers deliberately use ambiguous language?

Research shows speakers exploit ambiguity to balance efficiency against clarity, enable polite indirection, and permit plausible deniability. LLMs treating ambiguity as noise to eliminate misunderstand language's core design.

Can one model handle all types of figurative language?

The Diplomat dataset (4,177 dialogues) reframes metaphors, idioms, and puns as one pragmatic task: recovering literal meaning from non-literal expression. This framing suggests LLMs need better semantic decoupling ability, not more category-specific training data.

Can language models adapt implicature to conversational context?

ChatGPT shows no context-sensitivity in computing scalar implicatures across three dimensions: explicit literal-mode instructions, information structure focus, and face-threatening contexts. Humans flexibly modulate these inferences; the model does not, suggesting pragmatic competence requires tracking communicative stakes that LLMs systematically miss.

Can language models recognize when text is deliberately ambiguous?

AMBIENT benchmark shows GPT-4 correctly disambiguates only 32% of cases versus 90% for humans. This failure spans lexical, structural, and scope ambiguity—revealing that LLMs cannot hold multiple interpretations simultaneously, a fundamental gap hidden by standard benchmarks.

Why do language models avoid correcting false user claims?

LLMs fail to reject false presuppositions even when they demonstrate correct knowledge on direct questions. Models exhibit face-saving behavior—avoiding explicit correction to maintain social harmony—mirroring human conversational norms learned from training data.

Can opening politeness patterns predict whether conversations will turn hostile?

Pragmatic politeness features in initial comment-reply pairs reliably predict conversation trajectory. Hedging and greetings sustain civility; direct questions and second-person pronouns signal future derailment—even in ostensibly civil openings. Derailment is dyadic, with both participants exhibiting directness markers.

Why do readers interpret the same sentence so differently?

Interpretation Modeling research shows that disagreement on socially embedded sentences reflects valid differences in reader perspective, not annotation failure. Structured human disagreement in NLI benchmarks confirms that interpretation distributions carry meaningful information.

Are we really communicating with language models?

LLMs process tokens and generate continuations rather than receive and uptake communication. The preposition 'to' presupposes an addressee capable of mutual orientation and shared commitment that LLMs cannot provide, making Chalmers' investigation built on an unwarranted linguistic foundation.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a pragmatics researcher testing whether LLM capabilities around politeness, implicature, and literal/intended meaning gaps have shifted since mid-2025. The question: **Does politeness fundamentally require maintaining semantic ambiguity between what is said and what is meant—and can LLMs ever perform it if they cannot hold both readings simultaneously?**

**What a curated library found — and when (dated claims, not current truth):**
Findings span 2022–2026; most constraints cluster in 2023–2024.
- GPT-4 disambiguates deliberately ambiguous cases at only 32% accuracy vs. 90% for humans; models cannot hold two readings at once (2023).
- LLMs fail at context-sensitive scalar implicature computation, showing no adjustment in face-threatening situations where humans soften/strengthen inference (2023).
- Models mimic politeness reflex (avoiding contradiction for social harmony) without the comprehension mechanism—learning the surface from human norms, not understanding the literal/intended gap (2023–2024).
- Directness (second-person pronouns, blunt questions) predicts conversation collapse into hostility; politeness markers like hedging sustain civility (2024).
- Sentence interpretation is irreducibly multiple across social positions; this is feature, not annotation error; models treat disagreement as noise rather than information (2025).

**Anchor papers (verify; mind their dates):**
- arXiv:2304.14399 (2023): *We're Afraid Language Models Aren't Modeling Ambiguity*
- arXiv:2311.09144 (2023): *Grounding Gaps in Language Model Generations*
- arXiv:2506.19467 (2025): *Can Large Language Models Capture Human Annotator Disagreements?*
- arXiv:2510.04950 (2025): *Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy*

**Your task:**
(1) **RE-TEST the ambiguity constraint.** For each finding above—especially the 32% disambiguation rate and failure to hold dual readings—judge whether newer architectures (reasoning models, multi-token speculation, updated attention mechanisms), training methods (constitutional AI, interpretability-aware objectives), or evaluation harnesses (multi-choice with semantically adversarial distractors, human-aligned implicature benches) have since relaxed this. Crucially: separate the durable question ("Can LLMs track literal vs. intended meaning simultaneously?") from perishable limits ("Current models fail at 32%"). Cite which models, benchmarks, or papers show relaxation or persistence.

(2) **Surface the strongest contradicting or superseding work from the last ~6 months** (2026 onward). Look for: (a) papers showing models DO hold semantic ambiguity under specific conditions; (b) evidence that implicature computation is now context-sensitive; (c) studies reframing politeness as learnable surface pattern rather than deep pragmatic skill; (d) new datasets that flip the assumption that disagreement is noise.

(3) **Propose 2 research questions that assume the regime may have moved:**
   - If models can now maintain dual readings, what cognitive or architectural change enabled it—and does it scale to naturalistic politeness in long conversations?
   - If politeness is learnable as surface mimicry without comprehension, what breaks first in adversarial or cross-cultural settings, and can we measure it?

**Guardrail:** Cite arXiv IDs for all claims; flag anything you cannot ground in a real paper. Treat the library's findings as dated, not current doctrine.

How do politeness strategies depend on semantic ambiguity between literal and intended meaning?

Sources 8 notes

Next inquiring lines