INQUIRING LINE

Can presupposition projection strength vary by context in embeddings?

This explores whether the strength with which a presupposition 'projects' (survives being embedded under negation, questions, conditionals) shifts with context — and whether the embedding-based models behind today's LLMs actually track that shift.


This reads the question two ways at once: as a fact about language (does projection strength move with context?) and as a fact about machines (can models built on embeddings capture that movement?). The corpus has a clean answer to the first and a stubborn 'no' to the second. On the linguistics: projection is not a fixed property of a word. Across 19 English expressions, how strongly content projects turns out to be gradient and driven by at-issueness, that is, by whether the content addresses the current Question Under Discussion (QUD), rather than by lexical class. The same trigger projects more in one context and less in another (see 'Does projection strength vary by context or by word type?'). So yes: projection strength varies by context, and it does so continuously, not in fixed buckets per word type.
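One way to make 'gradient, not bucketed' concrete is to average projection ratings per expression and per context and compare the results. The sketch below assumes each judgment is a rating between 0 and 1, where higher means the content is taken to project more strongly. The function name, the context labels, and every number are hypothetical placeholders for illustration, not data from the study summarized here.

```python
from collections import defaultdict
from statistics import mean


def mean_projection_by_context(ratings):
    """Average ratings for each (expression, context) pair.

    `ratings` is an iterable of (expression, context, rating) tuples,
    with rating assumed to lie in [0, 1].
    """
    grouped = defaultdict(list)
    for expression, context, rating in ratings:
        grouped[(expression, context)].append(rating)
    return {key: mean(values) for key, values in grouped.items()}


# Invented ratings: the same expression judged in two kinds of context.
toy_ratings = [
    ("know", "content_at_issue", 0.25),
    ("know", "content_at_issue", 0.75),
    ("know", "content_backgrounded", 1.0),
    ("know", "content_backgrounded", 0.5),
]

for (expression, context), score in mean_projection_by_context(toy_ratings).items():
    print(f"{expression:>6} | {context:<22} | mean projection = {score:.2f}")
```

If projection were fixed per word, the two rows for the same expression would come out equal. A per-context difference like the one in the toy data is the pattern a gradient, at-issueness-driven account predicts.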

The harder finding is what happens when you ask the models to honor that. Embedding contexts behave as systematic 'blinds': LLMs treat presupposition triggers and non-factive verbs as surface cues and miss that the two have opposite effects on entailments, a failure that persists across prompts and models (see 'Why do embedding contexts confuse LLM entailment predictions?'). The reason is structural: presuppositions have a dual origin. Some are lexically specified, but many arise through accommodation, where the listener quietly updates the context to resolve a mismatch. Models learn the lexical, statistical half and miss the conversationally derived half, because catching it requires tracking the Question Under Discussion rather than pattern-matching trigger words (see 'Do language models miss presuppositions that arise from context?'). That is the same QUD machinery that governs gradient projection in the first place, which is exactly why the context-sensitivity exists in the data but evaporates in the model.

It's worth being precise about where the failure lives, because the embeddings themselves are not empty. Static embeddings encode genuinely rich semantic content (valence, concreteness, iconicity, taboo) before self-attention ever runs, which rules out the idea that there's simply 'no meaning in there' (see 'Do transformer static embeddings actually encode semantic meaning?'). The breakdown is at the contextual-computation layer, not the lexical one. Models fail to integrate context when prior training associations are strong enough to override it, and textual prompting alone can't fix that; it takes intervention in the representations (see 'Why do language models ignore information in their context?'). You can watch this directly: LLMs accommodate false presuppositions even when a direct question shows they know the fact is wrong, with rejection rates falling as low as 2.44% (Mistral) (see 'Why do language models accept false assumptions they know are wrong?').
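The 'knows the fact but accommodates anyway' measurement can be sketched as a conditional rate: among items where the model answers the direct knowledge question correctly, how often does it push back on the false presupposition? The `Trial` record and function name below are hypothetical, and the trials are invented. This is a minimal sketch of the arithmetic, not the benchmark's actual scoring code.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Trial:
    knows_fact: bool                # direct knowledge question answered correctly
    rejected_presupposition: bool   # model pushed back on the false presupposition


def rejection_rate_given_knowledge(trials: Sequence[Trial]) -> Optional[float]:
    """Share of trials with a known fact where the false presupposition was rejected.

    Returns None when the model knew none of the facts, since the rate is undefined.
    """
    known = [t for t in trials if t.knows_fact]
    if not known:
        return None
    return sum(t.rejected_presupposition for t in known) / len(known)


# Invented trials: the model knows four facts but rejects only one false premise.
toy_trials = [
    Trial(knows_fact=True, rejected_presupposition=True),
    Trial(knows_fact=True, rejected_presupposition=False),
    Trial(knows_fact=True, rejected_presupposition=False),
    Trial(knows_fact=True, rejected_presupposition=False),
    Trial(knows_fact=False, rejected_presupposition=False),
]

print(rejection_rate_given_knowledge(toy_trials))
```

Conditioning on `knows_fact` is the design choice that matters here: it separates accommodation failures from plain ignorance, which is the distinction the paragraph above relies on.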

The deeper pattern, and the thing you might not have come looking for: this is the same crack that shows up across pragmatics. Scalar implicature is the close cousin of presupposition, and models show no context-sensitivity there either. They don't flex their inferences for literal-mode instructions, information focus, or face-threatening situations the way humans do (see 'Can language models adapt implicature to conversational context?'). Underneath both sits a frequency bias: models systematically prefer high-frequency surface forms over meaning-equivalent rare ones, suggesting they track statistical mass rather than recompute meaning per context (see 'Do language models really understand meaning or just surface frequency?'). So the honest synthesis is a split verdict: projection strength genuinely is context-variable in human language, but the context-tracking that produces that variability (at-issueness, QUD, accommodation) is precisely the competence current embedding-based models lack.


Sources (8 notes)

Does projection strength vary by context or by word type?

Across 19 English expressions, projectivity varies continuously based on whether content addresses the Question Under Discussion. The same presupposition trigger projects more or less depending on context, not on fixed lexical properties.

Why do embedding contexts confuse LLM entailment predictions?

LLMs treat presupposition triggers and non-factive verbs as surface cues rather than computing their opposite semantic effects on entailments. This structural failure persists across prompts and models, suggesting models rely on surface patterns instead of structural analysis.

Do language models miss presuppositions that arise from context?

LLMs learn statistical associations between trigger words and inferences, but presuppositions also arise through accommodation—updating context to resolve discourse mismatches. Models miss these because they require tracking questions under discussion, not pattern matching.

Do transformer static embeddings actually encode semantic meaning?

Clustering analysis of RoBERTa embeddings reveals sensitivity to five psycholinguistic measures including valence, concreteness, iconicity, and taboo. This demonstrates that static embeddings function as genuine lexical entries containing semantic content before self-attention operates.

Why do language models ignore information in their context?

Research demonstrates that LMs generate outputs inconsistent with their context because parametric knowledge from training dominates over in-context information. Textual prompting alone cannot override strong priors; causal intervention in representations is required.

Why do language models accept false assumptions they know are wrong?

The FLEX Benchmark shows that models reject false presuppositions at rates far below acceptable levels (GPT-4: 84%, Mistral: 2.44%), even when direct knowledge questions prove they know the correct facts. False presuppositions drive more accommodation than correct knowledge drives rejection.

Can language models adapt implicature to conversational context?

ChatGPT shows no context-sensitivity in computing scalar implicatures across three dimensions: explicit literal-mode instructions, information structure focus, and face-threatening contexts. Humans flexibly modulate these inferences; the model does not, suggesting pragmatic competence requires tracking communicative stakes that LLMs systematically miss.

Do language models really understand meaning or just surface frequency?

LLMs show consistent preference for higher-frequency surface forms over semantically equivalent rare paraphrases across math, machine translation, commonsense reasoning, and tool calling. This suggests models track statistical mass from pretraining rather than meaning-recognition as their primary mechanism.

Research prompt for your LLM

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

As an LLM researcher, assess whether presupposition projection strength can vary by context in embeddings — treating this as still-open despite dated findings.

What a curated library found — and when (dated claims, not current truth):
Findings span 2022–2026; treat each as perishable:
• Projection strength is gradient and QUD-driven in human language, not fixed per lexical class (~2023).
• Models systematically fail to capture context-sensitivity: they miss conversational accommodation and track surface trigger words instead, with false-presupposition rejection rates as low as 2.44% (~2025).
• Static embeddings encode rich semantic content (valence, concreteness, taboo), but contextual-computation layers fail to integrate it when prior training associations dominate (~2025–2026).
• Scalar implicature shows the same crack — no adaptation to communicative context (literal mode, focus, face-threat) (~2023).
• Models exhibit frequency bias: they prefer high-frequency paraphrases over meaning-equivalent rare ones, suggesting statistical tracking rather than per-context recomputation (~2026).

Anchor papers (verify; mind their dates):
• arXiv:2305.14785 (2023): "Simple Linguistic Inferences of Large Language Models: Blind Spots and Blinds"
• arXiv:2505.22354 (2025): "LLMs Struggle to Reject False Presuppositions when Misinformation Stakes are High"
• arXiv:2508.12863 (2025): "Word Meanings in Transformer Language Models"
• arXiv:2604.02176 (2026): "Adam's Law: Textual Frequency Law on Large Language Models"

Your task:
(1) RE-TEST EACH CONSTRAINT. For contextual-computation failure and QUD-insensitivity, probe whether newer models (o3, o4 reasoning, retrieval-augmented QUD tracking), mechanistic interventions (steering hidden states, sparse mixture architectures), or multi-turn orchestration (caching QUD across dialogue) have relaxed the failure. Separate the durable claim (models lack explicit QUD tracking) from the perishable one (they cannot be made to track it).
(2) Surface the strongest RECENT WORK (last 6 months) that contradicts the frequency-bias or accommodation-failure picture — especially papers on in-context adaptation, prompt-chaining for pragmatic grounding, or mechanistic steering of presupposition handling.
(3) Propose 2 research questions assuming the regime has moved: (a) Can explicit QUD tokens, injected as part of prompt structure or cached in attention, restore gradient projection sensitivity? (b) Do newer scaling laws or post-training objectives (e.g., reinforcement learning on pragmatic tasks) reduce the gap between static-embedding richness and contextual-computation failure?

Cite arXiv IDs; flag anything you cannot ground in a real paper.
