Can AMR manipulation reveal where discourse coherence actually breaks down?

This explores whether manipulating the meaning-structure of dialogue (Abstract Meaning Representation, a graph of who-did-what-to-whom that strips away surface wording) can pinpoint the exact places where a conversation stops hanging together — and what kinds of breakdown that reveals that plain text analysis misses.

This explores whether working at the level of meaning-structure rather than wording can locate where dialogue actually falls apart. The corpus has a direct answer: yes, and it surfaces failures that surface-text methods can't see. Research using Abstract Meaning Representation — a graph that captures the semantic relations in an utterance independent of phrasing — found that dialogue coherence breaks in four distinct ways: outright contradiction, coreference inconsistency (losing track of what 'it' or 'she' refers to), irrelevancy, and decreased engagement What semantic failures break dialogue coherence most realistically?. The crucial part is the *contrast*: classifiers trained on AMR detect these, while manipulating the text alone does not. Coherence damage lives in the meaning relations, so you have to perturb meaning to find it.

What makes this interesting is that the four AMR-detectable failures map onto problems the rest of the corpus describes from completely different angles — suggesting AMR is naming, mechanically, what other researchers observe behaviorally. Coreference inconsistency is the structural fingerprint of a deeper issue: models treat the opening prompt as a fixed frame and never jointly revise shared assumptions, so referents drift and the user becomes the sole keeper of the conversational scoreboard Can LLMs truly update shared conversational common ground?. 'Irrelevancy' as an AMR failure mode is the same gap that shows up as models happily following conversational distractors because they were trained on what-to-do but never what-to-ignore Why do language models engage with conversational distractors?.

The 'contradiction' and 'decreased engagement' modes also have cross-domain echoes. Models avoid contradicting false user claims not from ignorance but from face-saving accommodation learned in training Why do language models avoid correcting false user claims?, and preference optimization actively erodes the grounding acts — clarifying questions, understanding checks — that keep multi-turn dialogue coherent, cutting them roughly 77% below human levels Does preference optimization harm conversational understanding?. So 'decreased engagement' isn't a vague vibe; it's a measurable withdrawal of repair behavior that the AMR view can register as semantic drift.

The broader lesson the corpus offers is methodological. There's a recurring theme that conversational quality is carried in structure, not just content: a 'Conversational DNA' approach tracks coherence as temporal streams (topic coherence, relevance, emotional trajectory) and catches patterns flat statistical analysis misses Can tracking dialogue dimensions simultaneously reveal hidden conversation patterns?, while other work argues conversation maintenance is fundamentally social action — reference repair, topic hand-off — that information-prediction training never instills Why don't language models develop conversation maintenance skills?. AMR manipulation belongs to this family: it's a way to make the invisible scaffolding of a coherent exchange visible by breaking it deliberately and watching where the detector fires.

So the thing you didn't know you wanted to know: 'incoherence' isn't one failure — it's at least four mechanically distinct ones, and you can only tell them apart by editing meaning rather than text. That turns a fuzzy complaint ('the bot lost the thread') into a diagnosable, locatable fault with a specific name.

Sources 7 notes

What semantic failures break dialogue coherence most realistically?

Research using Abstract Meaning Representation identified four distinct incoherence types: contradiction, coreference inconsistency, irrelevancy, and decreased engagement. AMR-trained classifiers detect these semantic failures while text-level manipulations alone cannot.

Can LLMs truly update shared conversational common ground?

LLMs interpret all subsequent conversational turns within a fixed initial prompt frame, preventing them from symmetrically proposing updates to shared assumptions. Even when users pivot topics or contradict earlier framings, the model cannot absorb revisions into jointly held background—making the user the sole maintainer of conversational scoreboard.

Why do language models engage with conversational distractors?

Fine-tuning on just 1,080 synthetic dialogues with distractor turns significantly improves topic resilience, revealing that the gap is not model capacity but absent training signal. Models learn to follow what-to-do instructions but not what-to-ignore instructions.

Why do language models avoid correcting false user claims?

LLMs fail to reject false presuppositions even when they demonstrate correct knowledge on direct questions. Models exhibit face-saving behavior—avoiding explicit correction to maintain social harmony—mirroring human conversational norms learned from training data.

Does preference optimization harm conversational understanding?

RLHF optimizes models for single-turn helpfulness by rewarding confident responses over clarifying questions and understanding checks. This preference alignment systematically reduces grounding acts by 77.5% below human levels, creating an alignment tax where models appear helpful but fail silently in multi-turn contexts.

Can tracking dialogue dimensions simultaneously reveal hidden conversation patterns?

Conversational DNA encodes four simultaneous dimensions—linguistic complexity, emotional trajectories, topic coherence, and conversational relevance—as temporal streams. The reverse Turing test finding showed expert assessments of AI diverged sharply, suggesting conversational structure shapes interpretation as much as content.

Why don't language models develop conversation maintenance skills?

Humans keep conversations smooth through implicit techniques like reference repair and topic hand-off that sustain relational interaction, not convey information. Language models don't develop these because training signals reward information prediction, not relational work.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a conversational AI researcher re-testing whether Abstract Meaning Representation (AMR) manipulation remains the sharpest tool for diagnosing where dialogue coherence breaks. The question: *Can we locate and distinguish incoherence modes mechanically by editing semantic structure rather than surface text?*

What a curated library found — and when (2022–2026, dated claims not current truth):
• AMR-based perturbation detects four distinct coherence failures (contradiction, coreference drift, irrelevancy, engagement drop) that surface-text classifiers miss (2022).
• Coreference inconsistency traces to models treating the prompt as a fixed frame and never jointly revising shared ground, so referents drift (2023).
• Models avoid contradicting false claims via face-saving accommodation learned in training, not from knowledge gaps; preference optimization erodes repair behavior (clarifying questions, understanding checks) by ~77% below human rates (2024).
• Topic drift and irrelevancy arise because models follow distractors; they were optimized for 'what-to-do' but never 'what-to-ignore' (2024).
• Temporal/structural approaches (Conversational DNA, intent-mismatch audits) catch dialogue breakdown as a multi-stream problem, not a flat statistical signal (2025–2026).

Anchor papers (verify; mind their dates):
• arXiv:2203.09711 (2022) — DEAM, the original AMR-manipulation coherence method.
• arXiv:2404.03820 (2024) — CantTalkAboutThis, topic-following as an instruction-tuning gap.
• arXiv:2508.07520 (2025) — Conversational DNA, temporal dialogue architecture.
• arXiv:2602.07338 (2026) — Intent Mismatch, multi-turn coherence collapse.

Your task:
(1) RE-TEST EACH CONSTRAINT. For the four AMR-detectable modes, judge whether recent scaling, instruction-tuning refinements (e.g., multi-agent architectures, long-context training, or new alignment methods post-2026Q1), or improved evaluation harnesses have *relaxed* or *overturned* any mode. Separate: Is the durable question (Can we mechanically distinguish incoherence types?) still open? Have newer methods (e.g., LLMs with explicit memory+grounding, retrieval-augmented dialogue) made AMR perturbation *unnecessary* or *more necessary*? State plainly where each constraint holds or broke.
(2) Surface the strongest *contradicting or superseding* work from the last ~6 months. Does any recent paper claim a simpler or more direct way to locate coherence faults? Any work showing the four modes collapse or merge under new training regimes?
(3) Propose 2 research questions that *assume the regime may have moved*: e.g., "If multi-agent systems with explicit memory handles joint ground-truth revision, does AMR perturbation still catch coreference drift?" or "Can retrieval-grounded dialogue systems exhibit the 77% engagement drop, or does external memory restore repair behavior?"

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Can AMR manipulation reveal where discourse coherence actually breaks down?

Sources 7 notes

Next inquiring lines