Can dialogue systems track both speakers' beliefs across turns?
Explores whether pragmatic reasoning frameworks can extend beyond single utterances to model how both conversation partners' understanding evolves. This matters because current dialogue systems lack principled ways to represent shared meaning-making.
The Rational Speech Act (RSA) framework models pragmatic reasoning as recursive social inference between speakers and listeners. But RSA has a fundamental limitation for dialogue: it handles single utterances, not evolving multi-turn conversations. CRSA fixes this by integrating a multi-turn gain function grounded in interactive rate-distortion theory.
The key extension: Both agents have private information. Each produces utterances conditioned on the full dialogue history. The gain function tracks evolving beliefs of both interlocutors — not just one listener inferring one speaker's intent, but bidirectional, progressive convergence of shared understanding.
Demonstrated on: referential games and template-based doctor-patient dialogues (disease diagnosis from symptoms). CRSA captures the progression from partial to shared understanding across turns.
A critical limitation acknowledged: there is no systematic way to model the meaning spaces, which are always application-dependent. And shifting from utterance-level to token-level reasoning (for scaling to real LLMs) may influence pragmatic capabilities — the reasoning granularity problem is unresolved.
This provides the mathematical framework that current LLM dialogue systems lack. Since the fluency gap — llm text is linguistically well-formed but communicatively empty because fluency substitutes for the grounding work that makes communication meaningful, CRSA offers a principled alternative: pragmatic reasoning grounded in information theory rather than next-token prediction. The question is whether token-level LLM generation can implement utterance-level pragmatic optimization.
Since Why do standard alignment methods ignore partner interventions?, CRSA's bidirectional belief tracking is the theoretical complement to the counterfactual invariance approach — one addresses it through reward engineering, the other through information-theoretic architecture.
Inquiring lines that use this note as a source 73
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Does the same uncertainty-driven logic appear in other conversation systems?
- Can dialogue systems abstain from responding when uncertainty is too high?
- Can AI arguments participate in discourse without temporal grounding?
- How does multi-turn conversation degrade AI intent alignment?
- How does lexical entrainment depend on selective frame-activation in conversation?
- What role does user contribution play in constituting the interlocutor?
- How does Stalnaker's common ground model apply to machine conversation?
- Why does dialogue-shaped text fail to produce dialogue-like operations in practice?
- Why do conversational pivots require explicit re-prompting instead of natural evolution?
- How do humans maintain separate mental contexts during a single conversation?
- How do dialogue dimensions predict explanation success across different exchanges?
- What dialogue dynamics distinguish negotiation from standard information-provision tasks?
- Which alignment dimensions matter most in educational conversation design?
- How does the superposition view change the folk-psychology interpretation of dialogue?
- Why does a chatbot's intersubjective stance differ functionally from Otto's extended-mind notebook?
- Why do dialogue systems fail to detect declarative clarification requests?
- What makes active reasoning through dialogue harder than passive reasoning?
- Does conversational back-and-forth increase persuasion more than single responses?
- What metrics actually measure disagreement in multi-turn conversations?
- Why do current conversational AI systems fail to develop shared vocabulary with users?
- How do conversational design patterns predict whether dialogue will derail?
- Can visual representation of dialogue reveal patterns that numbers and statistics cannot?
- Does conversational structure determine how humans interpret communication as much as content?
- How should dialogue state tracking change when user preferences shift mid-conversation?
- Can AI learn when to speak in a conversation?
- Why does static grounding prevent AI systems from supporting dialectical reconciliation?
- How do dialogue acts and explanation moves interact to predict understanding success?
- How do discourse structure and dialogue state management relate to each other?
- How do coreference chains preserve coherence across dialogue turns?
- How do dialogue coherence failures map onto the three discourse components?
- How do question acts and intents map to speech act theory?
- Can structured artifact sharing replace direct latent thought communication?
- Can conversation analysis predict when agents should ask users for clarification?
- Can AI systems recover from premature assumptions made early in multi-turn conversations?
- Can conversational AI achieve mutual understanding if trained only on text?
- Can language meaning emerge without joint attention and shared embodied interaction?
- Can real-time linguistic coordination tracking improve conversational AI quality?
- How does linguistic coordination build shared reference between conversational partners?
- Can topic planning and response generation reduce dialogue turns?
- What data would be needed to train proactive conversational systems?
- How does temporal event structure scaffold coherence in dialogue?
- Does chain-of-thought reasoning improve mental state tracking in dialogue?
- How does shared reference and grounding affect assumption detection in dialogue?
- How do users update their partner models during ongoing conversation?
- How do discourse relation types improve dialogue beyond sentence-level semantic matching?
- What distinguishes local coherence from global coherence in dialogue?
- Can multi-turn conversations manipulate language model reasoning in similar ways to personas?
- Does adding multiple interpretations to ambiguous situations respect language more than resolving them?
- Why do posters acknowledge multiple viewpoints without integrating them into coherent judgments?
- What role do first-person pronouns play in sustaining collaborative conversation tone?
- Can offline RL and pragmatic inference together improve dialogue agent reliability?
- Can AI systems deliberately align arguments to audience presuppositions?
- What role does accommodation play in making discourse coherent?
- What social boundaries must proactive agents respect during conversation?
- Why are task-oriented dialogue datasets systematically underrepresenting human proactive behavior?
- How can dialogue structure and trajectory predict social agent performance?
- How do conversation dynamics push models toward false beliefs?
- What specific repair mechanisms maintain intersubjectivity during conversation?
- Can discourse-level structure and conversational-level organization work together?
- Why do conversational systems benefit from post-thinking between user turns?
- Do dialogue systems need different retrieval strategies for opinions versus factual knowledge?
- Why do longer context windows alone fail to capture temporal dynamics in dialogue?
- How should AI systems model relationship evolution within a specific ongoing conversation history?
- Why does single-turn Q&A framing not match real user deployment patterns?
- How does local helpfulness per turn conflict with maintaining session-level conversational goals?
- How might dual-process dialogue use information gain to trigger clarification?
- How does unilateral interpretation differ from mutual communicative uptake?
- Can statistical token processing create the accountability needed for dialogue?
- Does conversational shape carry diagnostic meaning independent of what is discussed?
- Can structural conversation analysis replace text-based reward signals for AI alignment?
- How does multi-turn dialogue improve user satisfaction in search interactions?
- How do language models track multiple negotiating parties' commitments simultaneously?
- What structural updates prevent context collapse in evolving conversations?
Related concepts in this collection 2
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Why do standard alignment methods ignore partner interventions?
Standard RLHF and DPO optimize for token-level quality but may structurally prevent agents from meaningfully incorporating partner input. This explores whether the training objective itself blocks collaborative reasoning.
complementary approaches to the same problem: partner-awareness
-
Why do models fail at asking good questions during interaction?
When models must actively seek information through questions rather than receive it passively, they struggle dramatically. This explores why GPT-4o plateaus at 35% accuracy and whether training or prompting can fix the underlying deficit.
CRSA's belief tracking could explain WHY active reasoning fails: without tracking what information has been gained, questioning degenerates
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Collaborative Rational Speech Act: Pragmatic Reasoning for Multi-Turn Dialog
- Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation
- Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning
- DR-HAI: Argumentation-based Dialectical Reconciliation in Human-AI Interactions
- Collaborative Reasoner: Self-Improving Social Agents with Synthetic Conversations
- DialogueReason: Rule-Based RL Sparks Dialogue Reasoning in LLMs
- WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue
- The Thin Line Between Comprehension and Persuasion in LLMs
Original note title
collaborative rational speech acts extend pragmatic reasoning to multi-turn dialogue by modeling evolving beliefs of both interlocutors through rate-distortion theory