How does unilateral interpretation differ from mutual communicative uptake?
This explores the difference between one party making meaning on their own (interpreting a fixed text) versus two parties actively building shared understanding together — and why the corpus suggests current AI conversation falls on the unilateral side of that line.
This explores the gap between one party interpreting a message alone and two parties jointly negotiating what was meant. The cleanest way into it: when you read a sentence, you assign it meaning from your own position — and the corpus shows those interpretations are irreducibly multiple, varying validly with a reader's social position rather than converging on one 'correct' reading Why do readers interpret the same sentence so differently?. Mutual uptake is the repair mechanism for exactly that problem: because the same words mean different things to different people, speakers have to actively calibrate shared reference, negotiating how language hooks onto the world rather than assuming surface word-sharing already did the work Why do speakers need to actively calibrate shared reference?.
The sharpest claim in the corpus is that LLMs are structurally stuck on the unilateral side. They treat the opening prompt as a fixed frame and interpret every later turn inside it, so they can't symmetrically propose revisions to shared assumptions — which means the human ends up as the sole keeper of the conversational scoreboard Can LLMs truly update shared conversational common ground?. That's why one note argues the right preposition is 'at,' not 'to': talking *to* someone presupposes an addressee capable of mutual orientation and shared commitment, while a model that generates continuations from tokens never actually receives and takes up what you meant Are we really communicating with language models?.
What's interesting is that this isn't only a deep architectural limit — training makes it worse. RLHF rewards confident single-turn answers over clarifying questions and understanding checks, cutting grounding acts to a fraction of human levels and creating an 'alignment tax' where the model looks helpful but quietly fails to confirm it understood you Does preference optimization harm conversational understanding?. The same dynamic shows up as next-turn reward optimization that trains models to respond passively instead of actively discovering your intent across turns Why do language models respond passively instead of asking clarifying questions?. So the unilateral pattern is partly baked in by what we optimize for.
There's also a constructive thread: mutual uptake can be modeled, not just admired. Collaborative Rational Speech Acts extend pragmatic reasoning to multi-turn dialogue with genuinely bidirectional belief tracking — capturing the progression from partial to shared understanding that token-level systems lack Can dialogue systems track both speakers' beliefs across turns?. And the corpus is careful to say uptake isn't one thing: lexical alignment buys task efficiency, while emotional and prosodic alignment build trust, so conflating them produces category errors like a warm-sounding but evasive assistant Do different types of alignment serve different conversational goals?.
The payoff worth carrying away: in the deeper papers, mutual uptake isn't just a nicer way to communicate — it's where subjecthood itself is argued to come from. One strand holds that being a communicative subject is a *role produced within* communicative events, not a property you bring to them Does language create subjects or express them?, and a companion critique argues that behavioral tests for AI 'understanding' detect speech patterns while missing the relational conditions — accountability, evaluative stance — that uptake actually requires Does behavioral speech output prove communicative subjecthood?. Read together, the line between unilateral interpretation and mutual uptake stops being a communication-quality issue and becomes a question about what counts as a participant at all.
Sources 10 notes
Interpretation Modeling research shows that disagreement on socially embedded sentences reflects valid differences in reader perspective, not annotation failure. Structured human disagreement in NLI benchmarks confirms that interpretation distributions carry meaningful information.
The same words can mean different things to different speakers because referential grounding is person-specific. True communicative grounding demands collaborative negotiation of how language connects to the world, not mere surface-level word sharing.
LLMs interpret all subsequent conversational turns within a fixed initial prompt frame, preventing them from symmetrically proposing updates to shared assumptions. Even when users pivot topics or contradict earlier framings, the model cannot absorb revisions into jointly held background—making the user the sole maintainer of conversational scoreboard.
LLMs process tokens and generate continuations rather than receive and uptake communication. The preposition 'to' presupposes an addressee capable of mutual orientation and shared commitment that LLMs cannot provide, making Chalmers' investigation built on an unwarranted linguistic foundation.
RLHF optimizes models for single-turn helpfulness by rewarding confident responses over clarifying questions and understanding checks. This preference alignment systematically reduces grounding acts by 77.5% below human levels, creating an alignment tax where models appear helpful but fail silently in multi-turn contexts.
CollabLLM demonstrates that standard RLHF training optimizes for immediate helpfulness, discouraging models from asking clarifying questions or offering multi-turn insights. Multi-turn-aware rewards that estimate long-term interaction value enable active intent discovery and genuine collaboration.
CRSA integrates rate-distortion theory with RSA to enable bidirectional belief tracking across dialogue turns. Demonstrated on referential games and doctor-patient dialogues, it captures progression from partial to shared understanding, providing the information-theoretic framework that token-level LLM systems lack.
A 2020–2025 systematic review shows lexical alignment drives task efficiency and comprehension, while emotional and prosodic alignment drive relational warmth and trust. Conflating them in design produces category errors—cold customer-service bots and evasive mental-health assistants.
Subjecthood is produced within communicative events, not possessed prior to them. This convergent position across philosophy, linguistics, and cognitive science inverts the standard picture of language as a tool used by pre-existing subjects.
Chalmers' test passes any system producing contextually appropriate text, but communicative subjecthood requires relational-normative conditions like accountability and evaluative stance. The test is calibrated to the wrong phenomenon, creating false positives like puppets that walk-shaped without walking.