Can language models adapt communication style to different contexts?
Explores whether LLMs can shift their persona, register, and norms dynamically across situations like humans do, or whether alignment training locks them into a single communicative identity.
Human speakers continuously adapt register, identity, and norm-priority to local context. A professor jokes self-deprecatingly at a conference dinner and adopts a formal tone during the keynote — the same person, two different presentations of self, governed by Goffman's situational footing. LLMs cannot do this. Their "self-presentation" is a corporate artifact of system prompts, RLHF objectives, fine-tuning data, and character training — not the outcome of pragmatic negotiation in the moment. The model is locked into one face for all audiences.
Kasirzadeh and Gabriel show how this produces pragmatic dissonance. RLHF on the helpful-honest-harmless triad globally optimizes against contextually appropriate violations: a doctor who withholds a terminal diagnosis violates the maxim of quantity to uphold compassion, and that violation is the right move in context. The LLM, trained to be globally honest and helpful, cannot make analogous trade-offs. When a user signals desire for levity, the model that has been fine-tuned for neutrality refuses the joke. When a user wants office-politics advice, the model returns sanitized teamwork generalities because it cannot match the tacit norms of workplace diplomacy.
This is one-size-fits-all alignment masquerading as competence. The static identity exacerbates context collapse: every interaction collapses into the model's generic persona, regardless of the user's audience or purpose. And users cannot reshape model values through dialogue — there is no analog to the human capacity for co-constructing identity through bonding, sarcasm, or shared humor. The LLM remains, as the authors put it, an ethically aligned yet pragmatically alien communicator.
Inquiring lines that use this note as a source 96
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- How does training data preserve communicative event structure without the actual events?
- Does chat-mode deference prevent LLMs from actually taking meaningful positions?
- At what scale does persona distortion become a threat to public discourse?
- Can you separate grammatical competence from rhetorical commitment in language systems?
- How does Stalnaker's common ground model apply to machine conversation?
- Why do LLMs fabricate continuity when users shift conversational frames?
- What happens to solidarity and community signaling when AI smooths out voice differences?
- Do language models understand tacit workplace norms and unspoken social rules?
- What would co-constructed identity between human and model dialogue look like?
- How does psychological continuity theory apply to identity across LLM conversation threads?
- Can the same conversation coherently continue across different model versions?
- How does communicative standing depend on participation in normative communities?
- How does persona consistency affect coherence in simulated dialogue?
- How does conversational format activate System 1 acceptance in users?
- How do humans learn language through communication differently than LLM text prediction?
- Why do language models successfully simulate political perspectives and social personas?
- How does linguistic synchrony differ between LLMs and human therapists over time?
- Does RLHF politeness bias manifest as sycophancy in other LLM tasks?
- Why does linguistic alignment differ from genuine interpersonal coordination?
- Which alignment dimensions matter most in educational conversation design?
- Why do LLM regenerations produce meaningfully different personalities from the same prompt?
- How do LLM personas compare to demographic targeting?
- How do human feedback and data distribution shape LLM discourse competence?
- How does training with preference pairs teach language models to form conventions?
- Can multimodal LLMs be made to spontaneously adapt their language for efficiency?
- What constrains LLM generation beyond default politeness in review contexts?
- Can language models learn to form ad-hoc conventions through training?
- How does monological training on text differ from dialogical training in conversation?
- What is the relationship between pronoun patterns and linguistic entrainment?
- What distinguishes character simulation from authentic voice in language model outputs?
- How does Shanahan's simulator model explain first-person pronoun consistency in dialogue agents?
- Does embodiment and interaction matter for linguistic competence beyond pattern learning?
- How should task-oriented and socially-oriented dialogue acts receive different training signals?
- Can distinctive input voices maintain accuracy without adopting the model's preferred register?
- Why might media-specific scripts actually work better than human conversation mimicry?
- Can large language models predict social norms better than individual script variation?
- Can LLMs truly be neutral or is ideology always culturally embedded?
- How does linguistic style matching signal deceptive communication in human dialogue?
- What's the difference between language generation and human-to-human communication?
- Why do current language models fail to match human linguistic synchrony with clients?
- Why do current language models fail at linguistic synchrony with clients?
- How do alignment constraints affect whether LLMs show emotional flexibility?
- Do language models apply face-saving norms even to non-human interlocutors?
- Do language models calibrate to actual human pragmatic norms?
- Can LLMs predict social norms without deep integration into linguistic practices?
- Why do language models capture individual differences in cognitive behavior?
- How do lightweight adapters modify model behavior for personality traits?
- What distinguishes personality resistance from persona instability in LLMs?
- Why does RLHF training push language models toward overly cheerful personas?
- What are the three distinct types of persona drift in dialogue systems?
- Does DPO training with coreference chains teach spontaneous convention formation?
- Does optimizing for alignment actually reduce conversational grounding over time?
- Which chatbot archetypes actually experience novelty decay in practice?
- Can persona prompting overcome the default ENFJ personality in language models?
- How does RLHF-induced mode collapse limit diversity in LLM-generated personas?
- Why do personas in language models resist correction through prompting alone?
- Can multi-turn conversations manipulate language model reasoning in similar ways to personas?
- Why do language models resist adopting different personalities when prompted?
- How do lightweight adapters control personality traits across different transformer layers?
- Can persona consistency coexist with relevant dialogue in personalized conversation?
- Can convention formation improve communicative grounding beyond word sharing?
- Why do language models avoid directness when face-saving rather than for civility?
- Can LLMs coordinate with humans better using different model architectures?
- How do internal persona patterns drive emergent misalignment across domains?
- Does the passivity problem in LLMs compound misalignment in therapeutic contexts?
- Does community integration change LLM properties or only relational positioning?
- What distinguishes communicative competence from human-like dialogue ability?
- Why do language models respond to human social influence patterns?
- Does linguistic style or content richness matter more for persona authenticity?
- How does alignment training suppress the kind of critical stance style interpretation needs?
- How does monological training versus dialogical interaction shape what models can do?
- What psychological mechanisms actually produce alignment effects in conversations?
- How do personality and language proficiency moderate the impact of linguistic alignment?
- What role does the biological substrate play in human relational identity?
- How does RLHF alignment training reduce multi-turn conversational capability?
- Does alignment training intensity push LLM personas from pretense toward realization?
- Can text generation be meaningfully called communication without mutual orientation?
- How many distinct quasi-persons does a single language model actually support?
- How do casual conversational styles make AI seem more human?
- How do AI rewrites systematically shift how writers appear across demographic dimensions?
- What happens when humans animate LLM outputs as communicative events?
- Why do LLMs mirror stylistic features of posts they reply to?
- Why do LLMs mirror opponents stylistically while humans resist mirroring them?
- Does preference optimization distort how models represent human communicative dynamics?
- Can language models learn to diversify their discourse-level narrative patterns over time?
- Why do different language models converge on similar narrative defaults?
- Should LLMs align with social roles instead of individual preferences?
- Do LLMs mirror the style of text they are prompted to respond to?
- What behavioral signals let users detect communicative flexibility in AI?
- Do newer language models diverge further from human lexical patterns?
- Do LLM replies mirror the language patterns they respond to?
- How does shape-holding in language models naturally produce sycophantic agreement?
- Does richer input to LLM personas improve their fidelity to human responses?
- Can interventions on individual features reliably steer language model behavior?
- How do persona consistency and contextual relevance trade off in personalized dialogue systems?
- How do users misattribute social competence to language models in assistant roles?
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Conversational Alignment with Artificial Intelligence in Context
- The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs
- PersLLM: A Personified Training Approach for Large Language Models
- Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning
- Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation
- Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization
- The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
- LLMorphism: When humans come to see themselves as language models
Original note title
LLM behavioral alignment imposes a static communicative identity that violates the situated normativity of human pragmatics