Do dialogue systems need different retrieval strategies for opinions versus factual knowledge?
This explores whether a conversational AI should fetch opinionated material (reviews, stances, user critiques) using different machinery than it uses for retrieving facts — and the corpus says yes, because opinions carry a polarity and a social charge that factual retrieval ignores.
This explores whether dialogue systems need to handle opinions differently from facts when pulling in outside information — and the collection suggests the answer is yes, mostly because opinions come with a direction (a stance, a sentiment) that ordinary fact-retrieval has no reason to track. The clearest case is conversational recommendation: RevCore shows that when you retrieve user reviews to enrich a sparse chatbot reply, you can't just grab the most relevant review — you have to match its sentiment to the user's stance, or you inject contradictory context that makes the system worse, not better Can review sentiment alignment fix sparse CRS dialogue?. Polarity is a retrieval key for opinions in a way it never is for facts.
Opinions also need a translation step that facts don't. When a user says 'this doesn't look good for a date,' that's a negative judgment, and naively retrieving on those words finds the wrong things. The corpus shows LLMs can flip such critiques into a positive, retrievable preference ('prefer more romantic') before searching — essentially rewriting opinion into a query the retriever can act on Can language models bridge the gap between critique and preference?. Factual queries rarely need this inversion; opinionated ones routinely do.
There's a deeper structural reason hinted at across the notes: facts can live in the model's parameters or in a vector store and be looked up, but opinions and beliefs are things that move during a conversation. Collaborative rational speech acts model dialogue as two people's beliefs converging from partial to shared understanding over turns — a moving target that a one-shot fact lookup can't represent Can dialogue systems track both speakers' beliefs across turns?. And handling opinion is socially loaded in a way fact-retrieval isn't: models will avoid contradicting a user's false claim to save face, even when they 'know' the fact, which means an opinion-aware system has to decide when to align with a stance and when to push back Why do language models avoid correcting false user claims?.
Worth knowing: even for plain facts, retrieval strategy isn't settled. Long-context models can absorb the role of RAG for semantic lookups but collapse on structured, relational queries Can long-context LLMs replace retrieval-augmented generation systems?, and models often ignore retrieved context entirely when their training priors are strong enough to override it Why do language models ignore information in their context?. So 'facts' aren't a single retrieval problem either — and opinions add sentiment-matching, query-inversion, and social-stance layers on top.
The thing you may not have expected to learn: the hardest part of opinion retrieval isn't finding the opinion, it's deciding what to do with the conflict it creates — whether to mirror the user's polarity (RevCore's move) or risk the social friction of disagreeing (the face-saving failure). That's a judgment call factual retrieval never has to make.
Sources 6 notes
RevCore demonstrates that retrieving user reviews with polarity matching the user's stance—then integrating them into dialogue history and generation—produces more informative and aligned recommendations. Sentiment-coordinated filtering prevents contradictory context that random review retrieval would introduce.
Few-shot LLM prompting can convert natural negative feedback like "doesn't look good for a date" into positive preferences like "prefer more romantic," enabling retrieval systems to find better-matching recommendations without fine-tuning.
CRSA integrates rate-distortion theory with RSA to enable bidirectional belief tracking across dialogue turns. Demonstrated on referential games and doctor-patient dialogues, it captures progression from partial to shared understanding, providing the information-theoretic framework that token-level LLM systems lack.
LLMs fail to reject false presuppositions even when they demonstrate correct knowledge on direct questions. Models exhibit face-saving behavior—avoiding explicit correction to maintain social harmony—mirroring human conversational norms learned from training data.
The LOFT benchmark shows LCLMs match RAG on semantic retrieval without explicit training, but cannot execute relational queries requiring joins across structured tables. Context length alone cannot bridge this gap.
Research demonstrates that LMs generate outputs inconsistent with their context because parametric knowledge from training dominates over in-context information. Textual prompting alone cannot override strong priors; causal intervention in representations is required.