TOPIC

Personalized Recommenders

15 synthesis notes · 13 source papers
View as

Can aggregate reward models satisfy genuinely disagreeing users?

When users have conflicting preferences, do aggregate reward models face an impossible choice between satisfying majorities or sampling proportionally? What does this reveal about RLHF deployment?

Explore related Read →

Can generative AI scale personality-targeted political persuasion?

Does removing the human-writing bottleneck through generative AI make it feasible to target voters at scale based on individual psychological traits? This matters because it could reshape political microtargeting economics and capabilities.

Explore related Read →

Can bandit algorithms beat collaborative filtering for news?

News recommendation faces constant content churn and cold-start users—settings where traditional collaborative filtering struggles. Can a contextual bandit approach like LinUCB explicitly balance exploration and exploitation better than static methods?

Explore related Read →

Can retrieval enhancement fix explainable recommendations for sparse users?

When users have few historical interactions, embedded recommendation models struggle to generate personalized explanations. Can augmenting sparse histories with retrieved relevant reviews—selected by aspect—overcome this fundamental data limitation?

Explore related Read →

Can cross-user behavior reveal news relations that individual histories miss?

When a single user's reading history is too sparse for personalized recommendations, can patterns from many users' collective clicking behavior expose hidden connections between articles that no individual user alone could discover?

Explore related Read →

Why do LLMs generate polite reviews even when users hated products?

Large language models trained with RLHF develop a politeness bias that overrides negative sentiment in review generation. Understanding this bias and how to counteract it is crucial for creating accurate, user-aligned review systems.

Explore related Read →

Do prompt techniques work the same across all LLM tiers?

Do chain-of-thought and rephrasing prompts help or hurt recommendation tasks equally across cost-efficient and high-performance models? Understanding tier-dependent effects could optimize prompt selection.

Explore related Read →

Does LLM input augmentation beat direct LLM recommendation?

Can LLMs enrich item descriptions more effectively than making recommendations directly? This explores whether specialized models work better when LLMs focus on what they do best: content understanding rather than ranking.

Explore related Read →

Does preference data need more raters than examples?

Pairwise preference data violates the i.i.d. assumption because preferences vary across raters. Does this mean PAC bounds for reward models depend on rater diversity rather than just sample size?

Explore related Read →

What dominates AI compute in production systems today?

While public discussion centers on large language models, Facebook's infrastructure data reveals a different story about which AI workloads actually consume the most compute cycles in real production environments.

Explore related Read →

Does personalizing reward models amplify user echo chambers?

Personalized reward models solve the minority-preference problem but may introduce new risks by reinforcing existing user beliefs and narrowing exposure to diverse viewpoints.

Explore related Read →

Can users steer recommendations with natural language at inference?

Can recommendation systems let users specify their preferences in natural language at inference time without retraining? This matters because it would let new users and existing users dynamically adjust what they want to see.

Explore related Read →

Can one text encoder unify all recommendation tasks?

Does framing diverse recommendation problems—from sequential prediction to review generation—as natural language tasks allow a single model to learn shared structure? Can this approach generalize to unseen items and new task phrasings?

Explore related Read →

Can user history override an LLM's politeness bias in reviews?

LLMs trained on web text tend to be systematically polite, generating positive reviews even when users are dissatisfied. Can providing a user's prior reviews and ratings as context help the model generate authentically negative reviews that match the user's actual experience?

Explore related Read →

Can friends with different tastes improve recommendations?

Does incorporating social networks through friends' diverse preferences rather than similar tastes lead to better recommendations? This challenges conventional homophily-based approaches that assume friends like the same things.

Explore related Read →

Source papers 13

The Arxiv papers behind this sub-topic. Links may take you off-site to arxiv.org.