TOPIC

Personalized Recommenders

15 synthesis notes · 13 source papers

Can aggregate reward models satisfy genuinely disagreeing users?

When users have conflicting preferences, do aggregate reward models face an impossible choice between satisfying majorities and sampling proportionally? What does this reveal about RLHF deployment?

Can generative AI scale personality-targeted political persuasion?

Does removing the human-writing bottleneck through generative AI make it feasible to target voters at scale based on individual psychological traits? This matters because it could reshape political microtargeting economics and capabilities.

Can bandit algorithms beat collaborative filtering for news?

News recommendation faces constant content churn and cold-start users—settings where traditional collaborative filtering struggles. Can a contextual bandit approach like LinUCB explicitly balance exploration and exploitation better than static methods?

Can retrieval enhancement fix explainable recommendations for sparse users?

When users have few historical interactions, embedded recommendation models struggle to generate personalized explanations. Can augmenting sparse histories with retrieved relevant reviews—selected by aspect—overcome this fundamental data limitation?

Can cross-user behavior reveal news relations that individual histories miss?

When a single user's reading history is too sparse for personalized recommendations, can patterns from many users' collective clicking behavior expose hidden connections between articles that no individual user alone could discover?

Why do LLMs generate polite reviews even when users hated products?

Large language models trained with RLHF develop a politeness bias that overrides negative sentiment in review generation. Understanding this bias and how to counteract it is crucial for creating accurate, user-aligned review systems.

Do prompt techniques work the same across all LLM tiers?

Do chain-of-thought and rephrasing prompts help or hurt recommendation tasks equally across cost-efficient and high-performance models? Understanding tier-dependent effects could optimize prompt selection.

Does LLM input augmentation beat direct LLM recommendation?

Are LLMs more effective at enriching item descriptions than at making recommendations directly? This explores whether specialized models work better when LLMs focus on what they do best: content understanding rather than ranking.

Does preference data need more raters than examples?

Pairwise preference data violates the i.i.d. assumption because preferences vary across raters. Does this mean PAC bounds for reward models depend on rater diversity rather than just sample size?

What dominates AI compute in production systems today?

While public discussion centers on large language models, Facebook's infrastructure data reveals a different story about which AI workloads actually consume the most compute cycles in real production environments.

Does personalizing reward models amplify user echo chambers?

Personalized reward models solve the minority-preference problem but may introduce new risks by reinforcing existing user beliefs and narrowing exposure to diverse viewpoints.

Can users steer recommendations with natural language at inference?

Can recommendation systems let users specify their preferences in natural language at inference time without retraining? This matters because it would let both new and existing users dynamically adjust what they want to see.

Can one text encoder unify all recommendation tasks?

Does framing diverse recommendation problems—from sequential prediction to review generation—as natural language tasks allow a single model to learn shared structure? Can this approach generalize to unseen items and new task phrasings?

Can user history override an LLM's politeness bias in reviews?

LLMs trained on web text tend to be systematically polite, generating positive reviews even when users are dissatisfied. Can providing a user's prior reviews and ratings as context help the model generate authentically negative reviews that match the user's actual experience?

Can friends with different tastes improve recommendations?

Does using social networks to capture friends' diverse preferences, rather than assuming friends share similar tastes, lead to better recommendations? This challenges conventional homophily-based approaches that assume friends like the same things.

Source papers 13

The arXiv papers behind this sub-topic.

A Contextual-Bandit Approach to Personalized News Article Recommendation
“This paper addresses the challenge of identifying the most appropriate web-based content at the best time for individual users. Most service vendors acquire and maintain a large amount of content in …
A Personalized Recommender System based-on Knowledge Graph Embeddings
Abstract. Knowledge graphs have proven to be effective for modeling entities and their relationships through the use of ontologies. The recent emergence in interest for using knowledge graphs as a for…
A Probabilistic Model for Using Social Networks in Personalized Item Recommendation
“We first review previous research on using social networks to help recommend items to users. A crucial component of SPF is that it infers the influence that users have with each other. In previous wo…
Capturing Individual Human Preferences with Reward Features
Reinforcement learning from human feedback usually models preferences using a reward function that does not distinguish between people. We argue that this is unlikely to be a good design choice in con…
Explainable Recommendation with Personalized Review Retrieval and Aspect Learning
“Recent years have witnessed a growing interest in the development of explainable recommendation models [1, 2]. In general, there are three different kinds of frameworks for explainable recommendation…
Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations
“News Recommendation (NR) is the process of recommending news articles to users to satisfy their need for information by optimizing the accuracy of predicting relevance between news articles and users…
LLM-Rec: Personalized Recommendation via Prompting Large Language Models
The use of large language models in recommender systems has garnered significant attention in recent research. Numerous studies have explored the direct use of LLMs as recommender models. The underlyi…
Preference Discerning with LLM-Enhanced Generative Retrieval
Sequential recommendation systems aim to provide personalized recommendations for users based on their interaction history. To achieve this, they often incorporate auxiliary information, such as textu…
Recommendation as Language Processing (RLP): A Unified Pretrain, Personalized Prompt & Predict Paradigm (P5)
different recommendation tasks typically require designing task-specific architectures and training objectives. As a result, it is hard to transfer the learned knowledge and representations from one t…
Review-LLM: Harnessing Large Language Models for Personalized Review Generation
Product review generation is an important task in recommender systems, which could provide explanation and persuasiveness for the recommendation. Recently, Large Language Models (LLMs, e.g., ChatGPT) …
Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation
Large language models (LLMs) can perform recommendation tasks by taking prompts written in natural language as input. Compared to traditional methods such as collaborative filtering, LLM-based recomme…
The Architectural Implications of Facebook’s DNN-based Personalized Recommendation
“Deep learning has become a cornerstone in many production scale data center services. As web-based applications continue to expand globally, so does the amount of compute and storage resources devote…
The persuasive effects of political microtargeting in the age of generative artificial intelligence
The increasing availability of microtargeted advertising and the accessibility of generative artificial intelligence (AI) tools, such as ChatGPT, have raised concerns about the potential misuse of lar…