Why do similar user profiles produce worse personalization errors?

When personalization systems replace a user's profile with a similar one, why does performance drop most sharply with near-matches rather than dissimilar profiles? This explores the confidence-driven failure modes in persona-based recommendation systems.

Synthesis note · 2026-02-23 · sourced from Personalization

PRIME's controlled profile-replacement experiment reveals a counterintuitive U-shaped error curve in personalization fidelity:

Performance is highest with the target user's own profile (Self). When profiles are swapped, the drop is steepest with the most similar replacement user, then partially recovers as replacements become more dissimilar. The most similar user's profile produces worse outcomes than a mid-range or even dissimilar user's profile.

The mechanism is confidence-driven misdirection. PRIME learns fine-grained, user-specific preferences — effectively a dedicated bias toward which responses that specific user finds compelling. Two users with superficially similar posting histories may differ sharply in the specific responses they find persuasive. When a similar-but-not-identical profile is substituted, the model confidently applies the wrong preferences. A dissimilar profile, being obviously mismatched, triggers weaker and less harmful predictions because the model's learned biases don't activate as strongly.

This is an "uncanny valley" effect for persona similarity: nearly-right is more dangerous than clearly-wrong because the system cannot distinguish genuine user-specific preferences from similar-user preferences. The closer the profile approximation, the more confident — and more specifically wrong — the model becomes.

Data scarcity amplifies the effect. On the CMV (Change My View) forum, limited active users mean that two users with similar posting histories may have genuinely different persuasion profiles. The fine-grained preference learning that makes PRIME effective on correct profiles makes it maximally vulnerable to similar-but-wrong profiles.

This has direct design implications for persona-level personalization. Since How do personalization granularity levels trade precision against scalability?, persona-level approaches that group "similar" users together may systematically trigger this confidence-misdirection failure mode. The efficiency gain from persona grouping comes at the cost of confidently applying nearly-right-but-wrong preferences — potentially worse than no personalization at all.

The finding also connects to simulation fidelity. Since How do we generate realistic personas at population scale?, similar profiles may amplify rather than reduce systematic bias, because the simulation confidently generates behavior for the "wrong similar" user.

Inquiring lines that use this note as a source 20

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

12 direct connections · 74 in 2-hop network ·medium cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

How do personalization granularity levels trade precision against scalability? LLM personalization operates at user, persona, and global levels, each with different tradeoffs. Understanding these tradeoffs helps determine when to invest in individual user data versus broader patterns.
persona-level grouping may trigger the confidence-misdirection failure systematically
How do we generate realistic personas at population scale? Current LLM-based persona generation relies on ad hoc methods that fail to capture real-world population distributions. The challenge is reconstructing the joint correlations between demographic, psychographic, and behavioral attributes from fragmented data.
similar profiles may amplify systematic bias through confident misapplication
Why do LLM judges fail at predicting sparse user preferences? When LLMs judge user preferences based on limited persona information, what causes their predictions to become unreliable? Understanding persona sparsity's role in judgment failure could improve personalization systems.
complementary: persona sparsity produces unreliable predictions; persona similarity produces confidently wrong predictions

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

similar user profile replacement produces the worst personalization errors — the uncanny valley of persona similarity where confident application of nearly-right preferences is more misleading than random profiles

Why do similar user profiles produce worse personalization errors?

Related concepts in this collection 3

Related papers in this collection 8

Search by related questions 4