SYNTHESIS NOTE
Recommender Systems Psychology, Society, and Alignment Agentic Systems and Tool Use

Can LLM agents realistically simulate filter bubble effects in recommendations?

Can generative agents with emotion and memory modules faithfully reproduce how recommendation systems create echo chambers and user fatigue? This matters because real-world A/B testing is expensive and slow.

Synthesis note · 2026-05-03 · sourced from Recommenders LLMs
Why do multi-agent systems fail despite individual capability? How do recommendation feeds shape what people see and believe? What breaks when specialized AI models reach real users?

Studying recommendation system effects on user populations typically requires either real-user A/B tests (expensive, slow, ethics-bound) or simplistic simulators (lack realism). Agent4Rec proposes a middle ground: 1,000 LLM-empowered generative agents per scenario, each initialized from real-world datasets (MovieLens, Steam, Amazon-Book) to capture authentic tastes and social traits.

Each agent has three modules. The profile module is a repository of personalized social traits and historical preferences, aligning the agent's portrait with genuine human characteristics. The memory module logs factual memories (what was viewed), interaction memories (system interactions), and emotional memories (feelings, fatigue) — and supports emotion-driven reflection. The action module enables both taste-driven actions (view, ignore, rate, generate post-viewing feelings) and emotion-driven actions (exit the system, evaluate the recommendation list, comment).

This separation is the key contribution. Most user simulators model only taste-driven behavior — they evaluate items based on preference and click on the highest-scored ones. Agent4Rec models emotion-driven exits and reactions, enabling phenomena that taste-only simulators miss: filter bubble effects, user fatigue, emotional withdrawal from systems showing repetitive content. Researchers can study causal interventions — changing the recommender algorithm and observing effects on agent populations — without real-user studies.

The methodological claim is that LLM-empowered agents can faithfully simulate real autonomous human behavior in recommendation contexts to a useful degree. The empirical evaluation tests both alignment (do agents match real user-personalized preferences?) and deviation (where do they diverge?), then explores experiments like emulating filter bubbles and discovering causal relationships in recommendation tasks. The framework generalizes: any domain with rich behavioral data to initialize from can use this kind of agent simulation for counterfactual study.

Inquiring lines that use this note as a source 3

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 5

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
16 direct connections · 147 in 2-hop network ·dense cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

Agent4Rec simulates 1000 generative agents per recommendation scenario — emotion-driven actions emulate filter bubble effects