Can language models learn to model human decision making?
Explores whether LLMs finetuned on psychological experiments can capture how people actually make decisions better than theories designed specifically for that purpose.
The claim is surprisingly strong: large language models, after finetuning on data from psychological experiments, produce more accurate representations of human behavior than traditional cognitive models in two well-studied decision-making domains — decisions from descriptions (choosing between gambles with known probabilities) and decisions from experience (learning probabilities through repeated interaction).
Three findings build the case. First, finetuned LLMs describe human behavior better than traditional cognitive models, verified through extensive model simulations confirming human-like behavioral characteristics. Second, embeddings from these finetuned models contain information necessary to capture individual differences — not just population-level averages but subject-level behavioral variation. Third, a model finetuned on two tasks predicts human behavior on a third, hold-out task — genuine cross-task transfer of cognitive modeling capability.
This is not just another "LLMs replicate human patterns" finding. Traditional cognitive models are theory-driven: they embed specific assumptions about how humans process information (prospect theory for gambles, reinforcement learning for experience-based decisions). The LLM approach is theory-agnostic — it captures behavioral regularities without specifying the mechanism. That it outperforms the theory-driven models suggests either that the theories are incomplete, or that LLMs are capturing interaction effects between cognitive mechanisms that modular theories miss.
The individual-differences finding is particularly notable because it connects to Can AI agents learn people better from interviews than surveys?. That work shows LLMs can simulate specific individuals; this work shows LLMs can model individual-level cognitive processes. Together they suggest LLM representations encode not just what people say but how people think — at least for domains well-represented in training data.
Two complementary findings extend this. First, since Can language summaries unlock hidden psychological patterns?, LLMs can predict responses on 9 psychological scales from only 20 Big Five items — with R² > 0.89 structural alignment to human data. The natural language summary serves as an intermediate representation that captures "emergent, second-order information — a conceptual gestalt" beyond what raw scores contain. Second, since Can we control personality in language models without prompting?, PsychAdapter demonstrates that psychological trait knowledge is already structurally present in pre-trained weights — fine-grained personality control requires only activating latent patterns, not teaching new ones. Together with the finetuned cognitive models documented here, these findings converge on a strong claim: LLMs encode human psychological structure at multiple levels — population-level cognitive processes (this note), cross-scale trait relationships (zero-shot profiling), and latent trait representations in weights (PsychAdapter).
The cross-task transfer challenges the view that LLMs are narrow pattern matchers. If finetuning on gamble decisions and experience-based learning transfers to a new task, the model is learning something about human cognition in general, not just memorizing task-specific response patterns. However, the scope remains constrained — both domains involve numerical decision-making, and transfer to qualitatively different cognitive tasks (e.g., language processing, spatial reasoning) is untested.
Inquiring lines that use this note as a source 44
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can LLMs infer situational context the way humans do pragmatically?
- Why does combining natural language with numerical scores improve prediction accuracy?
- Can statistical learning from language alone capture all aspects of cultural competence?
- Why does integrating world models with decision-making systems matter?
- Do token probability distributions in LLMs track human reaction time patterns?
- Do language models build world models or just task-specific heuristics?
- Does approaching human performance mean learning the same grammatical rules?
- Which linguistic abilities are learnable from human-sized data exposure?
- How do bimodal decision patterns in LLMs compare to human economic choice?
- Do language models calibrate to actual human pragmatic norms?
- How does personality priming change LLM strategic decision making?
- Which game type reveals minimax reasoning in language models?
- Can hybrid Bayesian architectures fix language model theory of mind failures?
- Can language models develop world models that ground meaning in causal reality?
- Can training LLMs to form ad-hoc conventions improve their pragmatic reasoning?
- How do language models predict collective social norms better than individual humans?
- How might human-LLM teams reinforce each other's causal reasoning mistakes?
- Do LLMs rely on surface statistical patterns instead of causal structure?
- What data presentation structures enable LLMs to learn decision-making from examples?
- Why do language models capture individual differences in cognitive behavior?
- Can a single model trained on two tasks predict untrained decision tasks?
- Why do language models approximate collective human judgment better than individuals?
- How do humans and R1 models differ in information gain patterns?
- How do LLMs default to surface-level strategies instead of genuine mental simulation?
- Do stated character beliefs predict decisions better when extracted from text?
- Why do automated selection methods outperform human judgments of relevant context?
- Can AI evaluation match human judgment quality in structured domain tasks?
- Can causal belief networks extracted from interviews predict how people respond to policy changes?
- Can theory of mind models generalize across structurally similar scenarios?
- Why do language models respond to human social influence patterns?
- Can models track dynamic mental state changes better than static beliefs?
- Can LLMs simulate belief revision in social systems without modeling thought?
- Do LLMs predict social norms more accurately than individual behavior?
- Can machine learning encode pragmatic reasoning about when rules should bend?
- Does preference optimization distort how models represent human communicative dynamics?
- What makes natural-language APIs particularly suited to LLM-based simulation?
- How do knowing and doing diverge in LLM decision-making?
- What role do humans play in converting language model outputs into meaningful events?
- Can in-context reinforcement learning match human sample efficiency on real problems?
- Do realistic LLM behaviors require simulating human thought or just behavior?
- Can language models beat human experts in domains with sparse historical signals?
- Does a single LLM judge capture diverse human preferences in alignment training?
- What capability boundary exists in LLM prediction of effect sizes?
- How does causal structure avoid behaviorist limitations in LLM social simulation?
Related concepts in this collection 5
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can AI agents learn people better from interviews than surveys?
Can rich interview transcripts seed more accurate generative agents than demographic data or survey responses? This matters because it challenges how we build digital simulations of real people.
individual-level behavioral replication from a different angle (social simulation vs cognitive modeling)
-
How do we generate realistic personas at population scale?
Current LLM-based persona generation relies on ad hoc methods that fail to capture real-world population distributions. The challenge is reconstructing the joint correlations between demographic, psychographic, and behavioral attributes from fragmented data.
population-level simulation shares the calibration challenge
-
Can AI personas reliably replicate human experiment results?
Exploring whether LLM-based persona simulations accurately reproduce experimental findings from published psychology and marketing research, and what factors determine when they succeed or fail.
convergent finding: LLMs capture strong effects, struggle with subtle ones
-
Can language summaries unlock hidden psychological patterns?
Do natural language compressions of personality scores capture information beyond the raw numbers themselves? This explores whether linguistic abstraction reveals emergent trait patterns that numerical data alone cannot.
zero-shot cross-scale inference with R² > 0.89; linguistic compression as mechanism
-
Can we control personality in language models without prompting?
Can lightweight adapter modules enable continuous, fine-grained control over psychological traits in transformer outputs independent of prompt engineering? This explores whether architecture-level personality modification outperforms prompt-based approaches.
psychological traits encoded in pre-trained weights; activation without retraining
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Determinants of LLM-assisted Decision-Making
- Building Decision Making Models Through Language Model Regime
- Turning large language models into cognitive models
- Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse
- Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review
- From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Models
- Cognitive Architectures for Language Agents
- Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey
Original note title
llms finetuned on psychological experiment data become generalist cognitive models that outperform traditional cognitive models and capture individual differences