Can we extract causal belief networks from interview conversations?

Can natural language interviews be systematically parsed into causal graphs that capture how individuals reason about policy trade-offs? This matters for building auditable belief simulations that go beyond static opinion snapshots.

Synthesis note · 2026-05-03 · sourced from World Models

Building generative agents that simulate human reasoning, rather than merely producing plausible stances, requires modeling each individual's internal logic. This goal is motivated by the critique in "Can language models simulate belief change in people?". The proposed pipeline is a three-step process that goes from natural language to executable causal belief networks (CBNs).

Step 1: Extract causal motifs from QA responses. LLM-conducted semi-structured interviews adaptively elicit causal explanations in everyday language ("why do you support X?" "what does Y influence?"). Responses are annotated with concept nodes and directional relations. For example: Q: "How might surveillance affect public safety?" A: "It can reduce crime by aiding investigations with more transparency, which increases public safety." Motif: Transparency → Crime rate → Public safety. A second QA might add: Privacy ← Transparency → Crime rate.
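The motif notation in Step 1 can be read mechanically as a list of directed edges. The following is a minimal sketch of that reading, assuming motifs arrive as arrow-delimited strings like the two examples above; the function name `motif_to_edges` is hypothetical, and the sketch covers only the parsing of an already-annotated motif, not the LLM annotation step itself.

```python
import re

def motif_to_edges(motif: str) -> list[tuple[str, str]]:
    """Turn an arrow-delimited motif into (cause, effect) pairs.

    Assumes the only arrow symbols are "→" (left causes right)
    and "←" (right causes left), as in the note's examples.
    """
    # The capturing group makes re.split keep the arrows in the result,
    # so concepts sit at even indices and arrows at odd indices.
    parts = [p.strip() for p in re.split(r"(→|←)", motif)]
    concepts, arrows = parts[0::2], parts[1::2]
    edges = []
    for left, arrow, right in zip(concepts, arrows, concepts[1:]):
        edges.append((left, right) if arrow == "→" else (right, left))
    return edges

print(motif_to_edges("Transparency → Crime rate → Public safety"))
print(motif_to_edges("Privacy ← Transparency → Crime rate"))
```

Both motifs share the edge from Transparency to Crime rate, which is what lets separate answers be merged into one graph in the next step.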

Step 2: Compose a Causal Belief Network. The motifs are compiled into a belief graph representing the participant's reasoning. Nodes are concepts (fairness, safety, family needs); edges are directional causal relations with confidence and polarity scores derived from motif density or respondent emphasis.
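One way to picture the composition step is as an aggregation over motifs. The sketch below is an assumption-laden illustration, not the source's method: it takes "motif density" to mean the fraction of responses that mention an edge, treats polarity as a majority vote over +1/-1 labels, and uses hypothetical polarity values for the surveillance example. The name `compose_cbn` is also hypothetical.

```python
from collections import defaultdict

def compose_cbn(motifs):
    """Merge per-response motifs into one edge table.

    motifs: one list per QA response, each holding
            (cause, effect, polarity) triples with polarity in {+1, -1}.
    Returns {(cause, effect): {"confidence": float, "polarity": int}}.
    """
    counts = defaultdict(int)
    polarity_sum = defaultdict(int)
    for motif in motifs:
        # set() so an edge repeated within one response is counted once
        for cause, effect, polarity in set(motif):
            counts[(cause, effect)] += 1
            polarity_sum[(cause, effect)] += polarity
    n = len(motifs)
    return {
        edge: {
            # assumed density measure: share of responses mentioning the edge
            "confidence": count / n,
            # majority vote; ties default to +1 in this sketch
            "polarity": 1 if polarity_sum[edge] >= 0 else -1,
        }
        for edge, count in counts.items()
    }

# Hypothetical polarities for the two example motifs from Step 1.
response_1 = [("Transparency", "Crime rate", -1), ("Crime rate", "Public safety", -1)]
response_2 = [("Transparency", "Privacy", +1), ("Transparency", "Crime rate", -1)]
cbn = compose_cbn([response_1, response_2])
for edge, attrs in cbn.items():
    print(edge, attrs)
```

A plain dictionary keyed by edge keeps every score traceable to the responses that produced it; a graph library could replace it without changing the idea.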

Step 3: Simulate belief change via intervention. Apply a hypothetical intervention such as do(Transparency = high), reflecting a policy shift like increased camera accountability. Use belief propagation over the CBN to update downstream posteriors. Example: P(Privacy Concern) shifts from 0.7 to 0.3, and P(Opposition to Surveillance) shifts from 0.7 to 0.2.
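The intervention step can be sketched on a three-node chain, Transparency → Privacy Concern → Opposition. Every probability below is hypothetical: the values for Privacy Concern were picked so that its marginal moves from 0.7 to 0.3 as in the example, while the Opposition values make no attempt to match the note's figures. Because Transparency has no parents in this toy graph, do(Transparency = high) amounts to replacing its prior with certainty.

```python
# Hypothetical parameters for a binary chain:
# Transparency -> Privacy Concern -> Opposition to Surveillance
P_T_HIGH = 0.2                                   # prior P(Transparency = high)
P_CONCERN_GIVEN_T = {True: 0.3, False: 0.8}      # P(concern | transparency high?)
P_OPPOSE_GIVEN_CONCERN = {True: 0.9, False: 0.2} # P(opposition | concern?)

def marginals(p_t_high: float) -> tuple[float, float]:
    """Propagate P(Transparency = high) down the chain by marginalization."""
    p_concern = (p_t_high * P_CONCERN_GIVEN_T[True]
                 + (1 - p_t_high) * P_CONCERN_GIVEN_T[False])
    p_oppose = (p_concern * P_OPPOSE_GIVEN_CONCERN[True]
                + (1 - p_concern) * P_OPPOSE_GIVEN_CONCERN[False])
    return p_concern, p_oppose

before = marginals(P_T_HIGH)  # observational marginals
after = marginals(1.0)        # do(Transparency = high) on a root node

print("P(Privacy Concern):", round(before[0], 3), "->", round(after[0], 3))
print("P(Opposition):     ", round(before[1], 3), "->", round(after[1], 3))
```

In a graph where the intervened node has parents, the same idea requires cutting its incoming edges before propagating, which this chain does not need.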

The chain illustrates how motif-based causal modeling can simulate the way individuals update beliefs in response to policy changes, moving beyond static opinion snapshots. But the paper acknowledges open challenges: extracting CBNs from natural language remains hard because of ambiguity in concept identification, causal direction, polarity, and conceptual granularity. Causality alone also cannot capture the full range of human reasoning, since people rely on associative, analogical, and emotional processes that resist strict symbolic modeling. The initial focus on causality is described as a strategic and computationally tractable starting point, not an endpoint.

The pipeline's value is that it makes belief simulation auditable. Unlike a prompted persona whose reasoning is inscrutable, a CBN exposes the structure of belief and supports formal analyses (intervention, counterfactuals, sensitivity to specific edges). Policy simulation requires this auditability because stakeholders must be able to ask why a simulated agent reached a stance. The interview-based approach in "Can AI agents learn people better from interviews than surveys?" addresses that requirement with content richness but not with structural transparency.
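As an illustration of what "asking why" could look like in practice, the sketch below enumerates every directed path from an intervened concept to a stance, along with the net polarity of each path. The edge set and its signs are hypothetical (in particular, the edge from Public safety to Opposition is an assumption added for the example), and `causal_paths` is an illustrative helper rather than part of the described pipeline.

```python
def causal_paths(edges, source, target, path=None, sign=1):
    """Return [(path, net_polarity)] for all directed paths source -> target.

    edges: {(cause, effect): polarity} with polarity in {+1, -1}.
    Net polarity is the product of edge polarities along the path.
    """
    path = (path or []) + [source]
    if source == target:
        return [(path, sign)]
    found = []
    for (cause, effect), polarity in edges.items():
        # skip nodes already on the path so a cyclic graph cannot loop forever
        if cause == source and effect not in path:
            found += causal_paths(edges, effect, target, path, sign * polarity)
    return found

# Hypothetical signed edges for the surveillance example.
edges = {
    ("Transparency", "Privacy concern"): -1,
    ("Privacy concern", "Opposition"): +1,
    ("Transparency", "Crime rate"): -1,
    ("Crime rate", "Public safety"): -1,
    ("Public safety", "Opposition"): -1,
}

for route, polarity in causal_paths(edges, "Transparency", "Opposition"):
    print(" -> ".join(route), "| net polarity:", polarity)
```

Each printed route is a candidate explanation a stakeholder can inspect, and removing one edge then rerunning the function shows which explanations depend on it.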

Related concepts in this collection (6)

Can language models simulate belief change in people? Current LLM social simulators treat behavior as input-output mappings without modeling internal belief formation or revision. Can they be redesigned to actually track how people think and change their minds?
extends: companion piece — this CBN pipeline is the concrete implementation of the cognitivist alternative the tension demands
Can we measure reasoning quality beyond output plausibility? How might we evaluate whether AI systems reason internally like humans do, rather than just producing human-like outputs? This matters because surface coherence can mask broken underlying reasoning.
exemplifies: CBNs operationalize all three properties — motifs supply traceability, do-operations supply counterfactual adaptability, motif reuse supplies compositionality
Can causal models alone capture how humans actually reason? Explores whether causal belief networks provide a complete picture of human cognition or whether associative, analogical, and emotional reasoning modes fall outside their scope.
tension: companion limit-piece — what CBNs can and cannot represent
Can AI agents learn people better from interviews than surveys? Can rich interview transcripts seed more accurate generative agents than demographic data or survey responses? This matters because it challenges how we build digital simulations of real people.
complements: Park et al. show interview-content as the simulation lever; CBN extracts that content into formal structure
Can structured cognitive models improve LLM patient simulations for therapy training? Does embedding Beck's Cognitive Conceptualization Diagram into language models produce more realistic patient simulations than generic LLMs? This matters because therapy training relies on exposure to diverse, believable patient presentations.
exemplifies: CCD as a domain-specific structured cognitive model in clinical use, parallel to CBN for policy use
Can counterfactual invariance eliminate reward hacking biases? Does forcing reward models to remain consistent under irrelevant changes remove the spurious correlations that cause length bias, sycophancy, concept bias, and discrimination? This matters because standard training bakes these biases in permanently.
complements: do-calculus framework applied at the reward layer rather than the agent-belief layer
