Can formal argumentation make AI decisions truly contestable?
Explores whether structuring AI decisions as formal argument graphs (with explicit attacks and defenses) enables users to meaningfully challenge and navigate reasoning in ways unstructured LLM outputs cannot.
Standard LLM outputs produce conclusions — sometimes with reasoning attached, sometimes not — but in a form that is not structured for contestation. A user who disagrees with a conclusion can ask for clarification, but they cannot navigate the reasoning structure to identify exactly which premise they reject or which argument they believe is defeated by a counterargument.
Argumentative LLMs apply formal argumentation theory — specifically Dung's abstract argumentation framework — to structure AI outputs. In Dung's framework, arguments attack each other; an argument is "accepted" if it is not defeated by an undefeated attacker. The framework is a directed graph of attack relations, and the "winner" is the argument that survives all attacks.
When an LLM's decision process is structured as a Dung argumentation framework, the output is not just a conclusion but a graph: these arguments support this conclusion, these counterarguments attack those supporting arguments, these rebuttals defend the original arguments. A user can inspect the graph, identify the specific argument they contest, and challenge it — the framework tells them exactly what would change the conclusion.
This is genuinely contestable AI output in a way that standard LLM reasoning is not. Can we measure how deeply models represent political ideology? and Does high refusal rate indicate ethical caution or shallow understanding? both point at the absence of navigable structure in LLM political/ideological reasoning. Formal argumentation provides that structure.
The connection to Do language models actually use their encoded knowledge? is direct: standard LLM outputs may not reflect the reasoning that produced them. Forcing the reasoning into a formal argumentation structure requires the model to generate the argument graph that justifies the conclusion — making it harder to produce outputs whose reasoning cannot be reconstructed.
The limitation: formal argumentation requires the argument space to be enumerable and structured, which works for some domains (medical diagnosis, policy analysis) but not for open-ended creative or subjective tasks.
Extension to opinion domains via Key Point Hierarchies: KPH (Key Point Hierarchies) applies entailment-graph structure to opinion summarization. Key points extracted from reviews are organized by specificity into a hierarchy — users quickly grasp high-level themes (the hotel is beautiful, great service) then drill down to fine-grained insights (check-in was quick and easy). Same-meaning key points cluster into single nodes, reducing redundancy. This extends formal argumentation's navigability principle from logical argument structure to opinion structure: flat lists of key points are hard to consume, but hierarchical entailment structure makes them tractable for navigation and sense-making.
The Social Transparency (ST) perspective extends this further: even when algorithm-level explainability is achieved, it may be insufficient. Most consequential AI systems are embedded in socio-organizational tapestries where groups of humans interact with the system. "If the boundary is traced along the bounds of an algorithm, we risk excluding the human and social factors that significantly impact the way people make sense of a system." Two identified pitfalls — Solutionism (always seeking technical solutions) and Formalism (seeking abstract mathematical solutions) — are deeply embedded in AI research and widen the gap between algorithmic and social explanation. Formal argumentation addresses the algorithm boundary; Social Transparency addresses the socio-organizational boundary beyond it.
Engineering design as argumentative discourse: The product development process is inherently argumentative — solving engineering problems requires discourse where experiments, calculations, and simulations inform reasoning but do not replace it. Representing this argumentative discourse as a digital artifact would: (1) improve documentation (archiving reasoning, not just CAD files), (2) make past design decisions traceable, (3) improve collaborative design, (4) enable machine participation in the reasoning process. LLMs embedded in a predefined causal framework ("querying a language model becomes a computational primitive") produce interpretable reasoning traces with reduced hallucination — formal argumentation structure provides guardrails that natural language reasoning lacks.
Inquiring lines that use this note as a source 51
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Can AI output be verified without understanding the reasoning behind it?
- Does verification of AI outputs face the same circularity problem?
- Can AI arguments participate in discourse without temporal grounding?
- Does conversational format make AI arguments more persuasive than static text?
- Can beam search and ranking functions evaluate claims without understanding counterarguments?
- What would an AI trained for emancipatory reasoning look like?
- Can traditional cross-examination methods work against AI that never concedes?
- Can AI systems produce genuinely new validity claims without community participation?
- Does structured debate between agent groups improve evaluation consensus more than independent scoring?
- Can verification mechanisms prevent AI agents from inventing false citations?
- Can structured dissent mechanisms replace genuine multi-model debate?
- Can contextual design decisions resist formalization into evaluation rubrics?
- Why does static grounding prevent AI systems from supporting dialectical reconciliation?
- Can single-model internal dialogue replace multi-agent debate systems?
- Can debate-style multi-agent systems be trusted on contested factual domains?
- What are rational speech acts and how do they enable AI legibility?
- Can AI evaluation tools solve the verification problem they help create?
- What second- and third-order interpretations actually govern AI adoption decisions?
- Can current AI safety defenses actually stop semantic-level persuasion attacks?
- What mitigation frameworks exist for managing AI persuasion capabilities?
- What distinguishes capability-based refusal from principle-based refusal in practice?
- What happens when AI discourse lacks a position to defend?
- Should XAI designers treat explanations as arguments for adoption?
- Can Socratic questioning replace external evidence verification in multi-agent systems?
- How do graph-based reasoning topologies map to multi-agent interaction patterns?
- Can AI distinguish when validation helps versus when confrontation is needed?
- Can AI systems deliberately align arguments to audience presuppositions?
- What infrastructure could replace search for verifying AI outputs?
- What happens when comfortable AI interactions replace the productive friction of disagreement?
- Can users interrogate AI outputs without verifying every single claim?
- Does training on self-play disagreement data improve multi-agent reasoning outcomes?
- Can artificial systems develop the authority to challenge expert claims?
- What role could knowledge custodians play in validating AI output?
- What makes reasoning auditable in medical AI decision support?
- Why do AI-generated answers carry unearned authority in decision-making contexts?
- Why does AI output lack the argumentative turbulence of human thinking?
- How do expert communities develop and enforce standards for valid arguments?
- What implicit warrants do expert arguments rely on that AI cannot reliably access?
- How does the first-order and second-order distinction unify classical and modern argument theory?
- Can automated tools close the gap between AI generation and verification?
- Can argumentation structure improve reasoning through decomposition alone?
- Why do logic-based arguments make AI persuasion feel objective and impartial?
- Why does showing counterarguments restore users' ability to discriminate?
- Can verifier-based objectives preserve reasoning transparency alongside correctness?
- How can outcome-based rules govern AI deployment faster than traditional legislation?
- Does argument-scheme prompting improve reasoning in non-code domains the same way?
- What makes an argument fallacious according to formal linguistic criteria?
- Can formal argumentation structure replace ad-hoc fallacy classifications?
- Do computational systems need formal argument analysis for explainability?
- How do different legal AI tools compare in accuracy across case eras?
- How does persuasive framing replace evidence in contested domains?
Related concepts in this collection 5
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can we measure how deeply models represent political ideology?
This research explores whether LLMs vary not just in political stance but in the internal richness of their political representation. Understanding this distinction could reveal how deeply models have internalized ideological concepts versus merely parroting positions.
ideological reasoning needs navigable structure; formal argumentation provides it
-
Does high refusal rate indicate ethical caution or shallow understanding?
When LLMs refuse political questions at high rates, does this reflect principled safety training or a capability gap? This matters because refusal rates are often used to evaluate model safety.
shallow representation means no argument graph to navigate; deep representation would enable formal argumentation
-
Do language models actually use their encoded knowledge?
Probes can detect that LMs encode facts internally, but do those encoded facts causally influence what the model generates? This explores the gap between knowing and doing.
formal argumentation forces causal transparency: the argument graph must be generated, not just implied
-
Does chain of thought reasoning actually explain model decisions?
When language models show their reasoning steps in agentic pipelines, does the quality of those steps predict or explain the quality of final outputs? This matters for trusting and debugging AI systems.
documents the problem formal argumentation addresses: in agentic pipelines, CoT produces explanations without explainability because thought quality is weakly correlated with response quality; formal argumentation's traversable graph structure is the remedy — forcing reasoning into inspectable attack/defense relations rather than free-form chains
-
Can symbolic solvers fix how LLMs reason about logic?
LLMs excel at understanding natural language but fail at precise logical inference. Can pairing them with deterministic symbolic solvers—using solver feedback to refine attempts—overcome this fundamental weakness?
architectural family: both approaches improve reasoning reliability by imposing formal structure on LLM outputs — symbolic solvers enforce logical precision, argumentation frameworks enforce justification structure; both compensate for LLMs' unreliable natural-language reasoning
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Argumentative Large Language Models for Explainable and Contestable Decision-Making
- Can Large Language Models Understand Argument Schemes?
- Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying
- AI Argues Differently: Distinct Argumentative and Linguistic Patterns of LLMs in Persuasive Contexts
- The Thin Line Between Comprehension and Persuasion in LLMs
- A Hybrid Human-AI Approach for Argument Map Creation From Transcripts
- SocraSynth: Multi-LLM Reasoning with Conditional Statistics
- Argunauts: Open LLMs that Master Argument Analysis with Argdown
Original note title
structured formal argumentation frameworks make ai decisions explainable and contestable in ways that unstructured llm outputs cannot