Can LLMs reason creatively beyond conventional problem-solving?
Explores whether large language models can engage in truly creative reasoning that expands or redefines solution spaces, rather than just decomposing known problems. This matters because existing reasoning methods may miss creative capabilities entirely.
The Universe of Thoughts (UoT) paper identifies a blind spot in the LLM reasoning literature: all existing methods (CoT, ToT, GoT, Forest-of-Thought) focus on conventional problem-solving — decomposing known problem types into manageable steps. None address creative reasoning, where the solution space itself must be expanded or redefined.
Drawing on Boden's established cognitive science framework, UoT defines three creative reasoning paradigms:
1. Combinational Creative Reasoning: Identifying solutions from other domains that are relevant to the target problem but have not been previously applied there. The mechanism: cross-pollination of known solutions across domain boundaries. A collage is combinational — existing visuals arranged in unconventional ways.
2. Exploratory Creative Reasoning: Adopting individual building blocks (not solutions) from outside the target solution space. New conceptual primitives expand what's possible within the existing framework. Impressionism was exploratory — brushstrokes used in a functionally new way within painting's existing rules.
3. Transformational Creative Reasoning: Fundamentally altering or dropping the core rules that define the solution space. This changes what solutions are even conceivable. Cubism was transformational — breaking the rule of direct representation to depict objects from multiple angles.
The hierarchy is important: combinational reuses, exploratory expands, transformational redefines. Each requires progressively deeper deviation from conventional reasoning patterns.
UoT introduces evaluation metrics orthogonal to standard reasoning benchmarks: feasibility as a constraint (creative solutions must still be implementable), with utility and novelty as metrics. This three-axis evaluation addresses a gap identified by Can LLMs generate more novel ideas than human experts? — LLMs can generate novel outputs but cannot evaluate their own creativity.
The connection to Why do LLMs generate novel ideas from narrow ranges? is direct: diversity collapse may occur precisely because existing reasoning methods explore only one paradigm (combinational at best) while neglecting exploratory and transformational modes. Explicitly prompting for each paradigm could address the diversity problem.
Inquiring lines that use this note as a source 40
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- What makes colorless green ideas fail where Jabberwocky succeeds?
- Why do LLMs generate ideas that sound novel but fail during execution?
- How do constrained versus unconstrained domains flip LLM novelty patterns?
- Why does embedding evaluation criteria in prompts reduce creative scope?
- Can few-shot examples narrow generative diversity in creative tasks?
- Why does LLM research ideation collapse into low diversity despite high novelty?
- How can LLMs evaluate their own creative outputs for utility and novelty?
- What graph structures would enable transformational creative reasoning in LLMs?
- Can prompting for specific creative paradigms improve ideation diversity?
- When should an LLM engage extended reasoning versus responding directly?
- Why do research ideation systems suffer from diversity collapse despite high novelty metrics?
- Can LLMs improve at metaphor if they handle decoupled semantics better?
- Can combinational creativity alone drive open-ended learning in agents?
- Why do LLM-generated ideas score higher novelty yet lower feasibility than expert ideas?
- Why do LLMs plateau on creativity tasks while humans reach further?
- How does prompt design alter what kind of creativity LLMs can express?
- Why do LLM research ideas lack diversity despite high average novelty?
- Why do LLMs generate novel ideas but lack evaluative commitment?
- What internal mechanisms explain LLM reasoning and representation limits?
- Do LLMs generate more novel ideas than they can evaluate?
- Which knowledge types do LLMs handle better than humans in reasoning tasks?
- Can LLMs reliably generate novel working architectures without structured representations?
- Why do structured and creative domains exhibit opposite entropy dynamics?
- Can language models reason without relying on surface level pattern matching?
- Can reasoning style be steered as a single linear direction?
- What role does curriculum design play in reasoning emergence?
- Do higher asymptote recipes unlock genuinely novel reasoning strategies?
- Why do LLMs generate novel ideas but struggle to evaluate them?
- Can AI provide creative evaluation or only generative idea production?
- Can LLMs generate more novel research ideas than human experts?
- Do novelty and feasibility always trade off in idea generation?
- Can extended thinking modes introduce genuine rhetorical exploration to LLMs?
- Does structured decomposition improve LLM reasoning in other compound tasks?
- Does RL amplify existing reasoning or create genuinely new computational strategies?
- Can unfilled cells in the periodic table represent undiscovered argument schemes?
- Can cognitive scaffolding replace tool-based reasoning augmentation in language models?
- What makes creative writing diversity different from code diversity fundamentally?
- Why do preference-tuned models produce different diversity patterns in code versus creative writing?
- Can training alone produce genuine disagreement in collaborative LLM reasoning?
- Can tools unlock reasoning strategies that require abstract insight beyond computation?
Related concepts in this collection 4
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Why do LLMs generate novel ideas from narrow ranges?
LLM research agents produce individually novel ideas but cluster them in homogeneous sets. This explores why high average novelty coexists with poor diversity coverage and what it means for automated ideation.
creative paradigm diversity could address the diversity collapse problem
-
Can LLMs generate more novel ideas than human experts?
Research shows LLM-generated ideas score higher for novelty than expert-generated ones, yet LLMs avoid the evaluative reasoning that characterizes expert thinking. What explains this apparent contradiction?
UoT's evaluation framework (feasibility + utility + novelty) addresses the dissociation
-
Can reasoning topologies be formally classified as graph types?
This explores whether Chain of Thought, Tree of Thought, and Graph of Thought represent distinct formal graph structures with different computational properties. Understanding this matters because the topology itself determines what reasoning strategies are possible.
existing topology is for conventional reasoning; creative reasoning may require different graph structures
-
Why do LLMs struggle to connect unrelated entities speculatively?
LLMs reliably organize and summarize evidence but fail when asked to speculate about connections between dissimilar entities. Understanding this failure could reveal fundamental limits in how models handle complex analytical reasoning.
combinational creativity is exactly the "speculative connections" capability that's missing
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Universe of Thoughts: Enabling Creative Reasoning with Large Language Models
- Reasoning LLMs are Wandering Solution Explorers
- Has the Creativity of Large-Language Models peaked? —an analysis of inter- and intra-LLM variability —
- The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
- Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
- Eliciting Reasoning in Language Models with Cognitive Tools
- Opportunities for large language models and discourse in engineering design
- Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models
Original note title
creative reasoning requires three distinct paradigms — combinational exploratory and transformative — that existing reasoning methods do not address