Can iterative revision cycles match how humans actually write?
Does framing research writing as a diffusion process—where drafts are refined through retrieval-augmented cycles—better capture human cognition than linear pipelines and reduce information loss?
Existing deep research agents combine test-time scaling techniques (CoT, best-of-n, MCTS, debate, self-refinement) without a deliberate cognitive design. Most public agents employ a linear or parallelized pipeline of planning → searching → generation, which loses global context and misses critical dependencies. Cognitive studies of human writing (Flower and Hayes, 1981) show that people do not write linearly — they establish a high-level plan, draft based on the plan, and then engage in multiple revision cycles that interleave further information gathering with rewriting.
TTD-DR observes a structural similarity between this human pattern and retrieval-augmented diffusion sampling: a noisy initial draft is iteratively denoised toward higher-quality outputs, with each denoising step informed by retrieved external information. The framework operationalizes this as report-level diffusion — a preliminary draft serves as an updatable skeleton that evolves through iterative refinement, with each step augmented by targeted retrieval. The draft is a global anchor that maintains coherence across iterations, addressing the information-loss problem of linear pipelines.
Two mechanisms make the analogy useful in practice. Denoising with retrieval drives report-level evolution: the draft and research plan jointly steer the next retrieval, and retrieved content drives the next denoising step. Self-evolution operates at the component level: each unit agent (plan generator, question generator, answer searcher, report generator) undergoes its own optimization, mitigating per-component information loss across long agentic trajectories. The interplay is essential — without component-level self-evolution, the draft-level diffusion lacks high-quality context to refine on.
The conceptual yield is that diffusion is not just a generation technique but a process model for cognitively-inspired iterative work. Any task that humans approach as draft-and-revise rather than write-once-correctly — research reports, design documents, complex prose — is a candidate for the same draft-centric, retrieval-augmented diffusion treatment. The draft becomes the persistent state that the agentic system refines, rather than a final output produced by a feed-forward pipeline — analogous to how Why does vanilla RAG produce shallow and redundant results? argues iterative loops are required for depth.
Inquiring lines that use this note as a source 5
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- What distinguishes redundant cycles from productive reconsidering cycles?
- Why do linear research pipelines lose global context across planning and generation steps?
- What makes draft-centric systems better anchors for coherence than feed-forward outputs?
- Can retrieval strategies drive both draft refinement and new research question generation?
- Can human researchers improve LLM ideas through iterative feedback?
Related concepts in this collection 6
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can diffusion models perform evolutionary search in parameter space?
Diffusion models and evolutionary algorithms share equivalent mathematical structures. Can we leverage this equivalence to build evolutionary search methods that preserve solution diversity better than traditional algorithms?
extends: same diffusion-as-process equivalence; this note applies it to multi-step agentic work rather than parameter-space search
-
What makes deep research fundamentally different from RAG?
Explores whether current systems using the label 'deep research' actually meet a rigorous three-component definition involving multi-step gathering, cross-source synthesis, and iterative refinement, or if they're performing something narrower.
exemplifies: TTD-DR meets all three components — its draft-level diffusion is iterative query refinement made structural
-
Can retrieval be extended into multi-step chains like reasoning?
Standard RAG retrieves once, but multi-hop tasks need intermediate steps. Can we train models to plan retrieval sequences the way chain-of-thought trains reasoning, and scale retrieval at test time?
complements: CoRAG scales retrieval as chain-of-thought; TTD-DR scales it as denoising-of-draft
-
Why does vanilla RAG produce shallow and redundant results?
Standard RAG systems get stuck in a single semantic neighborhood because their initial query determines what documents are discoverable. The question asks whether fixed retrieval strategies fundamentally limit knowledge depth compared to iterative exploration.
extends: same iterative-depth argument; TTD-DR organizes the iteration around a persistent draft rather than around expanding queries
-
Does limiting reasoning per turn improve multi-turn search quality?
When language models engage in iterative search cycles, does capping reasoning at each turn—rather than just total compute—help preserve context for subsequent retrievals and improve overall search effectiveness?
complements: per-turn budget constraint applies to TTD-DR's component-level self-evolution
-
Can RAG systems safely learn from their own generated answers?
Explores whether retrieval-augmented generation can feed its outputs back into the corpus without corrupting knowledge with hallucinations. The core problem: how to prevent feedback loops from compounding errors.
complements: write-back as the inter-session analog of TTD-DR's draft-as-persistent-state
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Deep Researcher with Test-Time Diffusion
- Chain of Draft: Thinking Faster by Writing Less
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
- Re3: Generating Longer Stories With Recursive Reprompting and Revision
- AI for Auto-Research: Roadmap & User Guide
- The Alien Space of Science: Sampling Coherent but Cognitively Unavailable Research Directions
- DOC: Improving Long Story Coherence With Detailed Outline Control
- Has the Creativity of Large-Language Models peaked? —an analysis of inter- and intra-LLM variability —
Original note title
research report writing maps onto diffusion sampling — drafts are noisy outputs and revision cycles are denoising steps augmented by retrieval