SYNTHESIS NOTE
Model Architecture and Internals Reasoning, Retrieval, and Evaluation Training, RL, and Test-Time Scaling

Does adding randomness to recursive models actually help reasoning?

GRAM's ablations test whether stochasticity alone improves recursive architectures, or whether the gains depend on a specific training framework. This matters because it separates surface mechanisms from the methods that make them work.

Synthesis note · 2026-05-28 · sourced from Reasoning Architectures

It would be easy to read GRAM's result as "adding noise to recursive reasoning helps." The paper explicitly forecloses that reading. Its ablations show that naive stochastic alternatives — randomness bolted onto existing recursive architectures without the proper training objective — yield no improvement. The gains stem specifically from the variational framework: GRAM is trained with amortized variational inference, which couples the stochastic latent transitions to a principled generative objective p(y|x) and p(x).

This is the load-bearing distinction. Stochasticity is necessary but not sufficient; what makes it productive is that the sampling is learned to approximate a posterior over latent trajectories, not injected as undirected perturbation. The variational training shapes where and how the model branches, so the sampled trajectories correspond to genuinely different hypotheses rather than random walks around a single attractor. The authors frame stochastic guidance as a general-purpose extension that consistently improves any recursive architecture — but only when delivered through the variational machinery.

This is a recurring lesson in latent-reasoning research and worth flagging as a methodological guardrail: the surface mechanism (here, randomness) is rarely the explanation; the framework that disciplines it is. The same caution applies to claims about reasoning-trace content elsewhere in the vault — apparent mechanisms can be confounded with the training regime that produced them. Counterpoint to test in future work: if the variational objective is what matters, one would predict that other principled posterior-approximation schemes (not just amortized VI) would deliver comparable gains; if only VI works, the explanation is narrower than "variational." Why it matters: it prevents a cargo-cult takeaway and tells practitioners that the engineering effort belongs in the training objective, not in the noise injection.

Inquiring lines that use this note as a source 4

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 2

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
13 direct connections · 158 in 2-hop network ·dense cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

the gains from stochastic recursive reasoning come from the variational framework not from mere randomness