How do compact latent dynamics enable planning without explicit chain of thought?

This explores how reasoning that happens inside a model's hidden states — rather than spelled out as text — can still produce planning and problem-solving, and what the corpus knows about why latent reasoning works at all.

This explores how reasoning that lives in a model's internal representations, rather than in written-out chain-of-thought tokens, can still carry planning — and the corpus turns out to have a surprisingly direct answer: the reasoning was already in there, and chain-of-thought is just one way to surface it. The most striking evidence is that you can trigger reasoning by steering a single internal feature. Can we trigger reasoning without explicit chain-of-thought prompts? shows that nudging one sparse-autoencoder-identified direction matches or beats explicit chain-of-thought across six model families — and it activates early, overriding surface instructions. That reframes the whole question: latent reasoning isn't a workaround for missing CoT, it's the underlying capability that CoT happens to invoke.

If reasoning is a direction in activation space, it's also compressible and controllable. Can we steer reasoning toward brevity without retraining? finds that the difference between rambling and terse reasoning is a single linear vector extracted from 50 examples — cut verbosity by 67% with no accuracy loss. So the 'compact' in compact latent dynamics is literal: the same plan can occupy far fewer tokens, or none, because the trajectory through hidden space is what's doing the work. Can energy minimization unlock reasoning without domain-specific training? pushes this further, replacing token-by-token generation with gradient descent on an energy landscape — deliberation becomes an optimization over internal states rather than a written transcript, and it generalizes better out of distribution.

Where planning specifically comes in, two complementary moves matter: making latent dynamics stochastic, and conditioning them on the future. Can stochastic latent reasoning help models explore multiple solutions? replaces deterministic latent updates with sampling, so a recursive reasoner can hold a distribution over solutions instead of committing to one path — exactly what you need for ambiguous problems with several valid strategies. Its sibling result Can reasoning systems scale wider instead of only deeper? shows those stochastic latents let you scale reasoning 'wider' — sampling many parallel internal trajectories — instead of only deeper, sidestepping the latency cost of long serial chains. Planning here looks like exploring a branching space of latent futures, none of it written down.

The other route to planning-without-CoT is to teach the model to anticipate. Can embedding future information in training data improve planning? bakes future-information tokens into training data so a standard model learns goal-conditioned generation — better planning and algorithmic reasoning with no architectural change. And Can modular cognitive tools unlock reasoning without training? makes explicit what ties all of this together: the modular-tools paper lifted GPT-4.1 on AIME from 27% to 43% with no training at all, because the reasoning was latent and just needed structured isolation to come out. The thread across the corpus is consistent — capable models already contain the plans; compact latent dynamics are about steering, sampling, and conditioning those internal states rather than transcribing them into text.

Sources 7 notes

Can we trigger reasoning without explicit chain-of-thought prompts?

SAE-identified reasoning features can be directly steered to match or exceed chain-of-thought performance across six model families. This reasoning mode activates early in generation and overrides surface-level instructions, suggesting latent reasoning is a fundamental capability independent of explicit prompting.

Can we steer reasoning toward brevity without retraining?

Activation-Steered Compression extracts a single vector from 50 paired examples to reduce chain-of-thought length by 67% while maintaining accuracy and achieving 2.73x speedup. The method is training-free and generalizes across model sizes and domains.

Can energy minimization unlock reasoning without domain-specific training?

Energy-Based Transformers assign energy values to input-prediction pairs and use gradient descent minimization for inference, yielding 35% higher training scaling rates and 29% more inference-compute gains than Transformer++, while generalizing better on out-of-distribution data without domain-specific scaffolding.

Can stochastic latent reasoning help models explore multiple solutions?

GRAM replaces deterministic latent updates with stochastic sampling, enabling models to represent distributions over solutions rather than single predictions. This allows handling of ambiguous problems and multiple valid strategies that deterministic designs cannot represent.

Can reasoning systems scale wider instead of only deeper?

GRAM shows that stochastic latent transitions enabling parallel trajectory sampling sidestep the serial latency cost of depth-only scaling. Width matches token-level parallelism benefits: independent paths sample the solution space without variance inflation.

Can embedding future information in training data improve planning?

TRELAWNEY augments training data with special tokens encapsulating future information, allowing models to learn goal-conditioned generation using standard infrastructure. Results show improved planning, algorithmic reasoning, and story generation without modifying architecture or training procedures.

Can modular cognitive tools unlock reasoning without training?

Four cognitive tools implemented as sandboxed LLM calls improved GPT-4.1 on AIME2024 from 26.7% to 43.3% without any RL training. Modularity enforces operation isolation that pure prompting cannot guarantee, eliciting pre-existing reasoning capability.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst tracking latent reasoning in LLMs. The question: **Can models plan and reason without explicit chain-of-thought, using only compact internal dynamics?**

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat these as perishable constraints to re-test:
- Steering a single sparse-autoencoder feature matches or beats chain-of-thought performance across six model families, activating before surface instruction (2026).
- Verbosity vs. concision occupy distinct linear subspaces in activation space; a single vector extracted from 50 examples cuts tokens by 67% with no accuracy loss (2025-07).
- Stochastic latent sampling lets recursive reasoners hold uncertainty distributions and scale reasoning 'wider' instead of only deeper, avoiding serial latency (2026-05).
- Energy-based optimization over internal states replaces token-by-token generation; deliberation becomes gradient descent on activation landscapes with better OOD generalization (2025-07).
- Future-conditioning via 'lookahead tokens' baked into training data enables goal-directed planning without architectural change (2025-04).

Anchor papers (verify; mind their dates):
- arXiv:2601.08058 (2026-01): Reasoning Beyond Chain-of-Thought: A Latent Computational Mode in Large Language Models
- arXiv:2507.04742 (2025-07): Activation Steering for Chain-of-Thought Compression
- arXiv:2605.19376 (2026-05): Generative Recursive Reasoning
- arXiv:2506.12115 (2025-06): Eliciting Reasoning in Language Models with Cognitive Tools

Your task:
(1) **RE-TEST EACH CONSTRAINT.** For each finding above, probe whether newer scaling laws, training paradigms (RL pretraining, curriculum learning), inference optimizations (speculative decoding, batched lookahead), or evaluation suites have since relaxed or overturned it. Separate the durable insight (latent planning exists) from perishable limitation (method X is necessary). Cite what relaxed each constraint, and flag where it still holds.
(2) **Surface contradicting work.** Identify the strongest recent papers (~last 6 months) that challenge the claim that CoT is redundant or that latent reasoning suffices for all planning regimes. Note which problem classes still seem to require written-out reasoning.
(3) **Propose 2 forward questions** that assume the regime has moved: e.g., What does planning look like when latent dynamics are combined with learned memory abstractions? Can stochastic latent sampling scale to 100+-step horizons without divergence?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

How do compact latent dynamics enable planning without explicit chain of thought?

Sources 7 notes

Next inquiring lines