SYNTHESIS NOTE

Can question features alone predict when to retrieve?

Can lightweight external features of a question—rather than expensive model uncertainty checks—reliably decide whether retrieval is needed? This matters because uncertainty-based methods promise efficiency but add computation.

Synthesis note · 2026-06-03 · sourced from RAG

Adaptive retrieval aims to retrieve only when necessary, saving compute and reducing the misinformation RAG can introduce. But the dominant approach frames the decision as LLM-based uncertainty estimation (querying the model's internal states or outputs to gauge confidence), which is expensive and impractical at scale and often eats the very efficiency gains it promises. This paper proposes an LLM-independent alternative: decide whether to retrieve from lightweight external features of the question itself, using 27 features across 7 groups plus hybrids, with no model calls. Across 6 QA datasets it matches complex LLM-uncertainty methods on overall QA performance while delivering large efficiency gains, and it outperforms them on complex questions.
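To make the idea concrete, here is a minimal sketch of a question-only retrieval gate. It is not the paper's implementation: the note does not list the 27 features, so the four surface features, the cue-word list, the toy training set, and the names `question_features`, `train_gate`, and `should_retrieve` are all illustrative assumptions. The point it shows is that the decision is made from the question text alone, by a small learned classifier, with no call to the LLM.

```python
import math
import re

# Hypothetical cue words that may hint at multi-step questions (an assumption,
# not the paper's feature set).
MULTI_HOP_CUES = {"and", "both", "before", "after", "same", "which", "whose"}


def question_features(question: str) -> list[float]:
    """Map a question to a small vector of surface features (illustrative only)."""
    tokens = re.findall(r"[A-Za-z0-9']+", question)
    # Capitalized tokens after the first word: a crude proxy for named entities.
    caps = sum(1 for t in tokens[1:] if t[0].isupper())
    cues = sum(1 for t in tokens if t.lower() in MULTI_HOP_CUES)
    has_year = 1.0 if re.search(r"\b(1[5-9]|20)\d{2}\b", question) else 0.0
    # Leading 1.0 is the bias term; length is scaled to keep features comparable.
    return [1.0, len(tokens) / 10.0, float(caps), float(cues), has_year]


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def train_gate(questions, labels, lr=0.5, epochs=300):
    """Fit logistic-regression weights with full-batch gradient descent."""
    X = [question_features(q) for q in questions]
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, y in zip(X, labels):
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x))) - y
            for j, xj in enumerate(x):
                grad[j] += err * xj / len(X)
        w = [wi - lr * gj for wi, gj in zip(w, grad)]
    return w


def should_retrieve(question: str, w, threshold: float = 0.5) -> bool:
    """Decide from question features alone; no LLM call is involved."""
    z = sum(wi * xi for wi, xi in zip(w, question_features(question)))
    return sigmoid(z) >= threshold


# Toy labels (1 = retrieval helped, 0 = model could answer unaided); made up
# for illustration.
TRAIN = [
    ("Who directed the film that won Best Picture in 1998 and where was the director born?", 1),
    ("Which river flows through both Vienna and Budapest?", 1),
    ("What was the population of Reykjavik after 2010?", 1),
    ("What color is the sky?", 0),
    ("How many days are in a week?", 0),
    ("What is two plus two?", 0),
]
weights = train_gate([q for q, _ in TRAIN], [y for _, y in TRAIN])
```

Calling `should_retrieve(question, weights)` then costs a regex pass and a dot product per question. In practice the labels would presumably come from comparing QA accuracy with and without retrieval on held-out questions, and a real feature set would be far richer than this one.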

The keeper is the relocation of the retrieval-trigger signal from the model's interior to the question's surface, and the demonstration that the two signal sources are complementary. This puts the paper in direct dialogue, and partial tension, with "Can simple uncertainty estimates beat complex adaptive retrieval?", which argued that uncertainty estimation beats heuristic triggers cheaply. The resolution is that the "heuristics" here are not crude rules but learned external-feature predictors that rival uncertainty at lower cost and win on hard queries. It also sharpens the choice flagged by "When should retrieval happen during model generation?": uncertainty gating and external-feature gating are alternative, combinable trigger sources, not a settled hierarchy.
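One way to picture "combinable" is a cascade: trust the cheap question-feature score when it is decisive, and fall back to the costlier uncertainty estimate only in the ambiguous middle. The sketch below is an assumption for illustration, not a scheme described in the source; `cascade_gate`, the `low`/`high` band, and the `uncertainty_fn` callback (standing in for any LLM-based uncertainty estimator) are hypothetical.

```python
from typing import Callable


def cascade_gate(
    feature_prob: float,
    uncertainty_fn: Callable[[], float],
    low: float = 0.3,
    high: float = 0.7,
    uncertainty_threshold: float = 0.5,
) -> bool:
    """Return True if retrieval should be triggered.

    feature_prob: retrieval probability from a question-feature classifier.
    uncertainty_fn: lazily computes a model-uncertainty score in [0, 1];
        it is only called when the feature score is inconclusive.
    """
    if feature_prob >= high:
        return True   # features alone say "retrieve"
    if feature_prob <= low:
        return False  # features alone say "skip retrieval"
    # Ambiguous band: pay for the uncertainty estimate.
    return uncertainty_fn() >= uncertainty_threshold
```

Passing the uncertainty estimator as a callback keeps the expensive model call off the common path, which is the efficiency argument the note attributes to external-feature gating.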

Related concepts in this collection 2

Can simple uncertainty estimates beat complex adaptive retrieval? Does measuring a language model's own confidence on token probabilities outperform expensive multi-call adaptive retrieval pipelines? This matters because it could simplify RAG systems while reducing computational overhead.
in tension: this note shows learned external features rival uncertainty estimation at lower cost and win on complex questions
When should retrieval happen during model generation? Explores whether retrieval should occur continuously, at fixed intervals, or only when the model signals uncertainty. Standard RAG retrieves once; long-form generation requires dynamic triggering based on confidence signals.
external-feature gating is an alternative, combinable trigger source to uncertainty gating
