Can question features alone predict when to retrieve?
Can lightweight external features of a question—rather than expensive model uncertainty checks—reliably decide whether retrieval is needed? This matters because uncertainty-based methods promise efficiency but add computation.
Adaptive retrieval aims to retrieve only when necessary, saving compute and reducing the misinformation RAG can introduce. But the dominant approach frames the decision as LLM-based uncertainty estimation — querying the model's internal states or outputs to gauge confidence — which is expensive and impractical at scale, often eating the very efficiency gains it promises. This paper proposes an LLM-independent alternative: decide whether to retrieve from lightweight external features of the question itself — 27 features across 7 groups, plus hybrids — with no model calls. Across 6 QA datasets it matches complex LLM-uncertainty methods on overall QA while delivering large efficiency gains, and outperforms them on complex questions.
The keeper is the relocation of the retrieval-trigger signal from the model's interior to the question's surface, and the demonstration that the two signal sources are complementary. This puts the paper in direct dialogue — and partial tension — with Can simple uncertainty estimates beat complex adaptive retrieval?, which argued uncertainty estimation beats heuristic triggers cheaply; the resolution is that "heuristics" here are not crude rules but learned external-feature predictors that rival uncertainty at lower cost and win on hard queries. It also sharpens the choice flagged by When should retrieval happen during model generation?: uncertainty-gating and external-feature-gating are alternative, combinable trigger sources, not a settled hierarchy.
Inquiring lines that use this note as a source 7
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- What role does retrieval mechanism design play in forecast accuracy?
- Are uncertainty estimation and external feature signals complementary for retrieval?
- Why do external feature triggers outperform uncertainty on complex questions?
- Can question-only features replace model uncertainty checks at scale?
- What are the 27 external features that predict retrieval need?
- Can retrieval systems decide when to retrieve instead of always querying?
- How can models select the optimal question to ask given multiple uncertainties?
Related concepts in this collection 2
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can simple uncertainty estimates beat complex adaptive retrieval?
Does measuring a language model's own confidence on token probabilities outperform expensive multi-call adaptive retrieval pipelines? This matters because it could simplify RAG systems while reducing computational overhead.
in tension: this note shows learned external features rival uncertainty estimation at lower cost and win on complex questions
-
When should retrieval happen during model generation?
Explores whether retrieval should occur continuously, at fixed intervals, or only when the model signals uncertainty. Standard RAG retrieves once; long-form generation requires dynamic triggering based on confidence signals.
external-feature gating is an alternative, combinable trigger source to uncertainty gating
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- LLM-Independent Adaptive RAG: Let the Question Speak for Itself
- Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home
- Deep Research: A Systematic Survey
- RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism
- Rethinking with Retrieval: Faithful Large Language Model Inference
- Generator-Retriever-Generator: A Novel Approach to Open-domain Question Answering
- R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
- Don't "Overthink" Passage Reranking: Is Reasoning Truly Necessary?
Original note title
adaptive retrieval can decide when to retrieve from lightweight external question features instead of expensive LLM uncertainty estimation