SYNTHESIS NOTE
Reasoning, Retrieval, and Evaluation

Can RAG systems refuse to answer without reliable evidence?

Explores whether retrieval-augmented generation can be designed to abstain from answering when sources are corrupted or insufficient, rather than filling gaps with plausible-sounding guesses. This matters for historical text where OCR errors and language drift are common.

Synthesis note · 2026-05-03
Where do retrieval systems fail and why?

A hybrid multilingual RAG system for question answering over noisy historical newspapers handles two kinds of corruption that modern RAG benchmarks largely ignore: OCR errors that scramble surface text and language drift where vocabulary and orthography shift across centuries within the same corpus. Its defense against both is structural rather than denoising. The pipeline uses semantic query expansion to widen what counts as a match, multi-query retrieval with Reciprocal Rank Fusion to consolidate evidence across query variants, and — most importantly — a grounded generation prompt that only produces answers when evidence is actually retrieved.

The grounded-refusal step is what distinguishes this from a typical noisy-RAG approach. When sources are corrupted, the temptation is for the generator to fill in the gaps from prior knowledge, which produces plausible-sounding but ungrounded answers. The grounded prompt makes refusal the default when retrieval fails, which preserves the integrity of the answer at the cost of coverage. Combined with the semantic and multi-query expansion that improves recall on degraded text, the system trades hallucination for honest "I cannot find this" responses. The cost of this trade is real: Does reasoning fine-tuning make models worse at declining to answer? shows that recent training trends actively work against this kind of refusal posture.

The general principle is that corruption-tolerant RAG should expand retrieval aggressively while constraining generation conservatively — recall up, but only generate when grounded. This inverts the implicit policy of most RAG systems, which is to retrieve narrowly and generate freely. For high-noise corpora the inversion is the correct trade.

Inquiring lines that use this note as a source 70

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 4

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
17 direct connections · 139 in 2-hop network ·medium cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

grounded generation that refuses to answer without evidence is the noise-tolerant RAG primitive — OCR errors and language drift do not justify confabulation