Does the timing of AI feedback relative to user reasoning change its effectiveness?
This reads the question as: when AI offers feedback *during* a person's reasoning (interrupting mid-thought) versus *after* it (or framed as a prompt to think rather than an answer), does the same content land differently — and the corpus says yes, sharply.
This explores whether *when* and *how* AI feedback arrives — mid-reasoning vs. after, as an answer vs. as a question — changes its effect, separate from whether the feedback is correct. The collection's most direct answer is that timing carries a hidden cost. AI suggestions injected while someone is mid-thought sever what one note calls cognitive immersion: even *correct* interventions degrade performance because the user has to rebuild focus before continuing, and evaluations that only score the accuracy of a single suggestion miss this entirely Does AI assistance always help reasoning or does it carry hidden costs?. So the same true statement can help or hurt depending on whether it lands in a gap or in the middle of a reasoning stride.
The corpus then suggests the *form* of feedback matters as much as its moment — and the two are linked, because a question defers its effect to when the user chooses to engage it, while an answer imposes itself immediately. A lab study of thinking assistants found that combining reflection questions with advice beat agents that only advised, only questioned, or did neither: Socratic prompts that hand reasoning back to the user outperformed authoritative answers delivered up front Do reflection questions help people make better decisions with AI?. The same instinct shows up in 'learning to guide' rather than 'learning to defer,' where machines supply interpretive guidance — pointing at which features matter — instead of issuing a decision the user anchors onto. That reframing eliminated anchoring bias precisely because it doesn't pre-empt the human's own reasoning step Can AI guidance reduce anchoring bias better than AI decisions?.
There's a striking parallel on the *machine's* side of reasoning, which is where the question gets genuinely surprising. When models train under reinforcement learning, plain numerical rewards arrive only at the end and carry no information about *why* a path failed — and models stall on plateaus. Swapping in chain-of-thought critiques (feedback that engages the reasoning trace itself, not just the final verdict) breaks those plateaus Can natural language feedback overcome numerical reward plateaus?. So both for humans and models, feedback that touches the reasoning *process* outperforms feedback delivered as a terminal judgment — a cross-domain echo of the reflection-question result.
Two more notes complicate the naive 'feedback always helps' view by showing reasoning has its own internal timing. Accuracy follows an inverted-U with thinking length — models overthink easy problems and underthink hard ones, and accuracy collapsed from 87% to 70% when thinking tokens ballooned Does more thinking time always improve reasoning accuracy?, Why does chain of thought accuracy eventually decline with length?. If reasoning itself has a sweet spot, then feedback that extends or interrupts it can push a reasoner off that peak — making *when* you intervene a variable that interacts with how much reasoning was already optimal.
The thread running through all of this: the field is shifting from scoring feedback by its content alone to scoring it by how it fits the reasoner's flow and stage. The thing you might not have known you wanted to know is that a correct AI suggestion and a well-timed AI question are not the same intervention — and the badly-timed correct one can be the worse of the two.
Sources 6 notes
Well-intentioned AI suggestions can damage reasoning performance by severing cognitive immersion, forcing users to rebuild focus before continuing. Evaluation must measure flow preservation across entire tasks, not just local suggestion accuracy.
A lab study of 80 participants found that thinking assistants combining reflection questions with advice significantly outperformed agents that only advised, only questioned, or did neither. Prioritizing Socratic questioning over authoritative answers enhanced cognitive outcomes.
Learning to Guide eliminates anchoring bias and unassisted hard cases by having machines supply interpretive guidance rather than autonomous decisions, keeping responsibility with humans while improving their judgment through enhanced perception.
Critique-GRPO shows that models stuck on performance plateaus can generate correct solutions when given chain-of-thought critiques, revealing that numerical rewards lack critical information about why failures occur and how to improve.
Increasing thinking tokens from ~1,100 to ~16K reduced benchmark accuracy from 87.3% to 70.3%, revealing a non-monotonic relationship where models overthink easy problems and underthink hard ones.
Task accuracy peaks at intermediate CoT length, with optimal length increasing alongside task difficulty but decreasing with model capability. RL training naturally gravitates toward shorter chains as models improve, revealing that simplicity emerges from reward signals rather than explicit training.