Does search scale like reasoning in agentic systems?
Do the test-time scaling laws that govern reasoning tokens also apply to search steps in agentic systems? This note asks whether deep research follows the same compute-performance curve as reasoning. If it does, the number of search steps becomes a new axis for inference-time optimization.
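One way to make the comparison concrete is to fit the same curve to both axes and compare the fitted slopes. The sketch below is only an illustration and rests on an assumption the note does not state: that the compute-performance curve is roughly log-linear, score = a + b * ln(budget), where budget is either reasoning tokens or search steps. The function name `fit_log_linear` and its parameters are hypothetical, and no real measurements are included.

```python
import math


def fit_log_linear(budgets, scores):
    """Ordinary least-squares fit of score = a + b * ln(budget).

    ASSUMPTION: the log-linear form is an illustrative choice, not a
    result taken from the note. `budgets` holds positive compute budgets
    (reasoning tokens or search steps); `scores` holds the task
    performance measured at each budget.
    """
    if len(budgets) != len(scores) or len(budgets) < 2:
        raise ValueError("need at least two (budget, score) pairs")
    if any(b <= 0 for b in budgets):
        raise ValueError("budgets must be positive")

    # Regress the scores on the natural log of the budget.
    xs = [math.log(b) for b in budgets]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(scores) / n

    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("budgets must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, scores))

    slope = sxy / sxx                     # b: gain per e-fold of budget
    intercept = mean_y - slope * mean_x   # a: fitted score at budget = 1
    return intercept, slope
```

Under this assumption, fitting the function once to (reasoning tokens, score) pairs and once to (search steps, score) pairs gives two slopes. If both fits describe the data well, that would be consistent with search following the same kind of curve as reasoning. A poor fit on the search axis would suggest a different functional form.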