A new Apple research paper introduces a reviewer agent that evaluates tool-calling trajectories during inference. This method moves error detection into the active execution loop rather than relying on post-hoc analysis. It enables agents to course-correct in real time. Practitioners can now reduce reliance on slow prompt-tuning or retraining to fix tool-selection errors.