A specialized reviewer agent now evaluates tool-calling trajectories during the execution loop at inference time. This approach moves beyond post-hoc assessments used by Apple researchers to fix errors via retraining. By integrating feedback immediately, agents can correct parameter inaccuracies and tool selection mistakes. Practitioners gain a mechanism for real-time error recovery.