The GRASP planner optimizes trajectories by lifting them into virtual states, allowing parallel optimization across time. This approach solves the instability common in gradient-based planning for learned dynamics. By adding stochasticity directly to the process, it enables agents to navigate complex environments over longer horizons. Practitioners can now scale world model planning without exponential compute costs.