Mirage stores scene information in a latent space rather than in pixel-based point clouds. This design reduces compute time and memory usage while keeping the scene consistent during long camera movements. However, it struggles to track moving objects across segments. Even with that limitation, the approach offers a more efficient path toward persistent 3D environments in Microsoft Research's video models.
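
As a rough illustration of why a latent scene memory can be cheaper than a per-pixel point cloud, the sketch below compares the storage each would need. It is a back-of-the-envelope calculation and not Mirage's actual implementation: the function names, the 8x spatial downsampling, the 16 latent channels, and the frame count and resolution are all hypothetical values chosen for the example.

```python
# Hypothetical memory comparison: per-pixel point cloud vs. latent scene memory.
# All sizes below are illustrative assumptions, not figures from Mirage.

def point_cloud_bytes(frames, height, width, floats_per_point=6, bytes_per_float=4):
    """Storage for one 3D point per pixel (xyz position plus rgb colour)."""
    return frames * height * width * floats_per_point * bytes_per_float


def latent_memory_bytes(frames, height, width, downsample=8, channels=16,
                        bytes_per_float=4):
    """Storage for one latent vector per downsample-by-downsample pixel patch."""
    latent_h = height // downsample
    latent_w = width // downsample
    return frames * latent_h * latent_w * channels * bytes_per_float


# Assumed workload: 100 frames at 512x512 resolution.
frames, height, width = 100, 512, 512
explicit = point_cloud_bytes(frames, height, width)
latent = latent_memory_bytes(frames, height, width)
ratio = explicit / latent

print(f"point cloud:   {explicit / 2**20:.0f} MiB")
print(f"latent memory: {latent / 2**20:.0f} MiB")
print(f"ratio:         {ratio:.0f}x")
```

Under these assumed settings the latent representation is smaller because it stores one vector per patch instead of one point per pixel. The actual saving depends entirely on the downsampling factor and channel count a given model uses.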