Apple developed STARFlow-V to challenge the dominance of diffusion models in video generation. This architecture utilizes normalizing flows for end-to-end learning and native likelihood estimation. It offers more robust causal prediction than current standards. Researchers can now explore likelihood-based alternatives to diffusion for complex spatiotemporal data, though practical scalability remains an open question.