The OpenWorldLib framework explicitly excludes text-to-video generators like Sora from its definition of world models. Researchers argue these generators lack the necessary internal state and predictive consistency. This distinction forces a technical pivot. Practitioners must now differentiate between superficial visual mimicry and actual environmental simulation in their architectural designs.