Google DeepMind launched Nano Banana 2 Lite and Gemini Omni Flash for immediate developer use. These lightweight models prioritize low-latency inference and reduced memory footprints. They enable complex multimodal tasks on edge devices without cloud reliance. Practitioners can now deploy faster, smaller models for real-time applications without sacrificing core reasoning capabilities.