A wave of flagship releases including Gemma 4, DeepSeek V4, and Kimi K2.6 just hit the open-weights scene. These models arrive alongside the CAISI V4 assessment framework. The sudden density of high-performance releases forces a rapid re-evaluation of open-source benchmarks. Practitioners should prioritize testing these specific versions against proprietary baselines immediately.