A flood of flagship releases including Gemma 4 and DeepSeek V4 hit the open-weights scene this month. These models join Kimi K2.6 and GLM-5.1 in a dense release cycle. The surge tests the limits of current evaluation frameworks. Practitioners must now benchmark these diverse architectures against CAISI's V4 assessment to determine real-world utility.