A series of 17 game attempts on OpenRouter tested the open-weights GLM 5.2 against Gemini 3 Flash. The evaluator tracked specific achievements across five text adventures under a strict $0.15 budget per attempt. This incremental benchmark gives only a narrow look at model reasoning, so it offers limited utility for judging general performance scaling.
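
To make the budget constraint concrete, here is a minimal sketch of how a per-attempt spending cap and achievement tracking might be modelled. It is not the evaluator's actual code: the `Attempt` class, its `charge` and `unlock` methods, the `max_total_spend` helper, and the game and achievement names are all hypothetical. Only the $0.15 cap and the count of 17 attempts come from the text, and treating 17 as the total number of attempts across both models is an assumption.

```python
from dataclasses import dataclass, field

BUDGET_USD = 0.15   # per-attempt cap stated in the text
N_ATTEMPTS = 17     # assumed to be the total across both models


@dataclass
class Attempt:
    """One game attempt with a hard spending cap (hypothetical structure)."""
    game: str
    model: str
    budget_usd: float = BUDGET_USD
    spent_usd: float = 0.0
    achievements: set = field(default_factory=set)

    def charge(self, cost_usd: float) -> bool:
        """Record the cost of one model call.

        Returns True while the attempt is still under budget,
        False once the cap has been reached or exceeded.
        """
        self.spent_usd += cost_usd
        return self.spent_usd < self.budget_usd

    def unlock(self, name: str) -> None:
        """Mark an achievement as reached; repeats are ignored."""
        self.achievements.add(name)


def max_total_spend(n_attempts: int, budget_usd: float) -> float:
    """Upper bound on total spend if every attempt uses its full budget."""
    return round(n_attempts * budget_usd, 2)


# Illustrative run with made-up per-call costs and a placeholder game name.
attempt = Attempt(game="game_a", model="GLM 5.2")
for call_cost in (0.05, 0.05, 0.05, 0.05):
    if not attempt.charge(call_cost):
        break  # budget exhausted, stop the attempt
    attempt.unlock("placeholder_achievement")

ceiling = max_total_spend(N_ATTEMPTS, BUDGET_USD)
```

Under these assumptions the attempt stops on the third call, once spending reaches the cap, and the whole series costs at most 17 × $0.15 = $2.55. A cap this tight limits how many model calls each attempt can make, which is one reason the results give only a narrow view.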