Debugging GLM-5 at scale exposed bottlenecks in how coding agents manage long-context state. The team traced them to memory leaks and latency spikes that appear during complex repository traversal. These findings show how far prototype performance can sit from production stability. The next step for engineers is to tighten state management so that agents do not crash during deep code analysis.
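
One common way to keep long-context state from growing without bound during repository traversal is to cap it and evict the least recently used entries. The sketch below is illustrative only and is not the team's implementation: the class name `BoundedStateCache`, the `max_chars` budget, and the choice to measure size in characters are all assumptions made for the example.

```python
from collections import OrderedDict
from typing import Optional


class BoundedStateCache:
    """Hypothetical LRU cache that caps the total characters of retained file context."""

    def __init__(self, max_chars: int) -> None:
        if max_chars <= 0:
            raise ValueError("max_chars must be positive")
        self.max_chars = max_chars  # assumed budget, measured in characters
        self.used_chars = 0
        self._entries: "OrderedDict[str, str]" = OrderedDict()

    def put(self, path: str, content: str) -> None:
        # Drop any previous entry for this path so its size is not double-counted.
        if path in self._entries:
            self.used_chars -= len(self._entries.pop(path))
        # An entry larger than the whole budget is skipped rather than stored.
        if len(content) > self.max_chars:
            return
        self._entries[path] = content
        self.used_chars += len(content)
        # Evict least recently used entries until the cache fits the budget.
        while self.used_chars > self.max_chars:
            _, evicted = self._entries.popitem(last=False)
            self.used_chars -= len(evicted)

    def get(self, path: str) -> Optional[str]:
        content = self._entries.get(path)
        if content is not None:
            # Mark the entry as most recently used.
            self._entries.move_to_end(path)
        return content

    def __len__(self) -> int:
        return len(self._entries)
```

In this sketch an agent would call `put` each time it reads a file and `get` before re-reading one, so memory use stays under a fixed ceiling however deep the traversal goes. The trade-off is that evicted files must be read again, which exchanges some latency for a bounded footprint.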