A score of 1,618 on the GDPval-AA v2 test puts Claude Sonnet 5 ahead of the larger Opus 4.8. Anthropic designed the model to beat its predecessor across all benchmarks. It intentionally scores low on cybersecurity tasks to avoid government blocks. This provides a cheaper, high-performance alternative for knowledge-heavy workflows.