A score of 1,618 on the GDPval-AA v2 benchmark puts Claude Sonnet 5 ahead of the larger Opus 4.8, narrowing the performance gap between mid-tier and premium models. Anthropic designed the model to beat its predecessor across all benchmarks. Its cybersecurity capabilities, however, remain intentionally limited to avoid government blocks.