A Stanford study finds that multi-agent systems often derive their performance gains from increased compute rather than inherent architectural superiority. The researchers identified specific exceptions where agent collaboration actually adds value. This finding warns developers that scaling agent counts may simply mask inefficient resource use. Practitioners should benchmark single-model alternatives before deploying complex agentic workflows.