Amazon engineers are distilling Anthropic models into smaller, cheaper versions for internal operations. This move preempts a shift to token-based pricing next year that threatens to spike expenses. The company is also vetting OpenAI as an alternative. These efficiency plays show how enterprises are aggressively optimizing model footprints to maintain margins.