Amazon engineers are distilling Anthropic models into smaller, cheaper versions for internal operations. The move comes ahead of a shift to token-based pricing next year, which threatens to drive up expenses. To hedge further against rising costs, the company is also exploring OpenAI as an alternative. The effort highlights the growing struggle among enterprises to balance model performance against inference spend.