A 100x difference in inference efficiency can occur based solely on the software environment and contextual documents provided to a model. Researchers including Neil Thompson found that these scaffolds often influence price-performance more than the underlying model choice. This interaction varies by task, meaning a single scaffold rarely optimizes all models equally.