The release of GPT-5.6 brings renewed scrutiny to the volatility of inference costs and speeds. Early tests show inconsistent latency across different prompt lengths. This instability complicates deployment for developers relying on predictable token delivery. The update offers incremental improvements, but the erratic performance suggests OpenAI has not yet solved the scaling bottleneck.