The GPT 5.6 Preview system card uses a Multi Select Virology Troubleshooting benchmark to measure biological risk. This analysis focuses on the Pareto frontier, weighing task success against the number of tokens consumed. Practitioners can use this resource-efficiency metric to identify when model capabilities scale dangerously relative to compute costs.