One command now launches a vLLM server on Hugging Face Jobs. This integration removes the manual overhead of configuring infrastructure for high-throughput LLM serving. It targets developers who need rapid prototyping without managing raw GPU clusters. The update streamlines the path from model selection to a live API endpoint for practitioners.