OpenAI's Triton language now supports Huawei Ascend NPUs. This integration removes the need for manual CUDA-to-CANN translation when porting kernels. Developers can now write high-performance GPU code that runs on Chinese hardware. It simplifies the deployment pipeline for researchers operating outside the Nvidia ecosystem.