Deploying high-performance AI models for Windows applications on NVIDIA RTX AI PCs

Today, Microsoft is making Windows ML available to developers. Windows ML enables C#, C++, and Python developers to run AI models efficiently across PC hardware, including CPUs, NPUs, and GPUs. On NVIDIA RTX GPUs, it uses the NVIDIA TensorRT for RTX Execution Provider (EP), which leverages the GPU's Tensor Cores and architectural advances such as FP8 and FP4 to provide the fastest AI inference performance on Windows-based RTX AI PCs.

“Windows ML unlocks full TensorRT acceleration for GeForce RTX and RTX Pro GPUs and delivers excellent AI performance in Windows 11.” “We generally look forward to being able to build and deploy powerful AI experiences at scale.”