NVIDIA Announces Record Adoption of Its New Turing T4 Cloud GPU

“Just 60 days after the T4’s launch, it’s now available in the cloud and is supported by a worldwide network of server makers. We have never before seen such rapid adoption of a datacenter processor,” said Ian Buck, vice president and general manager of Accelerated Computing at NVIDIA.

The NVIDIA T4 accelerates diverse cloud workloads, including high performance computing (HPC), deep learning training and inference, machine learning, data analytics, and graphics. Based on the new NVIDIA Turing architecture, it features multi-precision Turing Tensor Cores and new RT Cores, which, when combined with accelerated containerized software stacks, would deliver unprecedented performance at scale.

Designed to meet the unique needs of scale-out public and enterprise cloud environments, T4 would maximize throughput, utilization and user concurrency, helping customers “efficiently” address exploding user and data growth.

Roughly the size of a candy bar, the low-profile 70-watt T4 GPU would have the flexibility to fit into a standard server or any Open Compute Project (OPC) hyperscale server design. Server designs can range from a single T4 GPU all the way up to 20 GPUs in a single node.

AI Workloads

T4’s multi-precision capabilities would power breakthrough Artificial Intelligence performance for a wide range of AI workloads at four different levels of precision, offering 8.1 TFLOPS at FP32, 65 TFLOPS at FP16 as well as 130 TOPS of INT8 and 260 TOPS of INT4. For AI inference workloads, a server with two T4 GPUs can replace 54 CPU-only servers. For AI training, a server with two T4 GPUs can replace nine dual-socket CPU-only servers.

“Real-time visualization and online inference workloads need low latency for their end users. We are delighted to partner with NVIDIA to offer T4 GPU support for Google Cloud customers,” said Damion Heredia, senior director of Product Management at Google Cloud. “NVIDIA T4 GPUs for Google Cloud offer a highly scalable, cost-effective, low-latency platform for our ML and visualization customers. Google Cloud’s network capabilities together with the T4 offering enable customers to innovate in new ways, speeding up applications while reducing costs.”

Among server companies featuring the T4 are Dell EMC, Hewlett Packard Enterprise, IBM, Lenovo and Supermicro.