Author: Shivam Raj | NVIDIA Technical Blog

Shivam Raj

Shivam Raj is a senior architect in the GPU Architecture group at NVIDIA. He focuses on training and inference performance of data center AI workloads. Shivam holds an M.Sc. in electrical engineering from the University of Southern California.

Posts by Shivam Raj

Generative AI Aug 12, 2024

NVIDIA NVLink and NVIDIA NVSwitch Supercharge Large Language Model Inference

Large language models (LLM) are getting larger, increasing the amount of compute required to process inference requests. To meet real-time latency requirements... 8 MIN READ

Data Center / Cloud Jun 12, 2024

Demystifying AI Inference Deployments for Trillion Parameter Large Language Models

AI is transforming every industry, addressing grand human scientific challenges such as precision drug discovery and the development of autonomous vehicles, as... 14 MIN READ