Shivam Raj

Shivam Raj is a senior architect in the GPU Architecture group at NVIDIA. He focuses on training and inference performance of data center AI workloads. Shivam holds an M.Sc. in electrical engineering from the University of Southern California.

Posts by Shivam Raj

Generative AI

NVIDIA NVLink and NVIDIA NVSwitch Supercharge Large Language Model Inference

Large language models (LLMs) are getting larger, increasing the amount of compute required to process inference requests. To meet real-time latency requirements...
8 MIN READ
Data Center / Cloud

Demystifying AI Inference Deployments for Trillion Parameter Large Language Models

AI is transforming every industry, addressing grand human scientific challenges such as precision drug discovery and the development of autonomous vehicles, as...
14 MIN READ