Deep dive

Feb 25, 2025

Configurable Graph-Based Task Solving with the Marco Multi-AI Agent Framework for Chip Design

Chip and hardware design presents numerous challenges stemming from its complexity and advancing technologies. These challenges result in longer turn-around...

8 MIN READ

Feb 25, 2025

Defining LLM Red Teaming

There is an activity where people provide inputs to generative AI technologies, such as large language models (LLMs), to see if the outputs can be made to...

10 MIN READ

Feb 25, 2025

Agentic Autonomy Levels and Security

Agentic workflows are the next evolution in AI-powered tools. They enable developers to chain multiple AI models together to perform complex activities, enable...

14 MIN READ

Feb 25, 2025

NVIDIA cuDSS Advances Solver Technologies for Engineering and Scientific Computing

NVIDIA cuDSS is a first-generation sparse direct solver library designed to accelerate engineering and scientific computing. cuDSS is increasingly adopted in...

12 MIN READ

A person looking over an AV equipment bank.

Feb 24, 2025

NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell

The release of NVIDIA Video Codec SDK 13.0 marks a significant upgrade, adding support for the latest-generation NVIDIA Blackwell GPUs. This version brings a...

10 MIN READ

Feb 20, 2025

Transforming Product Design Workflows in Manufacturing with Generative AI

Traditional design and engineering workflows in the manufacturing industry have long been characterized by a sequential, iterative approach that is often...

7 MIN READ

Mixture of experts icons for attention kernels.

Feb 12, 2025

Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling

As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...

6 MIN READ

Three icons in a row, including DGX in the middle.

Feb 11, 2025

NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance

In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...

7 MIN READ

Picture of the NVIDIA Grace CPU on a black background.

Feb 10, 2025

NVIDIA Grace CPU Integrates with the Arm Software Ecosystem

The NVIDIA Grace CPU is transforming data center design by offering a new level of power-efficient performance. Built specifically for data center scale, the...

6 MIN READ

Feb 05, 2025

Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM

Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and...

8 MIN READ

Jan 31, 2025

Dynamic Loading in the CUDA Runtime

Historically, the GPU device code is compiled alongside the application with offline tools such as nvcc. In this case, the GPU device code is managed internally...

8 MIN READ

A diagram of how JSON data is processed.

Jan 29, 2025

Accelerating JSON Processing on Apache Spark with GPUs

JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...

9 MIN READ

Jan 29, 2025

Mastering LLM Techniques: Evaluation

Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...

12 MIN READ

Three icons, with text LLMs, Optimize, Deploy.

Jan 24, 2025

Dynamic Memory Compression

Despite the success of large language models (LLMs) as general-purpose AI tools, their high demand for computational resources make their deployment challenging...

9 MIN READ

NVIDIA NeMo Curator icon on a purple background.

Jan 13, 2025

Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator

In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...

5 MIN READ

Jan 13, 2025

Evaluating GenMol as a Generalist Foundation Model for Molecular Generation

Traditional computational drug discovery relies almost exclusively on highly task-specific computational models for hit identification and lead optimization....

8 MIN READ