Deep dive

Feb 25, 2025
Configurable Graph-Based Task Solving with the Marco Multi-AI Agent Framework for Chip Design
Chip and hardware design presents numerous challenges stemming from its complexity and advancing technologies. These challenges result in longer turn-around...
8 MIN READ

Feb 25, 2025
Defining LLM Red Teaming
There is an activity where people provide inputs to generative AI technologies, such as large language models (LLMs), to see if the outputs can be made to...
10 MIN READ

Feb 25, 2025
Agentic Autonomy Levels and Security
Agentic workflows are the next evolution in AI-powered tools. They enable developers to chain multiple AI models together to perform complex activities, enable...
14 MIN READ

Feb 25, 2025
NVIDIA cuDSS Advances Solver Technologies for Engineering and Scientific Computing
NVIDIA cuDSS is a first-generation sparse direct solver library designed to accelerate engineering and scientific computing. cuDSS is increasingly adopted in...
12 MIN READ

Feb 24, 2025
NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell
The release of NVIDIA Video Codec SDK 13.0 marks a significant upgrade, adding support for the latest-generation NVIDIA Blackwell GPUs. This version brings a...
10 MIN READ

Feb 20, 2025
Transforming Product Design Workflows in Manufacturing with Generative AI
Traditional design and engineering workflows in the manufacturing industry have long been characterized by a sequential, iterative approach that is often...
7 MIN READ

Feb 12, 2025
Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling
As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...
6 MIN READ

Feb 11, 2025
NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance
In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...
7 MIN READ

Feb 10, 2025
NVIDIA Grace CPU Integrates with the Arm Software Ecosystem
The NVIDIA Grace CPU is transforming data center design by offering a new level of power-efficient performance. Built specifically for data center scale, the...
6 MIN READ

Feb 05, 2025
Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM
Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and...
8 MIN READ

Jan 31, 2025
Dynamic Loading in the CUDA Runtime
Historically, the GPU device code is compiled alongside the application with offline tools such as nvcc. In this case, the GPU device code is managed internally...
8 MIN READ

Jan 29, 2025
Accelerating JSON Processing on Apache Spark with GPUs
JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...
9 MIN READ

Jan 29, 2025
Mastering LLM Techniques: Evaluation
Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...
12 MIN READ

Jan 24, 2025
Dynamic Memory Compression
Despite the success of large language models (LLMs) as general-purpose AI tools, their high demand for computational resources make their deployment challenging...
9 MIN READ

Jan 13, 2025
Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator
In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...
5 MIN READ

Jan 13, 2025
Evaluating GenMol as a Generalist Foundation Model for Molecular Generation
Traditional computational drug discovery relies almost exclusively on highly task-specific computational models for hit identification and lead optimization....
8 MIN READ