AI Agent – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-03-03T23:45:44Z https://developer.nvidia.com/blog/feed/ Aditi Bodhankar <![CDATA[Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications]]> https://developer.nvidia.com/blog/?p=96562 2025-03-03T17:22:13Z 2025-03-03T17:22:09Z Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo...]]> Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo...Decorative image of the guardrail process.

Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo Guardrails offers robust protection with AI guardrails for content safety, topic control, jailbreak detection, and more to evaluate and optimize guardrail performance. In this post, we explore techniques for measuring and optimizing your AI…

Source

]]>
0
Mehran Maghoumi <![CDATA[Build an AI Agent with Expert Reasoning Capabilities Using the DeepSeek-R1 NIM]]> https://developer.nvidia.com/blog/?p=96030 2025-02-28T20:23:54Z 2025-02-28T20:23:51Z AI agents are transforming business operations by automating processes, optimizing decision-making, and streamlining actions. Their effectiveness hinges on...]]> AI agents are transforming business operations by automating processes, optimizing decision-making, and streamlining actions. Their effectiveness hinges on...

AI agents are transforming business operations by automating processes, optimizing decision-making, and streamlining actions. Their effectiveness hinges on expert reasoning, enabling smarter planning and efficient execution. Agentic AI applications could benefit from the capabilities of models such as DeepSeek-R1. Built for solving problems that require advanced AI reasoning…

Source

]]>
0
Anu Srivastava <![CDATA[Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs]]> https://developer.nvidia.com/blog/?p=96519 2025-02-28T17:13:38Z 2025-02-26T22:05:00Z Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...]]> Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...An image of a phone with a chatbot dialog on the screen but also showing the inside of the phone.

Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical for the current resource constraints that many companies have. The rise of small language models (SLMs) bridge quality and cost by creating models with a smaller resource footprint. SLMs are a subset of language models that tend to…

Source

]]>
0
Shubham Agrawal <![CDATA[Vision Language Model Prompt Engineering Guide for Image and Video Understanding]]> https://developer.nvidia.com/blog/?p=96229 2025-02-26T16:25:37Z 2025-02-26T16:25:34Z Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...]]> Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...A GIF of a warehouse with people walking around.

Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual understanding to large language models (LLMs) through the use of a vision encoder. These initial VLMs were limited in their abilities, only able to understand text and single image inputs. Fast-forward a few years and VLMs are now capable of…

Source

]]>
0
Mark Ren <![CDATA[Configurable Graph-Based Task Solving with the Marco Multi-AI Agent Framework for Chip Design]]> https://developer.nvidia.com/blog/?p=96209 2025-02-25T22:17:31Z 2025-02-25T22:17:28Z Chip and hardware design presents numerous challenges stemming from its complexity and advancing technologies. These challenges result in longer turn-around...]]> Chip and hardware design presents numerous challenges stemming from its complexity and advancing technologies. These challenges result in longer turn-around...A picture of a computer chip.

Chip and hardware design presents numerous challenges stemming from its complexity and advancing technologies. These challenges result in longer turn-around time (TAT) for optimizing performance, power, area, and cost (PPAC) during synthesis, verification, physical design, and reliability loops. Large language models (LLMs) have shown a remarkable capacity to comprehend and generate natural…

Source

]]>
0
Joanne Chang <![CDATA[Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025]]> https://developer.nvidia.com/blog/?p=96193 2025-02-20T15:50:53Z 2025-02-20T17:00:00Z Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.]]> Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.

Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.

Source

]]>
0
Terry Chen <![CDATA[Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling]]> https://developer.nvidia.com/blog/?p=95998 2025-02-20T15:56:57Z 2025-02-12T18:00:00Z As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...]]> As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...Mixture of experts icons for attention kernels.

As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is emerging. Also known as AI reasoning or long-thinking, this technique improves model performance by allocating additional computational resources during inference to evaluate multiple possible outcomes and then selecting the best one…

Source

]]>
2
Chris Krapu <![CDATA[Lessons Learned from Building an AI Sales Assistant]]> https://developer.nvidia.com/blog/?p=95231 2025-02-06T19:34:04Z 2025-01-21T20:34:41Z At NVIDIA, the Sales Operations team equips the Sales team with the tools and resources needed to bring cutting-edge hardware and software to market. Managing...]]> At NVIDIA, the Sales Operations team equips the Sales team with the tools and resources needed to bring cutting-edge hardware and software to market. Managing...Decorative image of an AI sales assistant workflow with icons.

At NVIDIA, the Sales Operations team equips the Sales team with the tools and resources needed to bring cutting-edge hardware and software to market. Managing this across NVIDIA’s diverse technology is a complex challenge shared by many enterprises. Through collaboration with our Sales team, we found that they rely on internal and external documentation…

Source

]]>
1
Aditi Bodhankar <![CDATA[How to Safeguard AI Agents for Customer Service with NVIDIA NeMo Guardrails]]> https://developer.nvidia.com/blog/?p=94928 2025-02-04T19:53:15Z 2025-01-16T14:00:00Z AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and...]]> AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and...

AI agents present a significant opportunity for businesses to scale and elevate customer service and support interactions. By automating routine inquiries and enhancing response times, these agents improve efficiency and customer satisfaction, helping organizations stay competitive. However, alongside these benefits, AI agents come with risks. Large language models (LLMs) are vulnerable to…

Source

]]>
0
Samuel Ochoa <![CDATA[Build a Video Search and Summarization Agent with NVIDIA AI Blueprint]]> https://developer.nvidia.com/blog/?p=86011 2025-02-13T20:44:57Z 2025-01-07T04:20:00Z This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...]]> This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...Decorative image of icons and a molecular structure in green.

This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications and their development workflow are typically built on fixed-function, limited models that are designed to detect and identify only a select set of predefined objects. With generative AI, NVIDIA NIM microservices…

Source

]]>
2
Chintan Patel <![CDATA[Llama Nemotron Models Accelerate Agentic AI Workflows with Accuracy and Efficiency]]> https://developer.nvidia.com/blog/?p=94595 2025-01-09T19:23:09Z 2025-01-07T03:40:00Z Agentic AI, the next wave of generative AI, is a paradigm shift with the potential to revolutionize industries by enabling AI systems to act autonomously and...]]> Agentic AI, the next wave of generative AI, is a paradigm shift with the potential to revolutionize industries by enabling AI systems to act autonomously and...Llama Nemotron icon with a picture of Jensen Huang's avatar.

Agentic AI, the next wave of generative AI, is a paradigm shift with the potential to revolutionize industries by enabling AI systems to act autonomously and achieve complex goals. Agentic AI combines the power of large language models (LLMs) with advanced reasoning and planning capabilities, opening a world of possibilities across industries, from healthcare and finance to manufacturing and…

Source

]]>
0
Joseph Lucas <![CDATA[Sandboxing Agentic AI Workflows with WebAssembly]]> https://developer.nvidia.com/blog/?p=93975 2024-12-16T21:06:56Z 2024-12-16T20:33:46Z Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...]]> Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...

Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this code should be sanitized and executed in a safe environment to mitigate risks from prompt injection and errors in the returned code. Sanitizing Python with regular expressions and restricted runtimes is insufficient…

Source

]]>
0
Zenodia Charpy <![CDATA[Build Your First Human-in-the-Loop AI Agent with NVIDIA NIM]]> https://developer.nvidia.com/blog/?p=91339 2024-12-12T19:38:38Z 2024-11-21T22:45:13Z AI agents powered by large language models (LLMs) help organizations streamline and reduce manual workloads. These agents use multilevel, iterative reasoning to...]]> AI agents powered by large language models (LLMs) help organizations streamline and reduce manual workloads. These agents use multilevel, iterative reasoning to...

AI agents powered by large language models (LLMs) help organizations streamline and reduce manual workloads. These agents use multilevel, iterative reasoning to analyze problems, devise solutions, and execute tasks with various tools. Unlike traditional chatbots, LLM-powered agents automate complex tasks by effectively understanding and processing information. To avoid potential risks in specific…

Source

]]>
20
Trisha Tripathi <![CDATA[Expanding AI Agent Interface Options with 2D and 3D Digital Human Avatars]]> https://developer.nvidia.com/blog/?p=91882 2024-11-14T17:10:33Z 2024-11-14T00:53:23Z When interfacing with generative AI applications, users have multiple communication options—text, voice, or through digital avatars.  Traditional chatbot...]]> When interfacing with generative AI applications, users have multiple communication options—text, voice, or through digital avatars.  Traditional chatbot...

When interfacing with generative AI applications, users have multiple communication options—text, voice, or through digital avatars. Traditional chatbot or copilot applications have text interfaces where users type in queries and receive text-based responses. For hands-free communication, speech AI technologies like automatic speech recognition (ASR) and text-to-speech (TTS) facilitate…

Source

]]>
1
Samuel Ochoa <![CDATA[Build Multimodal Visual AI Agents Powered by NVIDIA NIM]]> https://developer.nvidia.com/blog/?p=90989 2024-11-14T19:40:37Z 2024-10-31T20:20:01Z The exponential growth of visual data—ranging from images to PDFs to streaming videos—has made manual review and analysis virtually impossible....]]> The exponential growth of visual data—ranging from images to PDFs to streaming videos—has made manual review and analysis virtually impossible....Decorative image.

The exponential growth of visual data—ranging from images to PDFs to streaming videos—has made manual review and analysis virtually impossible. Organizations are struggling to transform this data into actionable insights at scale, leading to missed opportunities and increased risks. To solve this challenge, vision-language models (VLMs) are emerging as powerful tools…

Source

]]>
0
Charu Chaubal <![CDATA[Enhanced Security and Streamlined Deployment of AI Agents with NVIDIA AI Enterprise]]> https://developer.nvidia.com/blog/?p=90647 2024-11-27T18:39:53Z 2024-10-29T16:00:00Z AI agents are emerging as the newest way for organizations to increase efficiency, improve productivity, and accelerate innovation. These agents are more...]]> AI agents are emerging as the newest way for organizations to increase efficiency, improve productivity, and accelerate innovation. These agents are more...NVIDIA AI Enterprise use cases as cards on a black background, with the logo in front.

AI agents are emerging as the newest way for organizations to increase efficiency, improve productivity, and accelerate innovation. These agents are more advanced than prior AI applications, with the ability to autonomously reason through tasks, call out to other tools, and incorporate both enterprise data and employee knowledge to produce valuable business outcomes. They’re being embedded into…

Source

]]>
0
Aaron Erickson <![CDATA[Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy]]> https://developer.nvidia.com/blog/?p=88729 2025-02-17T05:11:15Z 2024-09-17T14:30:00Z For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...]]> For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...Decorative image of a robot next to several NVIDIA icons.

For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power, networking, and even such benign things like fan replacement cycles all must be managed effectively and governed well in accelerated computing data centers. Managing all of this requires an accelerated understanding of the petabytes of telemetry data…

Source

]]>
7
Tianna Nguy <![CDATA[Hands-On Training at NVIDIA AI Summit in Washington, DC]]> https://developer.nvidia.com/blog/?p=88598 2024-09-05T17:57:08Z 2024-09-04T17:47:42Z Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington D.C. on October 7, 2024.]]> Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington D.C. on October 7, 2024.Image of a person taking a hands-on lab at GTC

Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington D.C. on October 7, 2024.

Source

]]>
0
Joanne Chang <![CDATA[Webinar: Build Visual AI Agents With Generative AI and NVIDIA NIM]]> https://developer.nvidia.com/blog/?p=87551 2024-08-22T18:24:51Z 2024-08-19T15:00:00Z Learn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.]]> Learn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.

Learn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.

Source

]]>
0
Hayden Wolff <![CDATA[Building AI Agents with NVIDIA NIM Microservices and LangChain]]> https://developer.nvidia.com/blog/?p=86543 2024-10-28T21:55:34Z 2024-08-07T16:00:00Z NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a...]]> NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a...Image of a person standing in front of an AI kiosk in a retail location.

NVIDIA NIM, part of NVIDIA AI Enterprise, now supports tool-calling for models like Llama 3.1. It also integrates with LangChain to provide you with a production-ready solution for building agentic workflows. NIM microservices provide the best performance for open-source models such as Llama 3.1 and are available to test for free from NVIDIA API Catalog in LangChain applications.

Source

]]>
0