Smart Cities / Spaces

Feb 26, 2025
Vision Language Model Prompt Engineering Guide for Image and Video Understanding
Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ

Feb 20, 2025
Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025
Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.
1 MIN READ

Feb 13, 2025
Upcoming Webinar: Unlocking Video Analytics With AI Agents
Master prompt engineering, fine-tuning, and customization to build video analytics AI agents.
1 MIN READ

Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ

Dec 19, 2024
AI Vision Helps Green Recycling Plants
Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ

Dec 09, 2024
Just Released: NVIDIA VILA VLM
Now available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.
1 MIN READ

Dec 03, 2024
Scaling Action Recognition Models with Synthetic Data
Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
11 MIN READ

Dec 03, 2024
Build an Agentic Video Workflow with Video Search and Summarization
Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
11 MIN READ

Oct 31, 2024
Build Multimodal Visual AI Agents Powered by NVIDIA NIM
The exponential growth of visual data—ranging from images to PDFs to streaming videos—has made manual review and analysis virtually impossible....
11 MIN READ

Oct 29, 2024
AI-Powered Devices Track Howls to Save Wolves
A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ

Oct 25, 2024
NVIDIA Showcases the Future of Intelligent Robots at CoRL 2024
From humanoids to policy, explore the work NVIDIA is bringing to the robotics community.
1 MIN READ

Oct 24, 2024
Powering the Next Wave of AI Robotics with Three ComputersÂ
NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ

Oct 14, 2024
AI Research Revs Up EV Charging for Large-Scale Optimization, Speed, and Savings
Electric vehicle (EV) charging is getting a jolt with an innovative new AI algorithm that boosts efficiency, reduces cost, and keeps the grid from...
4 MIN READ

Oct 07, 2024
Generate Image and Text Embeddings with NV-CLIP
NV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.
1 MIN READ

Sep 30, 2024
Improve Reinforcement Learning from Human Feedback with Leaderboard-Topping Reward Model
Llama 3.1 Nemotron 70B Reward model helps generate high-quality training data that aligns with human preferences for finance, retail, healthcare, scientific...
1 MIN READ

Aug 28, 2024
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
13 MIN READ