NVIDIA Holoscan Reference Applications

Home
Workflows
Workflows
- Real-Time End-to-end AI Surgical Video
  Real-Time End-to-end AI Surgical Video
Applications
Applications
- Advanced Networking Benchmark
  Advanced Networking Benchmark
- AJA Video Capture
  AJA Video Capture
- An Example of Async Lock-free Buffer with SCHED_DEADLINE
  An Example of Async Lock-free Buffer with SCHED_DEADLINE
- Basic Networking Ping
  Basic Networking Ping
- Basic Pulse Description Word (PDW) Generator
  Basic Pulse Description Word (PDW) Generator
- Body Pose Estimation
  Body Pose Estimation
- Colonoscopy Polyp Segmentation
  Colonoscopy Polyp Segmentation
- CUDA Quantum Variational Quantum Eigensolver (VQE)
  CUDA Quantum Variational Quantum Eigensolver (VQE)
- Dds
  Dds
  - DDS Video: Real-time Video Streaming with RTI Connext
    
    DDS Video: Real-time Video Streaming with RTI Connext
- Deltacast Videomaster Transmitter
  Deltacast Videomaster Transmitter
- Depth Anything V2
  Depth Anything V2
- Distributed
  Distributed
  - Grpc
    Grpc
    
    Distributed Endoscopy Tool Tracking with gRPC Streaming
    
    Distributed Endoscopy Tool Tracking with gRPC Streaming
    
    Distributed H.264 Endoscopy Tool Tracking with gRPC Streaming
    
    Distributed H.264 Endoscopy Tool Tracking with gRPC Streaming
  - Ucx
    Ucx
    
    Distributed H.264 Endoscopy Tool Tracking
    
    Distributed H.264 Endoscopy Tool Tracking
    
    UCX-based Distributed Endoscopy Tool Tracking
    
    UCX-based Distributed Endoscopy Tool Tracking
    
    UCX-based Distributed Endoscopy Tool Tracking (C++)
    
    UCX-based Distributed Endoscopy Tool Tracking (C++)
    
    UCX-based Distributed Endoscopy Tool Tracking (Python)
    
    UCX-based Distributed Endoscopy Tool Tracking (Python)
- Ehr query llm
  Ehr query llm
  - EHR Agent Framework
    
    EHR Agent Framework
  - FHIR Client for Retrieving and Posting FHIR Resources
    
    FHIR Client for Retrieving and Posting FHIR Resources
- Endoscopy Depth Estimation
  Endoscopy Depth Estimation
- Endoscopy Out of Body Detection
  Endoscopy Out of Body Detection
  - Endoscopy Out of Body Detection (C++)
    
    Endoscopy Out of Body Detection (C++)
  - Endoscopy Out of Body Detection (Python)
    
    Endoscopy Out of Body Detection (Python)
- Endoscopy Tool Segmentation from MONAI Model Zoo
  Endoscopy Tool Segmentation from MONAI Model Zoo
- Endoscopy Tool Tracking
  Endoscopy Tool Tracking
  - Endoscopy Tool Tracking (C++)
    
    Endoscopy Tool Tracking (C++)
  - Endoscopy Tool Tracking (Python)
    
    Endoscopy Tool Tracking (Python)
- Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
  Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
- FM Radio Automatic Speech Recognition
  FM Radio Automatic Speech Recognition
- GPU-Accelerated Orthorectification with NVIDIA OptiX
  GPU-Accelerated Orthorectification with NVIDIA OptiX
- Gstreamer
  Gstreamer
  - GStreamer Video Recorder
    
    GStreamer Video Recorder
- H264
  H264
  - H.264 Endoscopy Tool Tracking
    
    H.264 Endoscopy Tool Tracking
  - H.264 Video Decode
    
    H.264 Video Decode
- High Speed Endoscopy
  High Speed Endoscopy
  - High-Speed Endoscopy (C++)
    
    High-Speed Endoscopy (C++)
  - High-Speed Endoscopy (Python)
    
    High-Speed Endoscopy (Python)
- HoloChat
  HoloChat
- Holoscan ros2
  Holoscan ros2
  - Holoscan ROS2 Publisher/Subscriber Examples
    
    Holoscan ROS2 Publisher/Subscriber Examples
  - Holoscan ROS2 VB1940 (Eagle) Camera
    
    Holoscan ROS2 VB1940 (Eagle) Camera
- Holoviz
  Holoviz
  - Holoviz HDR
    
    Holoviz HDR
  - Holoviz sRGB
    
    Holoviz sRGB
  - Holoviz UI
    
    Holoviz UI
  - Holoviz vsync
    
    Holoviz vsync
  - Holoviz YUV
    
    Holoviz YUV
- Hyperspectral Image Segmentation
  Hyperspectral Image Segmentation
- Imaging AI Whole Body Segmentation
  Imaging AI Whole Body Segmentation
- Industrial I/O (IIO) - ADALM-Pluto SDR Integration
  Industrial I/O (IIO) - ADALM-Pluto SDR Integration
- Intel RealSense Camera Visualizer
  Intel RealSense Camera Visualizer
- Isaac Sim Holoscan Bridge
  Isaac Sim Holoscan Bridge
- Laser detection latency
  Laser detection latency
  - EVT Camera Calibration
    
    EVT Camera Calibration
  - Laser Detection
    
    Laser Detection
  - USB Camera Calibration
    
    USB Camera Calibration
- Live Streaming Data Web Dashboard with NATS
  Live Streaming Data Web Dashboard with NATS
- Matlab gpu coder
  Matlab gpu coder
  - Image Processing with MATLAB GPU Coder
    
    Image Processing with MATLAB GPU Coder
  - Ultrasound Beamforming with MATLAB GPU Coder
    
    Ultrasound Beamforming with MATLAB GPU Coder
- Medical Image Viewer in XR
  Medical Image Viewer in XR
  - Operators
    Operators
    
    User interface Control
    
    User interface Control
    
    User interface Control
    
    User interface Control
    
    User interface Render
    
    User interface Render
    
    XrFrame
    
    XrFrame
    
    Convert Depth To Screen Space
    
    Convert Depth To Screen Space
    
    XRBeginFrame
    
    XRBeginFrame
    
    XrEndFrame
    
    XrEndFrame
  - Utils
    Utils
    
    XR Demo
    
    XR Demo
    
    XR Basic Rendering
    
    XR Basic Rendering
- Multi AI SSD Detection and MONAI Endoscopic Tool Segmentation
  Multi AI SSD Detection and MONAI Endoscopic Tool Segmentation
- Multi AI Ultrasound
  Multi AI Ultrasound
  - Multi-AI Ultrasound (C++)
    
    Multi-AI Ultrasound (C++)
  - Multi-AI Ultrasound (Python)
    
    Multi-AI Ultrasound (Python)
  - Operators
    Operators
    
    Visualizer iCardio
    
    Visualizer iCardio
- Nvidia nim
  Nvidia nim
  - Chat with NVIDIA NIM
    
    Chat with NVIDIA NIM
  - Medical Imaging Segmentation with NVIDIA Vista-3D NIM
    
    Medical Imaging Segmentation with NVIDIA Vista-3D NIM
  - NVIDIA NV-CLIP NIM
    
    NVIDIA NV-CLIP NIM
- Nvidia video codec
  Nvidia video codec
  - NVIDIA Video Codec: Encode-Decode Video
    
    NVIDIA Video Codec: Encode-Decode Video
  - NVIDIA Video Codec: Endoscopy Tool Tracking
    
    NVIDIA Video Codec: Endoscopy Tool Tracking
  - NVIDIA Video Codec: H.264 File Decoder
    
    NVIDIA Video Codec: H.264 File Decoder
  - NVIDIA Video Codec: Video Writer
    
    NVIDIA Video Codec: Video Writer
- Object Detection using PyTorch Faster R-CNN
  Object Detection using PyTorch Faster R-CNN
- OpenIGTLink 3D Slicer: Bidirectional Video Streaming with AI Segmentation
  OpenIGTLink 3D Slicer: Bidirectional Video Streaming with AI Segmentation
- Orsi
  Orsi
  - In-Out Body Detection and Surgical Video Anonymization
    
    In-Out Body Detection and Surgical Video Anonymization
  - Multi AI and AR Visualization
    
    Multi AI and AR Visualization
  - Surgical Tool Segmentation and AR Overlay
    
    Surgical Tool Segmentation and AR Overlay
- Polyp Detection
  Polyp Detection
- Power Spectral Density with cuNumeric
  Power Spectral Density with cuNumeric
- ProHawk Video Replayer
  ProHawk Video Replayer
- PVA-Accelerated Image Sharpening
  PVA-Accelerated Image Sharpening
- Qt Video Replayer
  Qt Video Replayer
- Radar Signal Processing over Network
  Radar Signal Processing over Network
- Real-Time Face and Text Deidentification
  Real-Time Face and Text Deidentification
- Real-time Riva ASR to local-LLM
  Real-time Riva ASR to local-LLM
- SAM 2: Segment Anything in Images and Videos
  SAM 2: Segment Anything in Images and Videos
- Simple CV-CUDA
  Simple CV-CUDA
- Simple radar pipeline
  Simple radar pipeline
  - Simple Radar Pipeline (C++)
    
    Simple Radar Pipeline (C++)
  - Simple Radar Pipeline (Python)
    
    Simple Radar Pipeline (Python)
- Slang
  Slang
  - Slang Gamma Correction Example
    
    Slang Gamma Correction Example
  - Slang Simple Compute Kernel Example
    
    Slang Simple Compute Kernel Example
- Software Defined Radio FM Demodulation
  Software Defined Radio FM Demodulation
- Speech-to-text + Large Language Model
  Speech-to-text + Large Language Model
- SSD Detection for Endoscopy Tools
  SSD Detection for Endoscopy Tools
- Stereo Vision
  Stereo Vision
- Streaming Synthetic Aperture Radar
  Streaming Synthetic Aperture Radar
- Surgical Scene Reconstruction with Gaussian Splatting
  Surgical Scene Reconstruction with Gaussian Splatting
- TAO PeopleNet Detection Model on V4L2 Video Stream
  TAO PeopleNet Detection Model on V4L2 Video Stream
- Ultrasound Bone Scoliosis Segmentation
  Ultrasound Bone Scoliosis Segmentation
  - Ultrasound Bone Scoliosis Segmentation (C++)
    
    Ultrasound Bone Scoliosis Segmentation (C++)
  - Ultrasound Bone Scoliosis Segmentation (Python)
    
    Ultrasound Bone Scoliosis Segmentation (Python)
- Velodyne VLP-16 Lidar Viewer
  Velodyne VLP-16 Lidar Viewer
- VILA Live
  VILA Live
- VITA 49 Power Spectral Density (PSD)
  VITA 49 Power Spectral Density (PSD)
  - Data Writer
    
    Data Writer
- Video Streaming Demo
  Video Streaming Demo
  - Video Streaming Client Demo
    
    Video Streaming Client Demo
  - Video Streaming Server Demo
    
    Video Streaming Server Demo
- Volume rendering using ClaraViz
  Volume rendering using ClaraViz
- VPI Stereo Vision
  VPI Stereo Vision
- WebRTC Holoviz Server
  WebRTC Holoviz Server
- WebRTC Video Client
  WebRTC Video Client
- WebRTC Video Server
  WebRTC Video Server
- XR + Gaussian Splatting
  XR + Gaussian Splatting
- XR + Holoviz
  XR + Holoviz
- Yolo Object Detection
  Yolo Object Detection
Operators
Operators
- Advanced Network library
  Advanced Network library
- AJA Source
  AJA Source
- AprilTag Detection
  AprilTag Detection
- Basic networking
  Basic networking
- Custom LSTM Inference
  Custom LSTM Inference
- CVCUDA Holoscan Interoperability
  CVCUDA Holoscan Interoperability
- Data-Distribution Service (DDS)
  Data-Distribution Service (DDS)
  - DDS Base
    
    DDS Base
  - DDS Shape Subscriber
    
    DDS Shape Subscriber
  - DDS Video
    
    DDS Video
- DELTACAST VideoMaster
  DELTACAST VideoMaster
- Deidentification
  Deidentification
  - Pixelator
    
    Pixelator
- EHR Query LLM
  EHR Query LLM
  - FHIR Client
    
    FHIR Client
  - FHIR Resource Sanitizer
    
    FHIR Resource Sanitizer
  - ZeroMQ Publisher
    
    ZeroMQ Publisher
  - ZeroMQ Subscriber
    
    ZeroMQ Subscriber
- Emergent Source
  Emergent Source
- Fast Fourier Transform (FFT)
  Fast Fourier Transform (FFT)
- Gamma Correction
  Gamma Correction
- GStreamer Bridge Components
  GStreamer Bridge Components
- GXF Tensor to VideoBuffer Converter
  GXF Tensor to VideoBuffer Converter
- High Rate PSD
  High Rate PSD
- Holohub gRPC Plugins for Holoscan SDK
  Holohub gRPC Plugins for Holoscan SDK
- Holoscan ROS2 Bridge Extension
  Holoscan ROS2 Bridge Extension
- IIO Controller
  IIO Controller
- Intel RealSense Camera
  Intel RealSense Camera
- Low Rate PSD
  Low Rate PSD
- Medical Imaging
  Medical Imaging
  - Clara Viz
    
    Clara Viz
  - DICOM Data Loader
    
    DICOM Data Loader
  - DICOM Encapsulated PDF Writer
    
    DICOM Encapsulated PDF Writer
  - DICOM Segmentation Writer
    
    DICOM Segmentation Writer
  - DICOM Series Selector
    
    DICOM Series Selector
  - DICOM Series to Volume
    
    DICOM Series to Volume
  - DICOM Text SR Writer
    
    DICOM Text SR Writer
  - Inference
    
    Inference
  - MONAI Bundle Inference
    
    MONAI Bundle Inference
  - MONAI Segmentation Inference
    
    MONAI Segmentation Inference
  - NIfTI Data Loader
    
    NIfTI Data Loader
  - PNG Converter
    
    PNG Converter
  - Publisher
    
    Publisher
  - STL Conversion
    
    STL Conversion
- Mesh to USD
  Mesh to USD
- NPP Filter
  NPP Filter
- NVIDIA Video Codec
  NVIDIA Video Codec
- OpenIGTLink
  OpenIGTLink
- OpenXR
  OpenXR
- Orsi Academy
  Orsi Academy
- Prohawk Video Processing
  Prohawk Video Processing
- Qt Video
  Qt Video
- Slang Shader
  Slang Shader
- Streaming Server
  Streaming Server
- StreamingClient
  StreamingClient
- Tensor to File
  Tensor to File
- Tool Tracking Postprocessor
  Tool Tracking Postprocessor
- Unzip
  Unzip
- Velodyne lidar
  Velodyne lidar
  - Velodyne Lidar
    
    Velodyne Lidar
- VITA49 PSD Packetizer
  VITA49 PSD Packetizer
- Video encoder
  Video encoder
  - Video Encoder Request
    
    Video Encoder Request
- Video Streaming
  Video Streaming
  - Streaming Server
    
    Streaming Server
  - StreamingClient
    
    StreamingClient
- Volume Loader
  Volume Loader
- Volume Renderer
  Volume Renderer
- VTK Renderer
  VTK Renderer
- WebRTC Client
  WebRTC Client
- WebRTC Server
  WebRTC Server
- YUAN QCAP Source
  YUAN QCAP Source
Tutorials
Tutorials
- A Study using Asynchronous Lock-free Buffer with SCHED_DEADLINE
  A Study using Asynchronous Lock-free Buffer with SCHED_DEADLINE
- Adding a GUI to Holoscan Python Applications
  Adding a GUI to Holoscan Python Applications
- Best Practices to integrate external libraries into Holoscan pipelines
  Best Practices to integrate external libraries into Holoscan pipelines
- Creating Multi Node Applications
  Creating Multi Node Applications
- CUDA MPS Tutorial for Holoscan Applications
  CUDA MPS Tutorial for Holoscan Applications
- Debugging
  Debugging
  - Holoscan SDK Visual Studio Code Dev Container Template
    
    Holoscan SDK Visual Studio Code Dev Container Template
  - Interactively Debugging a Holoscan
    
    Interactively Debugging a Holoscan
- Deploying Llama-2 70b model on the edge with IGX Orin
  Deploying Llama-2 70b model on the edge with IGX Orin
- High Performance Networking with Holoscan
  High Performance Networking with Holoscan
  - 0.1
- Holoscan Playground on AWS
  Holoscan Playground on AWS
- Holoscan SDK Response-Time Analysis
  Holoscan SDK Response-Time Analysis
- Interoperability between Holoscan and a Windows on a Single Machine
  Interoperability between Holoscan and a Windows on a Single Machine
- NVIDIA Holoscan Bootcamp
  NVIDIA Holoscan Bootcamp
- Pretrained foundational models
  Pretrained foundational models
  - Self-Supervised Contrastive Learning for Surgical videos
    
    Self-Supervised Contrastive Learning for Surgical videos
- Processing DICOM to USD with MONAI Deploy and Holoscan
  Processing DICOM to USD with MONAI Deploy and Holoscan
- Setting up CloudXR Runtime with Holoscan XR Applications
  Setting up CloudXR Runtime with Holoscan XR Applications
- Taking advantage of GPU Direct Storage on the latest NVIDIA Edge platform
  Taking advantage of GPU Direct Storage on the latest NVIDIA Edge platform
- Using Holohub in External Applications
  Using Holohub in External Applications
Benchmarks
Benchmarks
- Benchmark Model
  Benchmark Model
- Exclusive Display Benchmark
  Exclusive Display Benchmark
- Green Context CUDA Kernel Launch-Start Time Benchmark
  Green Context CUDA Kernel Launch-Start Time Benchmark
- Holoscan Flow Benchmarking for HoloHub
  Holoscan Flow Benchmarking for HoloHub
- Real-time Thread Scheduling Benchmark
  Real-time Thread Scheduling Benchmark
- Release Benchmarking Guide
  Release Benchmarking Guide

Depth Anything V2 #

Authors: Holoscan Team (NVIDIA)
Supported platforms: x86_64, aarch64
Language: Python
Last modified: August 5, 2025
Latest version: 1.0
Minimum Holoscan SDK version: 2.5.0
Tested Holoscan SDK versions: 2.8.0
Contribution metric: Level 2 - Trusted

This application uses the Depth Anything V2 model for monocular depth estimation. Monocular Depth Estimation refers to the task of predicting the distance of objects in a scene from a single 2D image captured by a standard camera.

Model#

This application uses the Depth Anything V2 model from DepthAnythingV2 for monocular depth estimation. The model is downloaded when building the Docker image.

NOTE: The user is responsible for checking if the model license is suitable for the intended purpose.

Data#

This application downloads a pre-recorded video from Pexels when the application is built. Please review the license terms from Pexels.

NOTE: The user is responsible for ensuring the dataset license is suitable for the intended purpose.

Input#

This app supports two different input options. If you have a v4l2 compatible device plugged into your machine such as a webcam, you can run this application with option 1. Otherwise you can run this application using a pre-recorded video with option 2.

v4l2 compatible input device (default, see V4L2 Support below)
pre-recorded video (see Video Replayer Support below)

To see the list of v4l2 devices connected to your machine, install v4l-utils if it's not already installed:

sudo apt-get install v4l-utils

Then run:

v4l2-ctl --list-devices

Run Instructions#

V4L2 Support#

This application supports v4l2 compatible devices as input. To run this application with your v4l2 compatible device, please plug in your input device and run:

./holohub run depth_anything_v2

By default, this application expects the input device to be mounted at /dev/video0. If this is not the case, update applications/depth_anything_v2/depth_anything_v2.yaml file to set the corresponding input device before running the application. You can also override the default input device on the command line by running:

./holohub run depth_anything_v2 --run-args="--video_device /dev/video0"

Video Replayer Support#

If you don't have a v4l2 compatible device plugged in, you can also run this application on a pre-recorded video. To launch the application using the Video Stream Replayer as the input source, run:

./holohub run depth_anything_v2 --run-args="--source replayer"

Display Modes#

This application has multiple display modes which you can toggle through using the left mouse button.

original: output the original image from input source
depth: output the color depthmap based on the depthmap returned from Depth Anything V2 model
side-by-side: output a side-by-side view of the original image next to the color depthmap
interactive: allow user

In interactive mode, the middle or right mouse button can be used to modify the ratio of original image vs color depthmap is shown.

Acknowledgement#

This project is based on the following projects: - Depth-Anything-V2 - Depth Anything V2 - depth-anything-tensorrt - Depth Anything TensorRT CLI

Known Issues#

There is a known issue running this application on IGX w/ iGPU and on Jetson AGX (see #500). The workaround is to update the device to avoid picking up the libnvv4l2.so library.

cd /usr/lib/aarch64-linux-gnu/
ls -l libv4l2.so.0.0.999999
sudo rm libv4l2.so.0.0.999999
sudo ln -s libv4l2.so.0.0.0.0  libv4l2.so.0.0.999999

Depth Anything V2#