Holoscan Reference Applications

Home
Workflows
Workflows
- Real-Time End-to-end AI Surgical Video
  Real-Time End-to-end AI Surgical Video
Applications
Applications
- Advanced Networking Benchmark
  Advanced Networking Benchmark
- AJA Video Capture
  AJA Video Capture
- Basic Networking Ping
  Basic Networking Ping
- Basic Pulse Description Word (PDW) Generator
  Basic Pulse Description Word (PDW) Generator
- Body Pose Estimation
  Body Pose Estimation
- Colonoscopy Polyp Segmentation
  Colonoscopy Polyp Segmentation
- CUDA Quantum Variational Quantum Eigensolver (VQE)
  CUDA Quantum Variational Quantum Eigensolver (VQE)
- Dds
  Dds
  - DDS Video: Real-time Video Streaming with RTI Connext
    
    DDS Video: Real-time Video Streaming with RTI Connext
- Deltacast Videomaster Transmitter
  Deltacast Videomaster Transmitter
- Depth Anything V2
  Depth Anything V2
- Distributed
  Distributed
  - Grpc
    Grpc
    
    Distributed Endoscopy Tool Tracking with gRPC Streaming
    
    Distributed Endoscopy Tool Tracking with gRPC Streaming
    
    Distributed H.264 Endoscopy Tool Tracking with gRPC Streaming
    
    Distributed H.264 Endoscopy Tool Tracking with gRPC Streaming
  - Ucx
    Ucx
    
    Distributed H.264 Endoscopy Tool Tracking
    
    Distributed H.264 Endoscopy Tool Tracking
    
    Ucx endoscopy tool tracking
    Ucx endoscopy tool tracking
    
    UCX-based Distributed Endoscopy Tool Tracking (C++)
    
    UCX-based Distributed Endoscopy Tool Tracking (C++)
    
    UCX-based Distributed Endoscopy Tool Tracking (Python)
    
    UCX-based Distributed Endoscopy Tool Tracking (Python)
- Ehr query llm
  Ehr query llm
  - EHR Agent Framework
    
    EHR Agent Framework
  - FHIR Client for Retrieving and Posting FHIR Resources
    
    FHIR Client for Retrieving and Posting FHIR Resources
- Endoscopy Depth Estimation
  Endoscopy Depth Estimation
- Endoscopy out of body detection
  Endoscopy out of body detection
  - Endoscopy Out of Body Detection (C++)
    
    Endoscopy Out of Body Detection (C++)
  - Endoscopy Out of Body Detection (Python)
    
    Endoscopy Out of Body Detection (Python)
- Endoscopy Tool Segmentation from MONAI Model Zoo
  Endoscopy Tool Segmentation from MONAI Model Zoo
- Endoscopy tool tracking
  Endoscopy tool tracking
  - Endoscopy Tool Tracking (C++)
    
    Endoscopy Tool Tracking (C++)
  - Endoscopy Tool Tracking (Python)
    
    Endoscopy Tool Tracking (Python)
- Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
  Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
- FM Radio Automatic Speech Recognition
  FM Radio Automatic Speech Recognition
- GPU-Accelerated Orthorectification with NVIDIA OptiX
  GPU-Accelerated Orthorectification with NVIDIA OptiX
- H264
  H264
  - H.264 Endoscopy Tool Tracking
    
    H.264 Endoscopy Tool Tracking
  - H.264 Video Decode
    
    H.264 Video Decode
- High speed endoscopy
  High speed endoscopy
  - High-Speed Endoscopy (C++)
    
    High-Speed Endoscopy (C++)
  - High-Speed Endoscopy (Python)
    
    High-Speed Endoscopy (Python)
- HoloChat
  HoloChat
- Holoviz
  Holoviz
  - Holoviz HDR
    
    Holoviz HDR
  - Holoviz sRGB
    
    Holoviz sRGB
  - Holoviz UI
    
    Holoviz UI
  - Holoviz vsync
    
    Holoviz vsync
  - Holoviz YUV
    
    Holoviz YUV
- Hyperspectral Image Segmentation
  Hyperspectral Image Segmentation
- Imaging AI Whole Body Segmentation
  Imaging AI Whole Body Segmentation
- Intel RealSense Camera Visualizer
  Intel RealSense Camera Visualizer
- Isaac Holoscan Bridge
  Isaac Holoscan Bridge
- Laser detection latency
  Laser detection latency
  - EVT Camera Calibration
    
    EVT Camera Calibration
  - Laser Detection
    
    Laser Detection
  - USB Camera Calibration
    
    USB Camera Calibration
- Matlab gpu coder
  Matlab gpu coder
  - Image Processing with MATLAB GPU Coder
    
    Image Processing with MATLAB GPU Coder
  - Ultrasound Beamforming with MATLAB GPU Coder
    
    Ultrasound Beamforming with MATLAB GPU Coder
- Medical Image Viewer in XR
  Medical Image Viewer in XR
  - Operators
    Operators
    
    User interface Control
    
    User interface Control
    
    User interface Control
    
    User interface Control
    
    User interface Render
    
    User interface Render
    
    XrFrame
    
    XrFrame
    
    Convert Depth To Screen Space
    
    Convert Depth To Screen Space
    
    XRBeginFrame
    
    XRBeginFrame
    
    XrEndFrame
    
    XrEndFrame
  - Utils
    Utils
    
    XR Demo
    
    XR Demo
    
    XR Basic Rendering
    
    XR Basic Rendering
- Multi AI SSD Detection and MONAI Endoscopic Tool Segmentation
  Multi AI SSD Detection and MONAI Endoscopic Tool Segmentation
- Multiai ultrasound
  Multiai ultrasound
  - Multi-AI Ultrasound (C++)
    
    Multi-AI Ultrasound (C++)
  - Multi-AI Ultrasound (Python)
    
    Multi-AI Ultrasound (Python)
  - Operators
    Operators
    
    Visualizer iCardio
    
    Visualizer iCardio
- Nvidia nim
  Nvidia nim
  - Chat with NVIDIA NIM
    
    Chat with NVIDIA NIM
  - Medical Imaging Segmentation with NVIDIA Vista-3D NIM
    
    Medical Imaging Segmentation with NVIDIA Vista-3D NIM
  - NVIDIA NV-CLIP NIM
    
    NVIDIA NV-CLIP NIM
- Nvidia video codec
  Nvidia video codec
  - Nvc decode
    Nvc decode
    
    NVIDIA Video Codec: H.264 File Decoder
    
    NVIDIA Video Codec: H.264 File Decoder
  - Nvc encode decode
    Nvc encode decode
    
    NVIDIA Video Codec: Encode-Decode Video
    
    NVIDIA Video Codec: Encode-Decode Video
  - Nvc encode writer
    Nvc encode writer
    
    NVIDIA Video Codec: Video Writer
    
    NVIDIA Video Codec: Video Writer
- Object Detection using PyTorch Faster R-CNN
  Object Detection using PyTorch Faster R-CNN
- OpenIGTLink 3D Slicer: Bidirectional Video Streaming with AI Segmentation
  OpenIGTLink 3D Slicer: Bidirectional Video Streaming with AI Segmentation
- Orsi
  Orsi
  - Orsi Academy In-Out Body Detection and Surgical Video Anonymization
    
    Orsi Academy In-Out Body Detection and Surgical Video Anonymization
  - Orsi Academy Multi AI and AR Visualization
    
    Orsi Academy Multi AI and AR Visualization
  - Orsi Academy Surgical Tool Segmentation and AR Overlay
    
    Orsi Academy Surgical Tool Segmentation and AR Overlay
- Polyp Detection
  Polyp Detection
- Power Spectral Density with cuNumeric
  Power Spectral Density with cuNumeric
- ProHawk Video Replayer
  ProHawk Video Replayer
- PVA-Accelerated Image Sharpening
  PVA-Accelerated Image Sharpening
- Qt Video Replayer
  Qt Video Replayer
- Radar Signal Processing over Network
  Radar Signal Processing over Network
- Real-Time Face and Text Deidentification
  Real-Time Face and Text Deidentification
- Real-time Riva ASR to local-LLM
  Real-time Riva ASR to local-LLM
- SAM 2: Segment Anything in Images and Videos
  SAM 2: Segment Anything in Images and Videos
- Simple CV-CUDA
  Simple CV-CUDA
- Simple radar pipeline
  Simple radar pipeline
  - Simple Radar Pipeline (C++)
    
    Simple Radar Pipeline (C++)
  - Simple Radar Pipeline (Python)
    
    Simple Radar Pipeline (Python)
- Slang
  Slang
  - Slang Simple Compute Kernel Example
    
    Slang Simple Compute Kernel Example
- Software Defined Radio FM Demodulation
  Software Defined Radio FM Demodulation
- Speech-to-text + Large Language Model
  Speech-to-text + Large Language Model
- SSD Detection for Endoscopy Tools
  SSD Detection for Endoscopy Tools
- Stereo Vision
  Stereo Vision
- Streaming Synthetic Aperture Radar
  Streaming Synthetic Aperture Radar
- TAO PeopleNet Detection Model on V4L2 Video Stream
  TAO PeopleNet Detection Model on V4L2 Video Stream
- Ultrasound segmentation
  Ultrasound segmentation
  - Ultrasound Bone Scoliosis Segmentation (C++)
    
    Ultrasound Bone Scoliosis Segmentation (C++)
  - Ultrasound Bone Scoliosis Segmentation (Python)
    
    Ultrasound Bone Scoliosis Segmentation (Python)
- Velodyne VLP-16 Lidar Viewer
  Velodyne VLP-16 Lidar Viewer
- VILA Live
  VILA Live
- VITA 49 Power Spectral Density (PSD)
  VITA 49 Power Spectral Density (PSD)
  - Data Writer
    
    Data Writer
- Volume rendering using ClaraViz
  Volume rendering using ClaraViz
- VPI Stereo Vision
  VPI Stereo Vision
- WebRTC Holoviz Server
  WebRTC Holoviz Server
- WebRTC Video Client
  WebRTC Video Client
- WebRTC Video Server
  WebRTC Video Server
- XR + Gaussian Splatting
  XR + Gaussian Splatting
- XR + Holoviz
  XR + Holoviz
- Yolo Object Detection
  Yolo Object Detection
Operators
Operators
- Advanced Network library
  Advanced Network library
- AJA Source
  AJA Source
- ApriltagDetector
  ApriltagDetector
- Basic networking
  Basic networking
- Custom LSTM Inference
  Custom LSTM Inference
- CVCUDA Holoscan Interoperability
  CVCUDA Holoscan Interoperability
- Data-Distribution Service (DDS)
  Data-Distribution Service (DDS)
  - DDS Base
    
    DDS Base
  - DDS Shape Subscriber
    
    DDS Shape Subscriber
  - DDS Video
    
    DDS Video
- DELTACAST VideoMaster
  DELTACAST VideoMaster
- Deidentification
  Deidentification
  - Pixelator
    
    Pixelator
- EHR Query LLM
  EHR Query LLM
  - FHIR Client
    
    FHIR Client
  - FHIR Resource Sanitizer
    
    FHIR Resource Sanitizer
  - ZeroMQ Publisher
    
    ZeroMQ Publisher
  - ZeroMQ Subscriber
    
    ZeroMQ Subscriber
- EmergentSource
  EmergentSource
- Fast Fourier Transform (FFT)
  Fast Fourier Transform (FFT)
- GXF Tensor to VideoBuffer Converter
  GXF Tensor to VideoBuffer Converter
- High Rate PSD
  High Rate PSD
- Holohub gRPC Plugins for Holoscan SDK
  Holohub gRPC Plugins for Holoscan SDK
- Intel RealSense Camera
  Intel RealSense Camera
- Low Rate PSD
  Low Rate PSD
- Medical Imaging
  Medical Imaging
  - Clara Viz
    
    Clara Viz
  - DICOM Data Loader
    
    DICOM Data Loader
  - DICOM Encapsulated PDF Writer
    
    DICOM Encapsulated PDF Writer
  - DICOM Segmentation Writer
    
    DICOM Segmentation Writer
  - DICOM Series Selector
    
    DICOM Series Selector
  - DICOM Series to Volume
    
    DICOM Series to Volume
  - DICOM Text SR Writer
    
    DICOM Text SR Writer
  - Inference
    
    Inference
  - MONAI Bundle Inference
    
    MONAI Bundle Inference
  - MONAI Segmentation Inference
    
    MONAI Segmentation Inference
  - NIfTI Data Loader
    
    NIfTI Data Loader
  - PNG Converter
    
    PNG Converter
  - Publisher
    
    Publisher
  - STL Conversion
    
    STL Conversion
- NPP Filter
  NPP Filter
- NVIDIA Video Codec
  NVIDIA Video Codec
- OpenIGTLink
  OpenIGTLink
- OpenXR
  OpenXR
- Orsi
  Orsi
  - FormatConverter
    
    FormatConverter
  - OrsiVisualization
    
    OrsiVisualization
  - SegmentationPostprocessor
    
    SegmentationPostprocessor
  - SegmentationPreprocessor
    
    SegmentationPreprocessor
- Prohawk
  Prohawk
- QCAPSource
  QCAPSource
- Qt Video
  Qt Video
- SendMeshToUSD
  SendMeshToUSD
- Slang Shader
  Slang Shader
- Tensor to File
  Tensor to File
- Tool Tracking Postprocessor
  Tool Tracking Postprocessor
- Unzip
  Unzip
- Velodyne lidar
  Velodyne lidar
  - Velodyne Lidar
    
    Velodyne Lidar
- VITA49 PSD Packetizer
  VITA49 PSD Packetizer
- Video encoder
  Video encoder
  - Video Encoder Request
    
    Video Encoder Request
- Volume Loader
  Volume Loader
- Volume Renderer
  Volume Renderer
- VTK Renderer
  VTK Renderer
- WebRTC Client
  WebRTC Client
- WebRTC Server
  WebRTC Server
Tutorials
Tutorials
- Adding a GUI to Holoscan Python Applications
  Adding a GUI to Holoscan Python Applications
- Best Practices to integrate external libraries into Holoscan pipelines
  Best Practices to integrate external libraries into Holoscan pipelines
- Creating Multi Node Applications
  Creating Multi Node Applications
- CUDA MPS Tutorial for Holoscan Applications
  CUDA MPS Tutorial for Holoscan Applications
- Debugging
  Debugging
  - Holoscan SDK Visual Studio Code Dev Container Template
    
    Holoscan SDK Visual Studio Code Dev Container Template
  - Interactively Debugging a Holoscan
    
    Interactively Debugging a Holoscan
- Deploying Llama-2 70b model on the edge with IGX Orin
  Deploying Llama-2 70b model on the edge with IGX Orin
- High Performance Networking with Holoscan
  High Performance Networking with Holoscan
  - 0.1
- Holoscan Playground on AWS
  Holoscan Playground on AWS
- Holoscan SDK Response-Time Analysis
  Holoscan SDK Response-Time Analysis
- Interoperability between Holoscan and a Windows on a Single Machine
  Interoperability between Holoscan and a Windows on a Single Machine
- NVIDIA Holoscan Bootcamp
  NVIDIA Holoscan Bootcamp
- Pretrained foundational models
  Pretrained foundational models
  - Self-Supervised Contrastive Learning for Surgical videos
    
    Self-Supervised Contrastive Learning for Surgical videos
- Processing DICOM to USD with MONAI Deploy and Holoscan
  Processing DICOM to USD with MONAI Deploy and Holoscan
- Setting up CloudXR Runtime with Holoscan XR Applications
  Setting up CloudXR Runtime with Holoscan XR Applications
- Taking advantage of GPU Direct Storage on the latest NVIDIA Edge platform
  Taking advantage of GPU Direct Storage on the latest NVIDIA Edge platform
- Using Holohub in External Applications
  Using Holohub in External Applications
Benchmarks
Benchmarks
- Benchmark Model
  Benchmark Model
- Exclusive Display Benchmark
  Exclusive Display Benchmark
- Holoscan Flow Benchmarking for HoloHub
  Holoscan Flow Benchmarking for HoloHub
- Release Benchmarking Guide
  Release Benchmarking Guide

Stereo Vision #

Authors: Holoscan Team (NVIDIA)
Supported platforms: x86_64, aarch64
Language: C++
Last modified: August 5, 2025
Latest version: 1.0
Minimum Holoscan SDK version: 2.4.0
Tested Holoscan SDK versions: 2.4.0
Contribution metric: Level 1 - Highly Reliable

Holoscan Stereo Vision

Overview#

A demo pipeline showcasing stereo disparity estimation.

Description#

This pipeline takes video from a stereo camera and estimates disparity using DNN ESS. The disparity map is displayed through Holoviz.

Requirements#

This application requires a V4L2 stereo camera or recorded stereo video as input. A video acquired from a StereoLabs ZED camera is downloaded when running the get_data_and_models.sh script when building the application. A script for obtaining the calibration for StereoLabs cameras is also provided. Holoscan SDK >=2.0,<=2.5 is required for TensorRT 8.6 compatibility.

Camera Calibration#

The default calibration will work for the sample video. If using a stereolabs camera the calibration can be retrieved using get_zed_calibration.py and the devices serial number.

python3 get_zed_calibration.py -s [Serial Number]

Input video#

For the input video stream, either use a v4l2 stereo camera such as those produced by stereolabs or included recorded video. The stereo-plants.mp4 video is provided here and will be downloaded and converted to the necessary format when building the application.

The source device in stereo_vision.yaml should be modified to match the device the v4l2 video is using. This can be found using v4l2-ctl --list-devices.

Models#

This demo requires the ESS DNN Stereo Disparity available from the NGC catalog for disparity estimation. This model is downloaded when you build the application.

ESS DNN#

The ESS engine files generated in this demo application is specific to TRT8.6; make sure you build the devcontainer with a compatible base_img as shown in the Build and Run Instructions section.

Build and Run Instructions#

Run the following command to build and run application using the recorded video:

./holohub run stereo_vision --base_img nvcr.io/nvidia/clara-holoscan/holoscan:v2.4.0-dgpu

To run the application using a v4l2 compatible stereo camera, run:

./holohub run stereo_vision --base_img nvcr.io/nvidia/clara-holoscan/holoscan:v2.4.0-dgpu --run-args="--source v4l2"

Stereo Vision#