NVIDIA Holoscan Reference Applications

MONAI Segmentation Inference Operator

Authors: Holoscan SDK Team (NVIDIA)
Supported platforms: x86_64, aarch64
Language: Python
Last modified: August 5, 2025
Latest version: 1.1.0
Minimum Holoscan SDK version: 1.0.3
Tested Holoscan SDK versions: 2.2.0, 3.2.0
Contribution metric: Level 2 - Trusted

This segmentation operator uses MONAI transforms and Sliding Window Inference to segment medical images.

Overview

The MonaiSegInferenceOperator performs pre-transforms on input images, runs segmentation inference using a specified model, and applies post-transforms. The segmentation result is returned as an in-memory image object and can optionally be saved to disk.

Requirements

- Holoscan SDK Python package
- MONAI
- torch

Example Usage

from pathlib import Path
import torch
from monai.transforms import Compose, LoadImage, ScaleIntensity, EnsureChannelFirst
from holoscan.core import Fragment
from operators.medical_imaging.monai_segmentation_inference_operator import MonaiSegInferenceOperator
from operators.medical_imaging.core import AppContext, IOMapping, IOType, Image

# Initialize the fragment
fragment = Fragment()

# Create app context
app_context = AppContext({})

# Define transforms
pre_transforms = Compose([
    LoadImage(image_only=True),
    EnsureChannelFirst(),
    ScaleIntensity(),
])

post_transforms = Compose([
    # Add your post-processing transforms here
])

# Initialize the segmentation operator
seg_op = MonaiSegInferenceOperator(
    fragment,
    roi_size=(96, 96, 96),  # Example ROI size for 3D images
    pre_transforms=pre_transforms,
    post_transforms=post_transforms,
    app_context=app_context,
    model_name="unet",  # Example model name
    overlap=0.25,
    sw_batch_size=4,
    model_path=Path("/path/to/your/model.pt")  # Replace with your model path
)
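
The `roi_size`, `overlap`, and `sw_batch_size` arguments above control sliding window inference: the volume is cut into overlapping ROI-sized windows, and the windows are passed through the model in batches. The sketch below illustrates that tiling arithmetic using only the standard library. It is a simplified illustration, not MONAI's implementation: the helper names `window_starts` and `plan_windows` and the 160 x 160 x 128 volume are hypothetical, and padding and blending of overlapping predictions are not modeled.

```python
import math
from itertools import product


def window_starts(image_len, roi_len, overlap):
    """Start indices of the windows along one axis (illustrative only)."""
    if roi_len >= image_len:
        # A single window covers the whole axis; padding is not modeled here.
        return [0]
    # Neighbouring windows are shifted by the non-overlapping part of the ROI.
    step = max(int(roi_len * (1 - overlap)), 1)
    count = math.ceil((image_len - roi_len) / step) + 1
    # Clamp the last window so that it ends exactly at the image border.
    return [min(i * step, image_len - roi_len) for i in range(count)]


def plan_windows(image_size, roi_size, overlap):
    """Start corners of all windows, as the product of the per-axis starts."""
    per_axis = [
        window_starts(n, r, overlap) for n, r in zip(image_size, roi_size)
    ]
    return list(product(*per_axis))


# Hypothetical volume size; roi_size, overlap and the batch size of 4
# mirror the values passed to the operator in the example above.
image_size = (160, 160, 128)
windows = plan_windows(image_size, roi_size=(96, 96, 96), overlap=0.25)
num_batches = math.ceil(len(windows) / 4)

print(len(windows))  # 8 windows (2 per axis)
print(num_batches)   # 2 forward passes with sw_batch_size=4
```

Under these assumptions, a larger `overlap` shrinks the step between windows and increases the window count, while a larger `sw_batch_size` reduces the number of forward passes at the cost of more GPU memory per pass.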
