Custom LSTM Inference
The lstm_tensor_rt_inference extension provides an LSTM (Long Short-Term Memory) stateful inference module using TensorRT.
nvidia::holoscan::lstm_tensor_rt_inference::TensorRtInference
Codelet that takes input tensors and feeds them into TensorRT for LSTM inference. This implementation is based on nvidia::gxf::TensorRtInference.
The input_state_tensor_names and output_state_tensor_names parameters are added to specify the names of the state tensors in the LSTM model.
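Below is a minimal sketch of how these state parameters could be supplied when the codelet is used through a Holoscan wrapper operator. The operator class holoscan::ops::LSTMTensorRTInferenceOp (from HoloHub) and all tensor names are assumptions used for illustration only; they are not defined by this extension.

    // Sketch only: these calls would live inside a holoscan::Application's compose().
    // All tensor names are illustrative and must match the actual ONNX model.
    auto lstm_inferer = make_operator<holoscan::ops::LSTMTensorRTInferenceOp>(
        "lstm_inferer",
        holoscan::Arg("model_file_path") = std::string("model/my_lstm_model.onnx"),
        // Regular inputs followed by the recurrent state inputs:
        holoscan::Arg("input_tensor_names") =
            std::vector<std::string>{"source_video", "cell_state_in", "hidden_state_in"},
        holoscan::Arg("input_state_tensor_names") =
            std::vector<std::string>{"cell_state_in", "hidden_state_in"},
        // Regular outputs followed by the recurrent state outputs:
        holoscan::Arg("output_tensor_names") =
            std::vector<std::string>{"probs", "cell_state_out", "hidden_state_out"},
        holoscan::Arg("output_state_tensor_names") =
            std::vector<std::string>{"cell_state_out", "hidden_state_out"});

The state tensors named this way are carried across invocations by the codelet, so downstream operators only need to consume the regular output tensors.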
Parameters
model_file_path: Path to the ONNX model to be loaded (type: std::string)
engine_cache_dir: Path to a directory containing cached generated engines to be serialized and loaded from (type: std::string)
plugins_lib_namespace: Namespace used to register all the plugins in this library (default: ""; type: std::string)
force_engine_update: Always update the engine regardless of any existing engine file; such a conversion may take minutes (default: false; type: bool)
input_tensor_names: Names of input tensors, in the order to be fed into the model (type: std::vector<std::string>)
input_state_tensor_names: Names of input state tensors that are used internally by TensorRT (type: std::vector<std::string>)
input_binding_names: Names of input bindings as in the model, in the same order as provided in input_tensor_names (type: std::vector<std::string>)
output_tensor_names: Names of output tensors, in the order to be retrieved from the model (type: std::vector<std::string>)
output_state_tensor_names: Names of output state tensors that are used internally by TensorRT (type: std::vector<std::string>)
output_binding_names: Names of output bindings in the model, in the same order as provided in output_tensor_names (type: std::vector<std::string>)
pool: Allocator instance for output tensors (type: gxf::Handle<gxf::Allocator>)
cuda_stream_pool: Instance of gxf::CudaStreamPool used to allocate the CUDA stream (type: gxf::Handle<gxf::CudaStreamPool>)
max_workspace_size: Size of the working space in bytes (default: 67108864, i.e. 64 MB; type: int64_t)
dla_core: The DLA core to use; fallback to GPU is always enabled. Defaults to using the GPU only (optional; type: int32_t)
max_batch_size: Maximum possible batch size, in case the first dimension is dynamic and used as the batch size (default: 1; type: int32_t)
enable_fp16_: Enable inference with FP16 and FP32 fallback (default: false; type: bool)
verbose: Enable verbose logging on the console (default: false; type: bool)
relaxed_dimension_check: Ignore dimensions of 1 for the input tensor dimension check (default: true; type: bool)
rx: List of receivers to take input tensors (type: std::vector<gxf::Handle<gxf::Receiver>>)
tx: Transmitter to publish output tensors (type: gxf::Handle<gxf::Transmitter>)
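For context, the sketch below shows how such a codelet is commonly embedded in a Holoscan application, with the pool and cuda_stream_pool parameters backed by application resources and the remaining parameters loaded from an application YAML file. The wrapper operator holoscan::ops::LSTMTensorRTInferenceOp, the header name, the configuration key, and the file names are assumptions for illustration; only the parameter names come from the list above.

    // Sketch only: a hypothetical application embedding the LSTM inference codelet
    // through an assumed HoloHub wrapper operator.
    #include <holoscan/holoscan.hpp>
    #include <lstm_tensor_rt_inference.hpp>  // assumed wrapper header name

    class LstmApp : public holoscan::Application {
     public:
      void compose() override {
        using namespace holoscan;

        // Resources backing the "pool" and "cuda_stream_pool" parameters.
        auto pool = make_resource<UnboundedAllocator>("pool");
        auto cuda_stream_pool =
            make_resource<CudaStreamPool>("cuda_stream", 0, 0, 0, 1, 5);

        // Remaining parameters (model path, tensor/binding names, etc.) are read
        // from the application YAML under an assumed "lstm_inference" key.
        auto lstm_inferer = make_operator<ops::LSTMTensorRTInferenceOp>(
            "lstm_inferer",
            from_config("lstm_inference"),
            Arg("pool") = pool,
            Arg("cuda_stream_pool") = cuda_stream_pool);

        // In a real application the operator is connected with add_flow(), where
        // "rx" receives the input tensors and "tx" publishes the output tensors,
        // e.g. add_flow(preprocessor, lstm_inferer) and add_flow(lstm_inferer, postprocessor).
        add_operator(lstm_inferer);
      }
    };

    int main() {
      auto app = holoscan::make_application<LstmApp>();
      app->config("app_config.yaml");  // assumed configuration file
      app->run();
      return 0;
    }

The allocator and CUDA stream pool are passed as resources rather than plain values, matching the gxf::Handle types listed for pool and cuda_stream_pool.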