Skip to content
Snippets Groups Projects

add support for TensorRT 10.10

Merged matthieu marchal requested to merge mmarchal/aidge_export_tensorrt:trt-10 into main

Context

This MR aims to add support for newer version of TensorRT 10.10.

Modified files

Added tensorrt_10.10 folder which is a copy of tensorrt_8.6 folder in static/ directory. The following modifications have been applied to match with the 10.10 API version of TensorRT :

  • Modified tools/tensorrt10.10_compiler.Dockerfile & Makefile to use correct version of tensorrt
  • Updated unsupported dims32 → dims in src/Graph.cpp to match TRT 10 API
  • Updated deprecated PlatformHasFast*() → custom cudaHasFast*() for datamode() mapping in include/Utils.cpp and src/Graph.cpp
  • Updated trt_version = 10.10 by default in export() in __init__.py

Detailed major modifications

Added a static bool cudaSupportsDatatype(nvinfer1::Datatype datatype) to confirm that a datatype is supported on the CUDA SM. The support matrix is available at Support Hardware & Precision Matrix

Acknowledged deprecations

  • The implicit-quantization API (IInt8Calibrator) is deprecated in TensorRT 10.10; while aidge integrates ONNX Q/DQ layer support for the recommended explicit-quantization workflow, we’ll keep implicit quantization now and migrate in a future refactor. More informations are available at Quantized types documentation
Edited by matthieu marchal

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
Please register or sign in to reply
Loading