Computer Vision / Video Analytics

TensorRT 5 RC Now Available

Sep 20, 2018

By Nefi Alarcon

AT GTC Japan, NVIDIA announced the latest version of the TensorRT’s high-performance deep learning inference optimizer and runtime. Today we are releasing the TensorRT 5 Release Candidate. TensorRT 5 supports the new Turing architecture, provides new optimizations, and INT8 APIs achieving up to 40x faster inference over CPU-only platforms. This latest version also dramatically speeds up inference of recommenders, neural machine translation, speech, and natural language processing apps.
TensorRT 5 Highlights:

Speeds up inference by 40x over CPUs for models such as translation using mixed precision on Turing Tensor Cores
Optimizes inference models with new INT8 APIs
Supports Xavier-based NVIDIA Drive platforms and the NVIDIA DLA accelerator for FP16

TensorRT 5 RC is available now to all members of the NVIDIA Developer Program.
Learn more>

Related resources

NGC Containers: TensorRT
NGC Containers: TensorRT PB October 2023 (PB 23h2)
NGC Containers: IGX - TensorRT PB October 2023 (PB 23h2)
SDK: Torch-TensorRT
SDK: TensorFlow-TensorRT
SDK: TensorRT

Discuss (0)

About the Authors

About Nefi Alarcon
Nefi Alarcon is a senior executive communications manager on NVIDIA's leadership team. He has years of media relations and communication experience, and has previously worked at Google, Mozilla, and CNN. He received his bachelor's degree in Journalism from George Washington University.

View all posts by Nefi Alarcon