featured
Apr 22, 2024
Developing Virtual Factory Solutions with OpenUSD and NVIDIA Omniverse
With NVIDIA AI, NVIDIA Omniverse, and the Universal Scene Description (OpenUSD) ecosystem, industrial developers are building virtual factory solutions that...
4 MIN READ
Apr 22, 2024
Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server
We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...
9 MIN READ
Apr 22, 2024
Enhanced DU Performance and Workload Consolidation for 5G/6G with NVIDIA Aerial CUDA-Accelerated RAN
Aerial CUDA-Accelerated radio access network (RAN) enables acceleration of telco workloads, delivering new levels of spectral efficiency (SE) on a cloud-native...
14 MIN READ
Apr 19, 2024
Measuring the GPU Occupancy of Multi-stream Workloads
NVIDIA GPUs are becoming increasingly powerful with each new generation. This increase generally comes in two forms. Each streaming multi-processor (SM), the...
11 MIN READ
Apr 18, 2024
New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model
NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...
4 MIN READ
Apr 18, 2024
Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT
NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...
6 MIN READ
Apr 18, 2024
Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models
NVIDIA NeMo, an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises—released the...
6 MIN READ
Apr 17, 2024
Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec
This post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...
16 MIN READ
Apr 12, 2024
Explainer: What Is a Convolutional Neural Network?
A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.
1 MIN READ
Apr 11, 2024
New Video Series: OpenUSD for Developers
Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...
3 MIN READ
Apr 10, 2024
How Generative AI is Empowering Climate Tech with NVIDIA Earth-2
In the context of global warming, NVIDIA Earth-2 has emerged as a pivotal platform for climate tech, generating actionable insights in the face of increasingly...
14 MIN READ
Apr 09, 2024
Next-Generation Live Media Apps on Repurposable Clusters with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is now available to all developers looking to build next-generation live media applications on fully repurposable clusters. ...
4 MIN READ
Apr 05, 2024
Explainer: What Is Retrieval-Augmented Generation?
Retrieval-augmented generation enhances large language model prompts with relevant data for more practical, accurate responses.
1 MIN READ
Apr 03, 2024
Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2
Large-scale graph neural network (GNN) training presents formidable challenges, particularly concerning the scale and complexity of graph data. These challenges...
5 MIN READ
Apr 03, 2024
New Lab: Generative AI Inference with NVIDIA NIM
Get started with NVIDIA NIM for deploying large language models (LLMs). Request access to a free, hands-on lab today.
1 MIN READ
Apr 02, 2024
Tune and Deploy LoRA LLMs with NVIDIA TensorRT-LLM
Large language models (LLMs) have revolutionized natural language processing (NLP) with their ability to learn from massive amounts of text and generate fluent...
15 MIN READ