NVIDIA Corporation
- 9.2k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- http://www.nvidia.com
Pinned
Repositories
-
- TensorRT-LLM Public
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
- cuda-quantum Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
- TransformerEngine Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
- vgpu-device-manager Public
NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
- nim-deploy Public
A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
-
- nvidia-hpcg Public
NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.