A high-throughput and memory-efficient inference and serving engine for LLMs
-
Updated
Jun 11, 2024 - Python
A high-throughput and memory-efficient inference and serving engine for LLMs
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Two-moment AMR radiation hydrodynamics (with self-gravity, particles, and chemistry) on CPUs/GPUs for astrophysics
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
AWS CloudFormation templates to provision Linux or Windows EC2 instances with GUI running NICE DCV remote display server. Includes option to install GPU drivers
A deep learning package for many-body potential energy representation and molecular dynamics
Add a description, image, and links to the rocm topic page so that developers can more easily learn about it.
To associate your repository with the rocm topic, visit your repo's landing page and select "manage topics."