rocm

Star

Here are 121 public repositories matching this topic...

vllm-project / vllm

Star

A high-throughput and memory-efficient inference and serving engine for LLMs

amd cuda inference pytorch transformer llama gpt rocm model-serving mlops llm inferentia llmops llm-serving trainium

Updated Jun 11, 2024
Python

ROCm / rpp

Star

AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.

cpu computer-vision hpc amd gpu opencl histogram contrast bitwise hip rocm openvx rpp mivisionx radeon-performance-primitives warp-affine channel-extract agumentation

Updated Jun 11, 2024
C++

eliranwong / MultiAMDGPU_AIDev_Ubuntu

Star

Multi AMD GPU Setup for AI Development on Ubuntu with ROCM

ai ubuntu amd gpu amdgpu rocm amd-gpu freegenius

Updated Jun 11, 2024

patientx / ComfyUI-Zluda

Star

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface. Now ZLUDA enhanced for better AMD GPU performance.

windows amd cuda rocm stable-diffusion comfyui zluda

Updated Jun 11, 2024
Python

ROCm / rocBLAS

Star

Next generation BLAS implementation for ROCm platform

blas hip rocm

Updated Jun 11, 2024
C++

apache / tvm

Star

Open deep learning compiler stack for cpu, gpu and specialized accelerators

javascript machine-learning performance deep-learning metal compiler gpu vulkan opencl tensor spirv rocm tvm

Updated Jun 11, 2024
Python

DejvBayer / afft

Star

C++17 wrapper library for fft-related computations on CPUs and GPUs

cuda fft hip dct mkl dst cufft rocm dtt fftw3 pocketfft vkfft

Updated Jun 11, 2024
C++

quokka-astro / quokka

Star

Two-moment AMR radiation hydrodynamics (with self-gravity, particles, and chemistry) on CPUs/GPUs for astrophysics

gpu cuda particles astrophysics hip hydrodynamics astrochemistry rocm adaptive-mesh-refinement self-gravity

Updated Jun 11, 2024
C++

PennyLaneAI / pennylane-lightning

Star

The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane

hpc gpu parallel openmp mpi distributed-computing cuda quantum-computing rocm quantum-machine-learning

Updated Jun 11, 2024
C++

pika-org / pika

Star

pika builds on C++ std::execution with fiber, CUDA, HIP, and MPI support.

cplusplus cpp gpu concurrency mpi cuda parallelism hip rocm stdexec p2300

Updated Jun 11, 2024
C++

cupy / cupy

Sponsor

Star

NumPy & SciPy for GPU

python gpu numpy cuda cublas scipy tensor cudnn rocm cupy cusolver nccl curand cusparse nvrtc cutensor nvtx cusparselt

Updated Jun 11, 2024
Python

aws-samples / amazon-ec2-nice-dcv-samples

Star

AWS CloudFormation templates to provision Linux or Windows EC2 instances with GUI running NICE DCV remote display server. Includes option to install GPU drivers