Skip to content
Change the repository type filter

All

    Repositories list

    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1373302649Updated Jan 4, 2025Jan 4, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      12k1263117Updated Jan 4, 2025Jan 4, 2025
    • rocPyDecode is a set of Python bindings to rocDecode C++ library which provides full HW acceleration for video decoding on AMD GPUs.
      C++
      MIT License
      7305Updated Jan 4, 2025Jan 4, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1282861421Updated Jan 4, 2025Jan 4, 2025
    • Python
      Other
      51495Updated Jan 4, 2025Jan 4, 2025
    • flang

      Public
      Mirror of flang repo: The source repo is https://github.com/flang-compiler/flang . Once a day the master branch is updated from the upstream source repo and then locked. AOMP or ROCm developers may commit or create PRs on branch aomp-dev.
      C++
      Other
      86010Updated Jan 4, 2025Jan 4, 2025
    • rocRAND

      Public
      RAND library for HIP programming language
      C++
      MIT License
      7011416Updated Jan 4, 2025Jan 4, 2025
    • rpp

      Public
      AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
      C++
      MIT License
      415748Updated Jan 4, 2025Jan 4, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      8819335051Updated Jan 4, 2025Jan 4, 2025
    • C++
      MIT License
      101786Updated Jan 4, 2025Jan 4, 2025
    • 8-bit CUDA functions for PyTorch
      Python
      MIT License
      6444283Updated Jan 4, 2025Jan 4, 2025
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      15222858Updated Jan 3, 2025Jan 3, 2025
    • rocSPARSE

      Public
      Next generation SPARSE implementation for ROCm platform
      C++
      MIT License
      5611620Updated Jan 3, 2025Jan 3, 2025
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      4593018Updated Jan 3, 2025Jan 3, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2321.1k24864Updated Jan 3, 2025Jan 3, 2025
    • ONNX Runtime: cross-platform, high performance scoring engine for ML models
      C++
      MIT License
      3k606Updated Jan 3, 2025Jan 3, 2025
    • CMake modules used within the ROCm libraries
      CMake
      MIT License
      4363413Updated Jan 3, 2025Jan 3, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2207640Updated Jan 3, 2025Jan 3, 2025
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3934.8k11417Updated Jan 3, 2025Jan 3, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k1502412Updated Jan 3, 2025Jan 3, 2025
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      16935441Updated Jan 3, 2025Jan 3, 2025
    • hipTensor

      Public
      AMD’s C++ library for accelerating tensor primitives
      C++
      MIT License
      203702Updated Jan 3, 2025Jan 3, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1647101Updated Jan 3, 2025Jan 3, 2025
    • apex

      Public
      A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k19138Updated Jan 3, 2025Jan 3, 2025
    • HIP Python Low-level Bindings
      Shell
      MIT License
      31722Updated Jan 3, 2025Jan 3, 2025
    • dyninst

      Public
      DyninstAPI: Tools for binary instrumentation, analysis, and modification.
      C
      GNU Lesser General Public License v2.1
      157100Updated Jan 3, 2025Jan 3, 2025
    • ROCm Systems Profiler
      C++
      MIT License
      61202Updated Jan 3, 2025Jan 3, 2025
    • rocWMMA

      Public
      rocWMMA
      C++
      MIT License
      269622Updated Jan 3, 2025Jan 3, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.7k100943Updated Jan 3, 2025Jan 3, 2025
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      Other
      1713612Updated Jan 3, 2025Jan 3, 2025