internal/

directory
v1.16.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Mar 25, 2026 License: Apache-2.0

Directories

Path Synopsis
Package clblast provides Go wrappers for the CLBlast BLAS library.
Package clblast provides Go wrappers for the CLBlast BLAS library.
Package codegen generates CUDA megakernel source code from compiled computation graphs.
Package codegen generates CUDA megakernel source code from compiled computation graphs.
Package cublas provides low-level purego bindings for the cuBLAS library.
Package cublas provides low-level purego bindings for the cuBLAS library.
Package cuda provides low-level bindings for the CUDA runtime API using (Stability: stable) dlopen/dlsym (no CGo).
Package cuda provides low-level bindings for the CUDA runtime API using (Stability: stable) dlopen/dlsym (no CGo).
kernels
Package kernels provides Go wrappers for custom CUDA kernels.
Package kernels provides Go wrappers for custom CUDA kernels.
Package cudnn provides purego bindings for the NVIDIA cuDNN library.
Package cudnn provides purego bindings for the NVIDIA cuDNN library.
Package gpuapi defines internal interfaces for GPU runtime operations.
Package gpuapi defines internal interfaces for GPU runtime operations.
hip
Package hip provides low-level bindings for the AMD HIP runtime API (Stability: alpha) using purego dlopen.
Package hip provides low-level bindings for the AMD HIP runtime API (Stability: alpha) using purego dlopen.
kernels
Package kernels provides Go wrappers for custom HIP kernels via purego (Stability: alpha) dlopen.
Package kernels provides Go wrappers for custom HIP kernels via purego (Stability: alpha) dlopen.
Package miopen provides low-level bindings for the AMD MIOpen library (Stability: alpha) using purego dlopen.
Package miopen provides low-level bindings for the AMD MIOpen library (Stability: alpha) using purego dlopen.
Package nccl provides CGo bindings for the NVIDIA Collective Communications (Stability: beta) Library (NCCL).
Package nccl provides CGo bindings for the NVIDIA Collective Communications (Stability: beta) Library (NCCL).
Package opencl provides Go wrappers for the OpenCL 2.0 runtime API.
Package opencl provides Go wrappers for the OpenCL 2.0 runtime API.
kernels
Package kernels provides OpenCL kernel source and dispatch for elementwise operations.
Package kernels provides OpenCL kernel source and dispatch for elementwise operations.
Package rocblas provides low-level bindings for the AMD rocBLAS library (Stability: alpha) using purego dlopen.
Package rocblas provides low-level bindings for the AMD rocBLAS library (Stability: alpha) using purego dlopen.
Package tensorrt provides bindings for the NVIDIA TensorRT inference (Stability: alpha) library via purego (dlopen/dlsym, no CGo).
Package tensorrt provides bindings for the NVIDIA TensorRT inference (Stability: alpha) library via purego (dlopen/dlsym, no CGo).
Package workerpool provides a persistent pool of goroutines that process submitted tasks.
Package workerpool provides a persistent pool of goroutines that process submitted tasks.
Package xblas provides CPU BLAS wrappers with ARM NEON and AVX2 SIMD assembly.
Package xblas provides CPU BLAS wrappers with ARM NEON and AVX2 SIMD assembly.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL