Directories
¶
| Path | Synopsis |
|---|---|
|
Package clblast provides Go wrappers for the CLBlast BLAS library.
|
Package clblast provides Go wrappers for the CLBlast BLAS library. |
|
Package codegen generates CUDA megakernel source code from a compiled ExecutionPlan instruction tape.
|
Package codegen generates CUDA megakernel source code from a compiled ExecutionPlan instruction tape. |
|
Package cublas provides low-level purego bindings for the cuBLAS library.
|
Package cublas provides low-level purego bindings for the cuBLAS library. |
|
Package cuda provides low-level bindings for the CUDA runtime API using dlopen/dlsym (no CGo).
|
Package cuda provides low-level bindings for the CUDA runtime API using dlopen/dlsym (no CGo). |
|
kernels
Package kernels provides Go wrappers for custom CUDA kernels.
|
Package kernels provides Go wrappers for custom CUDA kernels. |
|
Package cudnn provides purego bindings for the NVIDIA cuDNN library.
|
Package cudnn provides purego bindings for the NVIDIA cuDNN library. |
|
Package gpuapi defines internal interfaces for GPU runtime operations.
|
Package gpuapi defines internal interfaces for GPU runtime operations. |
|
Package hip provides low-level bindings for the AMD HIP runtime API using purego dlopen.
|
Package hip provides low-level bindings for the AMD HIP runtime API using purego dlopen. |
|
kernels
Package kernels provides Go wrappers for custom HIP kernels via purego dlopen.
|
Package kernels provides Go wrappers for custom HIP kernels via purego dlopen. |
|
Package miopen provides low-level bindings for the AMD MIOpen library using purego dlopen.
|
Package miopen provides low-level bindings for the AMD MIOpen library using purego dlopen. |
|
Package nccl provides CGo bindings for the NVIDIA Collective Communications Library (NCCL).
|
Package nccl provides CGo bindings for the NVIDIA Collective Communications Library (NCCL). |
|
Package opencl provides Go wrappers for the OpenCL 2.0 runtime API.
|
Package opencl provides Go wrappers for the OpenCL 2.0 runtime API. |
|
kernels
Package kernels provides OpenCL kernel source and dispatch for elementwise operations.
|
Package kernels provides OpenCL kernel source and dispatch for elementwise operations. |
|
Package rocblas provides low-level bindings for the AMD rocBLAS library using purego dlopen.
|
Package rocblas provides low-level bindings for the AMD rocBLAS library using purego dlopen. |
|
Package tensorrt provides bindings for the NVIDIA TensorRT inference library via purego (dlopen/dlsym, no CGo).
|
Package tensorrt provides bindings for the NVIDIA TensorRT inference library via purego (dlopen/dlsym, no CGo). |
|
Package workerpool provides a persistent pool of goroutines that process submitted tasks.
|
Package workerpool provides a persistent pool of goroutines that process submitted tasks. |
Click to show internal directories.
Click to hide internal directories.