backends/cuda: NVRTC compile to CUBIN when supported (resolve #1587)
This allows using a newer CUDA runtime with an older driver, and seems to have no downsides. NVRTC can generate cubins directly starting with CUDA 11.1.

[...] NVRTC used to support only virtual architectures through the option -arch, since it was only emitting PTX. It will now support actual architectures as well to emit SASS. The interface is augmented to retrieve either the PTX or cubin if an actual architecture is specified.

https://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html#dynamic-code-generation
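The "when supported" decision described above can be sketched roughly as follows. This is not the code from this commit: the helper names `nvrtc_supports_cubin` and `nvrtc_arch_option` are hypothetical, and the sketch assumes the NVRTC version was obtained from `nvrtcVersion` and the compute capability was queried from the device. It only builds the architecture option; the actual compile and the retrieval through `nvrtcGetCUBIN` or `nvrtcGetPTX` are left out.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper (not from the commit): true if this NVRTC version can
// emit a cubin directly. The version pair is assumed to come from
// nvrtcVersion(&major, &minor).
bool nvrtc_supports_cubin(int nvrtc_major, int nvrtc_minor) {
    // Direct cubin generation is available starting with CUDA 11.1.
    return nvrtc_major > 11 || (nvrtc_major == 11 && nvrtc_minor >= 1);
}

// Hypothetical helper: build the architecture option for a device with
// compute capability cc_major.cc_minor.
//  - actual architecture (sm_XX): NVRTC emits SASS, retrieved as a cubin
//  - virtual architecture (compute_XX): NVRTC emits PTX only
std::string nvrtc_arch_option(int nvrtc_major, int nvrtc_minor,
                              int cc_major, int cc_minor) {
    assert(cc_major > 0 && cc_minor >= 0);
    const char* kind =
        nvrtc_supports_cubin(nvrtc_major, nvrtc_minor) ? "sm_" : "compute_";
    return std::string("--gpu-architecture=") + kind +
           std::to_string(cc_major) + std::to_string(cc_minor);
}
```

With NVRTC 11.1 and a compute capability 8.0 device this yields `--gpu-architecture=sm_80`, after which the result would be fetched with `nvrtcGetCUBINSize` and `nvrtcGetCUBIN`. With an older NVRTC the option falls back to a `compute_XX` target and the PTX path (`nvrtcGetPTX`) is used, leaving the final compilation to the driver.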