diff options
author | Shilei Tian <tianshilei1992@gmail.com> | 2021-01-26 20:21:27 -0500 |
---|---|---|
committer | Shilei Tian <tianshilei1992@gmail.com> | 2021-01-26 20:21:36 -0500 |
commit | e7535f8fedb5f355c332df9f2a87ebd61c82d983 (patch) | |
tree | 4dd841e6df024676de7666d96babba968a6e5fcc /openmp/README.rst | |
parent | e1d6178951de20b67950039470ac99377b29ddc7 (diff) | |
download | llvm-e7535f8fedb5f355c332df9f2a87ebd61c82d983.tar.gz |
[OpenMP][NVPTX] Drop dependence on CUDA to build NVPTX `deviceRTLs`
With D94745, we no longer use CUDA SDK to compile `deviceRTLs`. Therefore,
many CMake code in the project is useless. This patch cleans up unnecessary code
and also drops the requirement to build NVPTX `deviceRTLs`. CUDA detection is
still being used however to determine whether we need to involve the tests. Auto
detection of compute capability is enabled by default and can be disabled by
setting CMake variable `LIBOMPTARGET_NVPTX_AUTODETECT_COMPUTE_CAPABILITY=OFF`.
If auto detection is enabled, and CUDA is also valid, it will only build the
bitcode library for the detected version; otherwise, all variants supported will
be generated. One drawback of this patch is, we now generate 96 variants of
bitcode library, and totally 1485 files to be built with a clean build on a
non-CUDA system. `LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES=""` can be used to
disable building NVPTX `deviceRTLs`.
Reviewed By: JonChesterfield
Differential Revision: https://reviews.llvm.org/D95466
Diffstat (limited to 'openmp/README.rst')
-rw-r--r-- | openmp/README.rst | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/openmp/README.rst b/openmp/README.rst index aac0f7fc10d3..e235993caae2 100644 --- a/openmp/README.rst +++ b/openmp/README.rst @@ -281,7 +281,7 @@ Options for ``NVPTX device RTL`` **LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES** = ``35`` List of CUDA compute capabilities that should be supported by the NVPTX - device RTL. E.g. for compute capabilities 6.0 and 7.0, the option "60,70" + device RTL. E.g. for compute capabilities 6.0 and 7.0, the option "60;70" should be used. Compute capability 3.5 is the minimum required. **LIBOMPTARGET_NVPTX_DEBUG** = ``OFF|ON`` |