summaryrefslogtreecommitdiff
path: root/openmp/README.rst
diff options
context:
space:
mode:
authorGheorghe-Teodor Bercea <gheorghe-teod.bercea@ibm.com>2018-02-12 16:45:20 +0000
committerGheorghe-Teodor Bercea <gheorghe-teod.bercea@ibm.com>2018-02-12 16:45:20 +0000
commitd5ae4e65014f4c72d7aa61c368084ce214488a32 (patch)
tree563e905f71af8ade994887b0e1cd8f6f955fc137 /openmp/README.rst
parent7dc0f1ec451abea08e725d9bc1bbb5829e8cfa7f (diff)
downloadllvm-d5ae4e65014f4c72d7aa61c368084ce214488a32.tar.gz
[OpenMP][libomptarget] Enable the compilation of multiple bc libraries for runtime inlining
Summary: Different NVIDIA GPUs support different compute capabilities. To enable the inlining of runtime functions and the best performance on different generations of NVIDIA GPUs, a bc library for each compute capability needs to be compiled. The same compiler build will then be usable in conjunction with multiple generations of NVIDIA GPUs. To differentiate between versions of the same bc lib, the output file name will contain the compute capability ID. Depends on D14254 Reviewers: Hahnfeld, hfinkel, carlo.bertolli, caomhin, ABataev, grokos Reviewed By: Hahnfeld, grokos Subscribers: guansong, mgorny, openmp-commits Differential Revision: https://reviews.llvm.org/D41724 llvm-svn: 324904
Diffstat (limited to 'openmp/README.rst')
-rw-r--r--openmp/README.rst8
1 files changed, 4 insertions, 4 deletions
diff --git a/openmp/README.rst b/openmp/README.rst
index a66f75bcaed5..ea79f194870a 100644
--- a/openmp/README.rst
+++ b/openmp/README.rst
@@ -280,10 +280,10 @@ Options for ``NVPTX device RTL``
compatible with NVCC, this option can be use to pass to NVCC a valid compiler
to avoid the error.
- **LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY** = ``35``
- CUDA compute capability that should be supported by the NVPTX device RTL. E.g.
- for compute capability 6.0, the option "60" should be used. Compute capability
- 3.5 is the minimum required.
+ **LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES** = ``35``
+ List of CUDA compute capabilities that should be supported by the NVPTX
+ device RTL. E.g. for compute capabilities 6.0 and 7.0, the option "60,70"
+ should be used. Compute capability 3.5 is the minimum required.
**LIBOMPTARGET_NVPTX_DEBUG** = ``OFF|ON``
Enable printing of debug messages from the NVPTX device RTL.