diff options
author | Dennis Zhang <dennis.zhang@arm.com> | 2020-03-03 13:13:31 +0000 |
---|---|---|
committer | Dennis Zhang <dennis.zhang@arm.com> | 2020-03-03 13:13:31 +0000 |
commit | 8e6d0dba166324f4b257329bd4b4ddc2b4522359 (patch) | |
tree | 128f011c82f3d7c24afbeb977c3caddc475d5cfc /gcc/config/arm/arm_bf16.h | |
parent | 9b4f00dd3f799337d8b8ef5e79f5a682c8059ab9 (diff) | |
download | gcc-8e6d0dba166324f4b257329bd4b4ddc2b4522359.tar.gz |
arm: ACLE BFloat16 convert intrinsics
This patch is part of a series adding support for Armv8.6-A features.
It implements intrinsics to convert between bfloat16 and float32
formats.
gcc/ChangeLog:
* config/arm/arm_bf16.h (vcvtah_f32_bf16, vcvth_bf16_f32): New.
* config/arm/arm_neon.h (vcvt_f32_bf16, vcvtq_low_f32_bf16): New.
(vcvtq_high_f32_bf16, vcvt_bf16_f32): New.
(vcvtq_low_bf16_f32, vcvtq_high_bf16_f32): New.
* config/arm/arm_neon_builtins.def (vbfcvt, vbfcvt_high): New entries.
(vbfcvtv4sf, vbfcvtv4sf_high): Likewise.
* config/arm/iterators.md (VBFCVT, VBFCVTM): New mode iterators.
(V_bf_low, V_bf_cvt_m): New mode attributes.
* config/arm/neon.md (neon_vbfcvtv4sf<VBFCVT:mode>): New.
(neon_vbfcvtv4sf_highv8bf, neon_vbfcvtsf): New.
(neon_vbfcvt<VBFCVT:mode>, neon_vbfcvt_highv8bf): New.
(neon_vbfcvtbf_cvtmode<mode>, neon_vbfcvtbf): New
* config/arm/unspecs.md (UNSPEC_BFCVT, UNSPEC_BFCVT_HIG): New.
gcc/testsuite/ChangeLog:
* gcc.target/arm/simd/bf16_cvt_1.c: New test.
Diffstat (limited to 'gcc/config/arm/arm_bf16.h')
-rw-r--r-- | gcc/config/arm/arm_bf16.h | 14 |
1 files changed, 14 insertions, 0 deletions
diff --git a/gcc/config/arm/arm_bf16.h b/gcc/config/arm/arm_bf16.h index decf23f3834..1aa593192c0 100644 --- a/gcc/config/arm/arm_bf16.h +++ b/gcc/config/arm/arm_bf16.h @@ -34,6 +34,20 @@ extern "C" { typedef __bf16 bfloat16_t; typedef float float32_t; +__extension__ extern __inline float32_t +__attribute__ ((__always_inline__, __gnu_inline__, __artificial__)) +vcvtah_f32_bf16 (bfloat16_t __a) +{ + return __builtin_neon_vbfcvtbf (__a); +} + +__extension__ extern __inline bfloat16_t +__attribute__ ((__always_inline__, __gnu_inline__, __artificial__)) +vcvth_bf16_f32 (float32_t __a) +{ + return __builtin_neon_vbfcvtsf (__a); +} + #ifdef __cplusplus } #endif |