diff options
author | H.J. Lu <hjl.tools@gmail.com> | 2017-10-22 08:03:46 -0700 |
---|---|---|
committer | H.J. Lu <hjl.tools@gmail.com> | 2017-10-22 08:05:15 -0700 |
commit | 0ccc7153ccd8fbecefb0793a65cd5a85cdb099fc (patch) | |
tree | e1e3035d3f0048d9b822ce6a4c9128f90ce5e946 /sysdeps/x86_64/fpu/multiarch/Makefile | |
parent | fe596486d694e657413d0d4c5a04598674ff71b1 (diff) | |
download | glibc-0ccc7153ccd8fbecefb0793a65cd5a85cdb099fc.tar glibc-0ccc7153ccd8fbecefb0793a65cd5a85cdb099fc.tar.gz glibc-0ccc7153ccd8fbecefb0793a65cd5a85cdb099fc.tar.bz2 glibc-0ccc7153ccd8fbecefb0793a65cd5a85cdb099fc.zip |
x86-64: Add logf with FMA
For workload-spec2017.wrf, on Skylake, it improves performance by:
Before After Improvement
reciprocal-throughput 16.1534 13.8874 16%
latency 41.9642 34.3072 22%
* sysdeps/x86_64/fpu/multiarch/Makefile (libm-sysdep_routines):
Add e_logf-fma.
(CFLAGS-e_logf-fma.c): New.
* sysdeps/x86_64/fpu/multiarch/e_logf-fma.c: New file.
* sysdeps/x86_64/fpu/multiarch/e_logf.c: Likewise.
Diffstat (limited to 'sysdeps/x86_64/fpu/multiarch/Makefile')
-rw-r--r-- | sysdeps/x86_64/fpu/multiarch/Makefile | 3 |
1 files changed, 2 insertions, 1 deletions
diff --git a/sysdeps/x86_64/fpu/multiarch/Makefile b/sysdeps/x86_64/fpu/multiarch/Makefile index 71a3b337ed..97d886aa59 100644 --- a/sysdeps/x86_64/fpu/multiarch/Makefile +++ b/sysdeps/x86_64/fpu/multiarch/Makefile @@ -37,10 +37,11 @@ CFLAGS-slowpow-fma.c = -mfma -mavx2 CFLAGS-s_sin-fma.c = -mfma -mavx2 CFLAGS-s_tan-fma.c = -mfma -mavx2 -libm-sysdep_routines += e_exp2f-fma e_expf-fma +libm-sysdep_routines += e_exp2f-fma e_expf-fma e_logf-fma CFLAGS-e_exp2f-fma.c = -mfma -mavx2 CFLAGS-e_expf-fma.c = -mfma -mavx2 +CFLAGS-e_logf-fma.c = -mfma -mavx2 libm-sysdep_routines += e_exp-fma4 e_log-fma4 e_pow-fma4 s_atan-fma4 \ e_asin-fma4 e_atan2-fma4 s_sin-fma4 s_tan-fma4 \ |