diff options
author | H.J. Lu <hjl.tools@gmail.com> | 2016-03-22 07:46:56 -0700 |
---|---|---|
committer | H.J. Lu <hjl.tools@gmail.com> | 2016-03-22 07:47:20 -0700 |
commit | f781a9e96138d8839663af5e88649ab1fbed74f8 (patch) | |
tree | fdb1a9b321a4cae52901cb88e52bff1f1a348a1a /ChangeLog | |
parent | b87e41378beca3c98ec3464d64835e66cc788497 (diff) | |
download | glibc-f781a9e96138d8839663af5e88649ab1fbed74f8.tar glibc-f781a9e96138d8839663af5e88649ab1fbed74f8.tar.gz glibc-f781a9e96138d8839663af5e88649ab1fbed74f8.tar.bz2 glibc-f781a9e96138d8839663af5e88649ab1fbed74f8.zip |
Set index_arch_AVX_Fast_Unaligned_Load only for Intel processors
Since only Intel processors with AVX2 have fast unaligned load, we
should set index_arch_AVX_Fast_Unaligned_Load only for Intel processors.
Move AVX, AVX2, AVX512, FMA and FMA4 detection into get_common_indeces
and call get_common_indeces for other processors.
Add CPU_FEATURES_CPU_P and CPU_FEATURES_ARCH_P to aoid loading
GLRO(dl_x86_cpu_features) in cpu-features.c.
[BZ #19583]
* sysdeps/x86/cpu-features.c (get_common_indeces): Remove
inline. Check family before setting family, model and
extended_model. Set AVX, AVX2, AVX512, FMA and FMA4 usable
bits here.
(init_cpu_features): Replace HAS_CPU_FEATURE and
HAS_ARCH_FEATURE with CPU_FEATURES_CPU_P and
CPU_FEATURES_ARCH_P. Set index_arch_AVX_Fast_Unaligned_Load
for Intel processors with usable AVX2. Call get_common_indeces
for other processors with family == NULL.
* sysdeps/x86/cpu-features.h (CPU_FEATURES_CPU_P): New macro.
(CPU_FEATURES_ARCH_P): Likewise.
(HAS_CPU_FEATURE): Use CPU_FEATURES_CPU_P.
(HAS_ARCH_FEATURE): Use CPU_FEATURES_ARCH_P.
Diffstat (limited to 'ChangeLog')
-rw-r--r-- | ChangeLog | 17 |
1 files changed, 17 insertions, 0 deletions
@@ -1,3 +1,20 @@ +2016-03-22 H.J. Lu <hongjiu.lu@intel.com> + + [BZ #19583] + * sysdeps/x86/cpu-features.c (get_common_indeces): Remove + inline. Check family before setting family, model and + extended_model. Set AVX, AVX2, AVX512, FMA and FMA4 usable + bits here. + (init_cpu_features): Replace HAS_CPU_FEATURE and + HAS_ARCH_FEATURE with CPU_FEATURES_CPU_P and + CPU_FEATURES_ARCH_P. Set index_arch_AVX_Fast_Unaligned_Load + for Intel processors with usable AVX2. Call get_common_indeces + for other processors with family == NULL. + * sysdeps/x86/cpu-features.h (CPU_FEATURES_CPU_P): New macro. + (CPU_FEATURES_ARCH_P): Likewise. + (HAS_CPU_FEATURE): Use CPU_FEATURES_CPU_P. + (HAS_ARCH_FEATURE): Use CPU_FEATURES_ARCH_P. + 2016-03-22 Samuel Thibault <samuel.thibault@ens-lyon.org> * malloc/Makefile ($(objpfx)tst-malloc-backtrace, |