author | H.J. Lu <hjl.tools@gmail.com> | 2016-03-22 07:46:56 -0700 |
---|---|---|
committer | H.J. Lu <hjl.tools@gmail.com> | 2016-03-22 07:47:20 -0700 |
commit | f781a9e96138d8839663af5e88649ab1fbed74f8 (patch) | |
tree | fdb1a9b321a4cae52901cb88e52bff1f1a348a1a /ChangeLog | |
parent | b87e41378beca3c98ec3464d64835e66cc788497 (diff) | |
Set index_arch_AVX_Fast_Unaligned_Load only for Intel processors
Since only Intel processors with AVX2 have fast unaligned load, we should set index_arch_AVX_Fast_Unaligned_Load only for Intel processors.

Move AVX, AVX2, AVX512, FMA and FMA4 detection into get_common_indeces and call get_common_indeces for other processors.

Add CPU_FEATURES_CPU_P and CPU_FEATURES_ARCH_P to avoid loading GLRO(dl_x86_cpu_features) in cpu-features.c.

[BZ #19583]
* sysdeps/x86/cpu-features.c (get_common_indeces): Remove inline. Check family before setting family, model and extended_model. Set AVX, AVX2, AVX512, FMA and FMA4 usable bits here.
(init_cpu_features): Replace HAS_CPU_FEATURE and HAS_ARCH_FEATURE with CPU_FEATURES_CPU_P and CPU_FEATURES_ARCH_P. Set index_arch_AVX_Fast_Unaligned_Load for Intel processors with usable AVX2. Call get_common_indeces for other processors with family == NULL.
* sysdeps/x86/cpu-features.h (CPU_FEATURES_CPU_P): New macro.
(CPU_FEATURES_ARCH_P): Likewise.
(HAS_CPU_FEATURE): Use CPU_FEATURES_CPU_P.
(HAS_ARCH_FEATURE): Use CPU_FEATURES_ARCH_P.
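The two ideas in this message (testing feature bits through a pointer the caller already holds, and setting the fast-unaligned-load bit only when the vendor is Intel and AVX2 is usable) can be sketched roughly as follows. This is a simplified illustration, not the code from this commit: every `sketch_`/`SKETCH_` name, the struct layout, and the bit values are hypothetical stand-ins, and the real macros and bit indices in sysdeps/x86/cpu-features.h differ.

```c
#include <stdbool.h>

/* Hypothetical, simplified stand-ins for illustration only; these are
   not glibc's actual types, bit layouts, or macro definitions.  */
enum sketch_vendor { SKETCH_INTEL, SKETCH_AMD, SKETCH_OTHER };

#define SKETCH_AVX2_USABLE              (1u << 0)
#define SKETCH_AVX_FAST_UNALIGNED_LOAD  (1u << 1)

struct sketch_cpu_features
{
  enum sketch_vendor vendor;
  unsigned int arch_bits;
};

/* Test a bit through a pointer the caller already holds, instead of
   re-reading a global structure on every check.  */
#define SKETCH_ARCH_P(ptr, bit) (((ptr)->arch_bits & (bit)) != 0)

/* Detection shared by every vendor: record whether AVX2 is usable.  */
static void
sketch_common_init (struct sketch_cpu_features *cf, bool avx2_usable)
{
  cf->arch_bits = 0;
  if (avx2_usable)
    cf->arch_bits |= SKETCH_AVX2_USABLE;
}

/* Vendor-specific step: only Intel with usable AVX2 gets the
   fast-unaligned-load bit.  */
static void
sketch_init (struct sketch_cpu_features *cf, enum sketch_vendor vendor,
             bool avx2_usable)
{
  cf->vendor = vendor;
  sketch_common_init (cf, avx2_usable);
  if (vendor == SKETCH_INTEL && SKETCH_ARCH_P (cf, SKETCH_AVX2_USABLE))
    cf->arch_bits |= SKETCH_AVX_FAST_UNALIGNED_LOAD;
}
```

Under these assumptions, calling `sketch_init` for a non-Intel vendor with AVX2 usable leaves `SKETCH_AVX2_USABLE` set but `SKETCH_AVX_FAST_UNALIGNED_LOAD` clear, which mirrors the behavior the message describes.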
Diffstat (limited to 'ChangeLog')
-rw-r--r-- | ChangeLog | 17 |
1 files changed, 17 insertions, 0 deletions
diff --git a/ChangeLog b/ChangeLog
index 36360d9134..522bc4fbc8 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -1,3 +1,20 @@
+2016-03-22 H.J. Lu <hongjiu.lu@intel.com>
+
+	[BZ #19583]
+	* sysdeps/x86/cpu-features.c (get_common_indeces): Remove
+	inline. Check family before setting family, model and
+	extended_model. Set AVX, AVX2, AVX512, FMA and FMA4 usable
+	bits here.
+	(init_cpu_features): Replace HAS_CPU_FEATURE and
+	HAS_ARCH_FEATURE with CPU_FEATURES_CPU_P and
+	CPU_FEATURES_ARCH_P. Set index_arch_AVX_Fast_Unaligned_Load
+	for Intel processors with usable AVX2. Call get_common_indeces
+	for other processors with family == NULL.
+	* sysdeps/x86/cpu-features.h (CPU_FEATURES_CPU_P): New macro.
+	(CPU_FEATURES_ARCH_P): Likewise.
+	(HAS_CPU_FEATURE): Use CPU_FEATURES_CPU_P.
+	(HAS_ARCH_FEATURE): Use CPU_FEATURES_ARCH_P.
+
 2016-03-22 Samuel Thibault <samuel.thibault@ens-lyon.org>
 
 	* malloc/Makefile ($(objpfx)tst-malloc-backtrace,