diff options
author | Noah Goldstein <goldstein.w.n@gmail.com> | 2021-04-19 17:48:10 -0400 |
---|---|---|
committer | H.J. Lu <hjl.tools@gmail.com> | 2021-04-19 15:08:04 -0700 |
commit | f53790272ce7bdc5ecd14b45f65d0464d2a61a3a (patch) | |
tree | 080054bf654d57c524271f86b8195d602d881ae4 /sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S | |
parent | 83c5b368226c34a2f0a5287df40fc290b2b34359 (diff) | |
download | glibc-f53790272ce7bdc5ecd14b45f65d0464d2a61a3a.tar.gz glibc-f53790272ce7bdc5ecd14b45f65d0464d2a61a3a.tar.xz glibc-f53790272ce7bdc5ecd14b45f65d0464d2a61a3a.zip |
x86: Optimize less_vec evex and avx512 memset-vec-unaligned-erms.S
No bug. This commit adds optimized cased for less_vec memset case that uses the avx512vl/avx512bw mask store avoiding the excessive branches. test-memset and test-wmemset are passing. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>
Diffstat (limited to 'sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S')
-rw-r--r-- | sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S b/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S index ae0a4d6e46..640f092903 100644 --- a/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S +++ b/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S @@ -19,6 +19,6 @@ # define SECTION(p) p##.evex # define MEMSET_SYMBOL(p,s) p##_evex_##s # define WMEMSET_SYMBOL(p,s) p##_evex_##s - +# define USE_LESS_VEC_MASK_STORE 1 # include "memset-vec-unaligned-erms.S" #endif |