about summary refs log tree commit diff
path: root/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S
diff options
context:
space:
mode:
authorNoah Goldstein <goldstein.w.n@gmail.com>2021-04-19 17:48:10 -0400
committerH.J. Lu <hjl.tools@gmail.com>2021-04-19 15:08:04 -0700
commitf53790272ce7bdc5ecd14b45f65d0464d2a61a3a (patch)
tree080054bf654d57c524271f86b8195d602d881ae4 /sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S
parent83c5b368226c34a2f0a5287df40fc290b2b34359 (diff)
downloadglibc-f53790272ce7bdc5ecd14b45f65d0464d2a61a3a.tar.gz
glibc-f53790272ce7bdc5ecd14b45f65d0464d2a61a3a.tar.xz
glibc-f53790272ce7bdc5ecd14b45f65d0464d2a61a3a.zip
x86: Optimize less_vec evex and avx512 memset-vec-unaligned-erms.S
No bug. This commit adds optimized cased for less_vec memset case that
uses the avx512vl/avx512bw mask store avoiding the excessive
branches. test-memset and test-wmemset are passing.

Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>
Diffstat (limited to 'sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S')
-rw-r--r--sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S2
1 files changed, 1 insertions, 1 deletions
diff --git a/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S b/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S
index ae0a4d6e46..640f092903 100644
--- a/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S
+++ b/sysdeps/x86_64/multiarch/memset-evex-unaligned-erms.S
@@ -19,6 +19,6 @@
 # define SECTION(p)		p##.evex
 # define MEMSET_SYMBOL(p,s)	p##_evex_##s
 # define WMEMSET_SYMBOL(p,s)	p##_evex_##s
-
+# define USE_LESS_VEC_MASK_STORE	1
 # include "memset-vec-unaligned-erms.S"
 #endif