about summary refs log tree commit diff
path: root/io/readlinkat.c
diff options
context:
space:
mode:
authorH.J. Lu <hjl.tools@gmail.com>2017-05-23 11:25:19 -0700
committerH.J. Lu <hjl.tools@gmail.com>2017-06-05 15:09:59 -0700
commitce40306fcc3edb2baade47e8050c975c5ecba980 (patch)
tree831ce2088ee325b63821b97bc62a5daf0e5973a6 /io/readlinkat.c
parent2aa22acfbbbb26a2e585ff62fef1ebdd290d9d85 (diff)
downloadglibc-ce40306fcc3edb2baade47e8050c975c5ecba980.tar.gz
glibc-ce40306fcc3edb2baade47e8050c975c5ecba980.tar.xz
glibc-ce40306fcc3edb2baade47e8050c975c5ecba980.zip
x86-64: Optimize memrchr with AVX2
Optimize memrchr with AVX2 to search 32 bytes with a single vector
compare instruction.  It is as fast as SSE2 memrchr for small data
sizes and up to 1X faster for large data sizes on Haswell.  Select
AVX2 memrchr on AVX2 machines where vzeroupper is preferred and AVX
unaligned load is fast.

	* sysdeps/x86_64/multiarch/Makefile (sysdep_routines): Add
	memrchr-sse2 and memrchr-avx2.
	* sysdeps/x86_64/multiarch/ifunc-impl-list.c
	(__libc_ifunc_impl_list): Add tests for __memrchr_avx2 and
	__memrchr_sse2.
	* sysdeps/x86_64/multiarch/memrchr-avx2.S: New file.
	* sysdeps/x86_64/multiarch/memrchr-sse2.S: Likewise.
	* sysdeps/x86_64/multiarch/memrchr.c: Likewise.
Diffstat (limited to 'io/readlinkat.c')
0 files changed, 0 insertions, 0 deletions