Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | x86: Optimize less_vec evex and avx512 memset-vec-unaligned-erms.S | Noah Goldstein | 2021-04-19 | 1 | -1/+1 |
| | | | | | | | | No bug. This commit adds optimized cased for less_vec memset case that uses the avx512vl/avx512bw mask store avoiding the excessive branches. test-memset and test-wmemset are passing. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com> | ||||
* | x86-64: Add memset family functions with 256-bit EVEX | H.J. Lu | 2021-03-29 | 1 | -0/+24 |
Update ifunc-memset.h/ifunc-wmemset.h to select the function optimized with 256-bit EVEX instructions using YMM16-YMM31 registers to avoid RTM abort with usable AVX512VL and AVX512BW since VZEROUPPER isn't needed at function exit. |