diff options
author | H.J. Lu <hjl.tools@gmail.com> | 2018-01-08 08:04:26 -0800 |
---|---|---|
committer | H.J. Lu <hjl.tools@gmail.com> | 2018-01-08 08:04:40 -0800 |
commit | c70e4e9c9efff9df4c847dd7cfd81bae674219ab (patch) | |
tree | 46cbbfb74a8c03e933fc4245c66559def374b1a8 /NEWS | |
parent | 579396ee082565ab5f42ff166a264891223b7b82 (diff) | |
download | glibc-c70e4e9c9efff9df4c847dd7cfd81bae674219ab.tar.gz glibc-c70e4e9c9efff9df4c847dd7cfd81bae674219ab.tar.xz glibc-c70e4e9c9efff9df4c847dd7cfd81bae674219ab.zip |
x86-64: Add sincosf with vector FMA
Since the x86-64 assembly version of sincosf is higly optimized with vector instructions, there isn't much room for improvement. However s_sincosf.c written in C with vector math and intrinsics can be optimized by GCC with FMA. On Skylake, bench-sincosf reports performance improvement: Assembly FMA improvement max 104.042 101.008 3% min 9.426 8.586 10% mean 20.6209 18.2238 13% * sysdeps/x86_64/fpu/multiarch/Makefile (libm-sysdep_routines): Add s_sincosf-sse2 and s_sincosf-fma. (CFLAGS-s_sincosf-fma.c): New. * sysdeps/x86_64/fpu/multiarch/s_sincosf-fma.c: New file. * sysdeps/x86_64/fpu/multiarch/s_sincosf-sse2.S: Likewise. * sysdeps/x86_64/fpu/multiarch/s_sincosf.c: Likewise. * sysdeps/x86_64/fpu/s_sincosf.S: Don't add alias if __sincosf is defined.
Diffstat (limited to 'NEWS')
-rw-r--r-- | NEWS | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/NEWS b/NEWS index 0d6f826dcc..75bf46776a 100644 --- a/NEWS +++ b/NEWS @@ -19,8 +19,8 @@ Major new features: supported on i386, x86_64, x32 and aarch64. * Optimized x86-64 asin, atan2, exp, expf, log, pow, atan, sin, cosf, - sinf and tan with FMA, contributed by Arjan van de Ven and H.J. Lu - from Intel. + sinf, sincosf and tan with FMA, contributed by Arjan van de Ven and + H.J. Lu from Intel. * Optimized x86-64 trunc and truncf for processors with SSE4.1. |