about summary refs log tree commit diff
path: root/sysdeps/powerpc
diff options
context:
space:
mode:
authorAdhemerval Zanella <adhemerval.zanella@linaro.org>2024-11-08 13:24:28 -0300
committerAdhemerval Zanella <adhemerval.zanella@linaro.org>2024-11-22 10:52:27 -0300
commitbccb0648ea29f89a7b1b64f3e5674d2338e3798e (patch)
tree31849761a6d493625977c973db58c50669935fc9 /sysdeps/powerpc
parentd846f4c12d7636efd5b7cff173456e616a185e24 (diff)
downloadglibc-bccb0648ea29f89a7b1b64f3e5674d2338e3798e.tar.gz
glibc-bccb0648ea29f89a7b1b64f3e5674d2338e3798e.tar.xz
glibc-bccb0648ea29f89a7b1b64f3e5674d2338e3798e.zip
math: Use tanf from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows better performance to the generic tanf.

The code was adapted to glibc style, to use the definition of
math_config.h, to remove errno handling, and to use a generic
128 bit routine for ABIs that do not support it natively.

Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (neoverse1,
gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1):

latency                       master       patched  improvement
x86_64                       82.3961       54.8052       33.49%
x86_64v2                     82.3415       54.8052       33.44%
x86_64v3                     69.3661       50.4864       27.22%
i686                         219.271       45.5396       79.23%
aarch64                      29.2127       19.1951       34.29%
power10                      19.5060       16.2760       16.56%

reciprocal-throughput         master       patched  improvement
x86_64                       28.3976       19.7334       30.51%
x86_64v2                     28.4568       19.7334       30.65%
x86_64v3                     21.1815       16.1811       23.61%
i686                         105.016       15.1426       85.58%
aarch64                      18.1573       10.7681       40.70%
power10                       8.7207        8.7097        0.13%

Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
Diffstat (limited to 'sysdeps/powerpc')
-rw-r--r--sysdeps/powerpc/fpu/libm-test-ulps4
-rw-r--r--sysdeps/powerpc/nofpu/libm-test-ulps4
2 files changed, 0 insertions, 8 deletions
diff --git a/sysdeps/powerpc/fpu/libm-test-ulps b/sysdeps/powerpc/fpu/libm-test-ulps
index 56ca580497..adaa10fb56 100644
--- a/sysdeps/powerpc/fpu/libm-test-ulps
+++ b/sysdeps/powerpc/fpu/libm-test-ulps
@@ -1737,25 +1737,21 @@ double: 1
 float: 1
 
 Function: "tan":
-float: 3
 float128: 1
 ldouble: 2
 
 Function: "tan_downward":
 double: 1
-float: 3
 float128: 1
 ldouble: 3
 
 Function: "tan_towardzero":
 double: 1
-float: 3
 float128: 1
 ldouble: 2
 
 Function: "tan_upward":
 double: 1
-float: 3
 float128: 1
 ldouble: 3
 
diff --git a/sysdeps/powerpc/nofpu/libm-test-ulps b/sysdeps/powerpc/nofpu/libm-test-ulps
index 752d1937c6..2c44e39c2c 100644
--- a/sysdeps/powerpc/nofpu/libm-test-ulps
+++ b/sysdeps/powerpc/nofpu/libm-test-ulps
@@ -1476,22 +1476,18 @@ double: 1
 float: 1
 
 Function: "tan":
-float: 1
 ldouble: 2
 
 Function: "tan_downward":
 double: 1
-float: 2
 ldouble: 3
 
 Function: "tan_towardzero":
 double: 1
-float: 1
 ldouble: 2
 
 Function: "tan_upward":
 double: 1
-float: 1
 ldouble: 3
 
 Function: "tanh":