From f338c7c5f526a86be2de7205d1e0876ff02e2087 Mon Sep 17 00:00:00 2001 From: Adhemerval Zanella Date: Fri, 25 Oct 2024 15:21:53 -0300 Subject: math: Use log10p1f from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic log10p1f. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 68.5251 32.2627 52.92% x86_64v2 68.8912 32.7887 52.41% x86_64v3 59.3427 27.0521 54.41% i686 162.026 103.383 36.19% aarch64 26.8513 14.5695 45.74% power10 12.7426 8.4929 33.35% powerpc 16.6768 9.29135 44.29% reciprocal-throughput master patched improvement x86_64 26.0969 12.4023 52.48% x86_64v2 25.0045 11.0748 55.71% x86_64v3 20.5610 10.2995 49.91% i686 89.8842 78.5211 12.64% aarch64 17.1200 9.4832 44.61% power10 6.7814 6.4258 5.24% powerpc 15.769 7.6825 51.28% Signed-off-by: Alexei Sibidanov Signed-off-by: Paul Zimmermann Signed-off-by: Adhemerval Zanella Reviewed-by: DJ Delorie --- SHARED-FILES | 4 ++++ 1 file changed, 4 insertions(+) (limited to 'SHARED-FILES') diff --git a/SHARED-FILES b/SHARED-FILES index 7fa54a5c9d..228f415dfd 100644 --- a/SHARED-FILES +++ b/SHARED-FILES @@ -264,3 +264,7 @@ sysdeps/ieee754/flt-32/s_log1pf.c (file src/binary32/log1p/log1pf.c in CORE-MATH) - The code was adapted to use glibc code style and internal functions to handle errno, overflow, and underflow. +sysdeps/ieee754/flt-32/s_log10p1f.c + (file src/binary32/log10p1/log10p1f.c in CORE-MATH) + - The code was adapted to use glibc code style and internal + functions to handle errno, overflow, and underflow. -- cgit 1.4.1