mirror/musl - mirror of git://git.musl-libc.org/musl

	Commit message (Collapse)	Author	Age	Files	Lines
*	rename i386 exp.s to exp_ld.s	Rich Felker	2020-02-06	1	-146/+0
\| \| \| \|	this commit is for the sake of reviewable history.
*	fix x87 stack imbalance in corner cases of i386 math asm	Rich Felker	2019-08-05	1	-8/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	commit 31c5fb80b9eae86f801be4f46025bc6532a554c5 introduced underflow code paths for the i386 math asm, along with checks on the fpu status word to skip the underflow-generation instructions if the underflow flag was already raised. unfortunately, at least one such path, in log1p, returned with 2 items on the x87 stack rather than just 1 item for the return value. this is a violation of the ABI's calling convention, and could cause subsequent floating point code to produce NANs due to x87 stack overflow. if floating point results are used in flow control, this can lead to runaway wrong code execution. rather than reviewing each "underflow already raised" code path for correctness, remove them all. they're likely slower than just performing the underflow code unconditionally, and significantly more complex. all of this code should be ripped out and replaced by C source files with inline asm. doing so would preclude this kind of error by having the compiler perform all x87 stack register allocation and stack manipulation, and would produce comparable or better code. however such a change is a much larger project.
*	remove the last of possible-textrels from i386 asm	Rich Felker	2015-04-18	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	none of these are actual textrels because of ld-time binding performed by -Bsymbolic-functions, but I'm changing them with the goal of making ld-time binding purely an optimization rather than relying on it for semantic purposes. in the case of memmove's call to memcpy, making it explicit that the memmove asm is assuming the forward-copying behavior of the memcpy asm is desirable anyway; in case memcpy is ever changed, the semantic mismatch would be apparent while editing memmcpy.s.
*	math: fix exp2l asm on x86 (raise underflow correctly)	Szabolcs Nagy	2013-09-05	1	-32/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	there were two problems: * omitted underflow on subnormal results: exp2l(-16383.5) was calculated as sqrt(2)2^-16384, the last bits of sqrt(2) are zero so the down scaling does not underflow eventhough the result is in subnormal range spurious underflow for subnormal inputs: exp2l(0x1p-16400) was evaluated as f2xm1(x)+1 and f2xm1 raised underflow (because inexact subnormal result) the first issue is fixed by raising underflow manually if x is in (-32768,-16382] and not integer (x-0x1p63+0x1p63 != x) the second issue is fixed by treating x in (-0x1p64,0x1p64) specially for these fixes the special case handling was completely rewritten
*	math: fix x86 asin, atan, exp, log1p to raise underflow	Szabolcs Nagy	2013-08-15	1	-2/+35
\| \| \| \| \| \|	underflow is raised by an inexact subnormal float store, since subnormal operations are slow, check the underflow flag and skip the store if it's already raised
*	math: fix i386/expl.s with more precise x*log2e	Szabolcs Nagy	2012-12-14	1	-6/+0
\| \| \| \| \| \| \| \| \|	with naive exp2l(xlog2e) the last 12bits of the result was incorrect for x with large absolute value with hi + lo = xlog2e is caluclated to 128 bits precision and then expl(x) = exp2l(hi) + exp2l(hi) * f2xm1(lo) this gives <1.5ulp measured error everywhere in nearest rounding mode
*	math: fix exp.s on i386 and x86_64 so the exception flags are correct	nsz	2012-08-08	1	-21/+18
\| \| \| \|	exp(inf), exp(-inf), exp(nan) used to raise wrong flags
*	fix exp asm	Rich Felker	2012-03-19	1	-23/+22
\| \| \| \| \| \| \| \| \| \| \| \|	exponents (base 2) near 16383 were broken due to (1) wrong cutoff, and (2) inability to fit the necessary range of scalings into a long double value. as a solution, we fall back to using frndint/fscale for insanely large exponents, and also have to special-case infinities here to avoid inf-inf generating nan. thankfully the costly code never runs in normal usage cases.
*	optimize exponential asm for i386	Rich Felker	2012-03-19	1	-11/+76
\| \| \| \| \| \|	up to 30% faster exp2 by avoiding slow frndint and fscale functions. expm1 also takes a much more direct path for small arguments (the expected usage case).
*	fix broken exponential asm	Rich Felker	2012-03-18	1	-0/+9
\| \| \| \| \| \| \| \| \|	infinities were getting converted into nans. the new code simply tests for infinity and replaces it with a large magnitude value of the same sign. also, the fcomi instruction is apparently not part of the i387 instruction set, so avoid using it.
*	asm exponential functions for i386	Rich Felker	2012-03-18	1	-0/+46