about summary refs log tree commit diff
path: root/src
Commit message (Collapse)AuthorAgeFilesLines
* asm for log functionsRich Felker2012-03-186-0/+42
|
* fix broken exponential asmRich Felker2012-03-182-1/+21
| | | | | | | | | infinities were getting converted into nans. the new code simply tests for infinity and replaces it with a large magnitude value of the same sign. also, the fcomi instruction is apparently not part of the i387 instruction set, so avoid using it.
* asm for lrint family on i386Rich Felker2012-03-186-0/+46
|
* asm exponential functions for i386Rich Felker2012-03-189-0/+89
|
* assembly optimizations for fmod/remainder functionsRich Felker2012-03-188-0/+88
|
* asm versions of some simple math functions for i386 and x86_64Rich Felker2012-03-188-0/+48
| | | | | | | these are functions that have direct fpu approaches to implementation without problematic exception or rounding issues. x86_64 lacks float/double versions because i'm unfamiliar with the necessary sse code for performing these operations.
* simplify lround and llround functionsnsz2012-03-186-112/+20
| | | | | Simple wrappers around round is enough because spurious inexact exception is allowed.
* make lrint and llrint functions work without fenv supportnsz2012-03-186-6/+16
|
* faster lrint and llrint functionsnsz2012-03-186-80/+99
| | | | | | | A faster workaround for spurious inexact exceptions when the result cannot be represented. The old code actually could be wrong, because gcc reordered the integer conversion and the exception check.
* fix loads of missing const in new libm, and some global vars (?!) in powlRich Felker2012-03-1820-51/+51
|
* try fixing/optimizing x86_64 fenv exception codeRich Felker2012-03-171-18/+23
| | | | untested; may need followup-fixes.
* optimize x86 feclearexceptRich Felker2012-03-171-16/+20
| | | | | if all exception flags will be cleared, we can avoid the expensive store/reload of the environment and just use the fnclex instruction.
* fix x86_64 fe[gs]etround, analogous to nsz's x86 changesRich Felker2012-03-171-8/+9
|
* minor 387 fenv optimizationsRich Felker2012-03-171-6/+5
|
* fix i386 fegetround and make fesetround fasternsz2012-03-171-10/+10
| | | | | | | | | | | | | | | | | | | | | Note that the new fesetround has slightly different semantics: Storing the floating-point environment with fnstenv makes the next fldenv (or fldcw) "non-signaling", so unmasked and pending exceptions does not invoke the exception handler. (These are rare since exceptions are handled immediately and by default all exceptions are masked anyway. But if one manually unmasks an exception in the control word then either sets the corresponding exception flag in the status word or the execution of an exception raising floating-point operation gets interrupted then it may happen). So the old implementation did not trap in some rare cases where the new implementation traps. However POSIX does not specify anything like the x87 exception handling traps and the fnstenv/fldenv pair is significantly slower than the fnstcw/fldcw pair (new code is about 5x faster here and it's dominated by the function call overhead).
* one more fenv availability issue: lroundRich Felker2012-03-171-0/+2
|
* make fma and lrint functions build without full fenv supportRich Felker2012-03-164-4/+28
| | | | | | | | this is necessary to support archs where fenv is incomplete or unavailable (presently arm). fma, fmal, and the lrint family should work perfectly fine with this change; fmaf is slightly broken with respect to rounding as it depends on non-default rounding modes to do its work.
* other side of the signgam namespace fix: use the internal nameRich Felker2012-03-163-3/+7
|
* make signgam a weak alias for an internal symbolRich Felker2012-03-161-2/+5
| | | | | otherwise, the standard C lgamma function will clobber a symbol in the namespace reserved for the application.
* fix namespace issues for lgamma, etc.Rich Felker2012-03-167-14/+25
| | | | standard functions cannot depend on nonstandard symbols
* Merge remote branch 'nsz/master'Rich Felker2012-03-1666-216/+413
|\
| * in math.h make lgamma_r and non-double bessel _GNU_SOURCE onlynsz2012-03-153-0/+3
| | | | | | | | long double and float bessel functions are no longer xsi extensions
| * efficient sincos based on sin and cosnsz2012-03-154-8/+247
| |
| * math cleanup: use 1.0f instead of 1.0Fnsz2012-03-134-6/+6
| |
| * math cleanup: use 1.0f instead of (float)1.0nsz2012-03-1325-96/+96
| |
| * remove libm.h includes when math.h and float.h are enoughnsz2012-03-1331-30/+47
| |
| * clean up __expo2.c, use a slightly better k constantnsz2012-03-132-84/+14
| |
* | remove junk sincos implementations in preparation to merge nsz's real onesRich Felker2012-03-163-24/+0
| |
* | remove special nan handling from x86 sqrt asmRich Felker2012-03-151-3/+0
| | | | | | | | | | | | | | a double precision nan, when converted to extended (80-bit) precision, will never end in 0x400, since the corresponding bits do not exist in the original double precision value. thus there's no need to waste time and code size on this check.
* | simplify nan check in sqrt (x86 asm); result of sqrt is never negativeRich Felker2012-03-151-4/+3
| |
* | implement sincosf and sincosl functions; add prototypesRich Felker2012-03-152-0/+16
| | | | | | | | | | presumably broken gcc may generate calls to these, and it's said that ffmpeg makes use of sincosf.
* | avoid changing NaNs in sqrt (x86 asm) to satisfy c99 f.9 recommendationRich Felker2012-03-151-0/+4
| |
* | correctly rounded sqrt() asm for x86 (i387)Rich Felker2012-03-151-0/+16
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | the fsqrt opcode is correctly rounded, but only in the fpu's selected precision mode, which is 80-bit extended precision. to get a correctly rounded double precision output, we check for the only corner cases where two-step rounding could give different results than one-step (extended-precision mantissa ending in 0x400) and adjust the mantissa slightly in the opposite direction of the rounding which the fpu already did (reported in the c1 flag of the fpu status word). this should have near-zero cost in the non-corner cases and at worst very low cost. note that in order for sqrt() to get used when compiling with gcc, the broken, non-conformant builtin sqrt must be disabled.
* | correct rounding for i387 sqrtf functionRich Felker2012-03-131-0/+2
| |
* | fix scanf handling of "0" (followed by immediate EOF) with "%x"Rich Felker2012-03-131-11/+6
|/ | | | | | | | other cases with %x were probably broken too. I would actually like to go ahead and replace this code in scanf with calls to the new __intparse framework, but for now this calls for a quick and unobtrusive fix without the risk of breaking other things.
* implement nan, nanf, nanlRich Felker2012-03-133-0/+18
|
* first commit of the new libm!Rich Felker2012-03-13375-7729/+20193
| | | | | | | | | | | | | | | | thanks to the hard work of Szabolcs Nagy (nsz), identifying the best (from correctness and license standpoint) implementations from freebsd and openbsd and cleaning them up! musl should now fully support c99 float and long double math functions, and has near-complete complex math support. tgmath should also work (fully on gcc-compatible compilers, and mostly on any c99 compiler). based largely on commit 0376d44a890fea261506f1fc63833e7a686dca19 from nsz's libm git repo, with some additions (dummy versions of a few missing long double complex functions, etc.) by me. various cleanups still need to be made, including re-adding (if they're correct) some asm functions that were dropped.
* fix obscure bug in strtoull reading the highest 16 possible valuesRich Felker2012-03-021-1/+1
|
* remove debug cruft that was left in getdateRich Felker2012-03-021-2/+0
|
* first try at implementing getdate functionRich Felker2012-03-021-0/+47
|
* fix bugs in strptime handling of string day/month names, literalsRich Felker2012-03-021-0/+2
|
* implement a64l and l64a (legacy xsi stuff)Rich Felker2012-03-011-0/+26
|
* add all missing wchar functions except floating point parsersRich Felker2012-03-0111-0/+83
| | | | | these are mostly untested and adapted directly from corresponding byte string functions and similar.
* support null buffer argument to getcwd, auto-allocating behaviorRich Felker2012-03-011-1/+6
| | | | | | | this is a popular extension some programs depend on, and by using a temporary buffer and strdup rather than malloc prior to the syscall, i've avoided the dependency on free and thus minimized the bloat cost of supporting this feature.
* implement wcsftime functionRich Felker2012-02-281-0/+32
|
* fix pthread_cleanup_pop(1) crash in non-thread-capable, static-linked programsRich Felker2012-02-282-2/+2
|
* work around "signal loses thread pointer" issue with "approach 2"Rich Felker2012-02-272-2/+8
| | | | | | | | | | | | this was discussed on the mailing list and no consensus on the preferred solution was reached, so in anticipation of a release, i'm just committing a minimally-invasive solution that avoids the problem by ensuring that multi-threaded-capable programs will always have initialized the thread pointer before any signal handler can run. in the long term we may switch to initializing the thread pointer at program start time whenever the program has the potential to access any per-thread data.
* new attempt at working around the gcc 3 visibility bugRich Felker2012-02-243-0/+11
| | | | | since gcc is failing to generate the necessary ".hidden" directive in the output asm, generate it explicitly with an __asm__ statement...
* remove useless attribute visibility from definitionsRich Felker2012-02-242-2/+2
| | | | | | this was a failed attempt at working around the gcc 3 visibility bug affecting x86_64. subsequent patch will address it with an ugly but working hack.
* cleanup and work around visibility bug in gcc 3 that affects x86_64Rich Felker2012-02-234-12/+15
| | | | | | | | | | | | | | in gcc 3, the visibility attribute must be placed on both the declaration and on the definition. if it's omitted from the definition, the compiler fails to emit the ".hidden" directive in the assembly, and the linker will either generate textrels (if supported, such as on i386) or refuse to link (on targets where certain types of textrels are forbidden or impossible without further assumptions about memory layout, such as on x86_64). this patch also unifies the decision about when to use visibility into libc.h and makes the visibility in the utf-8 state machine tables based on libc.h rather than a duplicate test.