Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Use IFUNC on x86-64 memset | H.J. Lu | 2010-11-08 | 9 | -156/+369 |
| | |||||
* | 32bit memset-sse2.S fails with uneven cache size | Ulrich Drepper | 2010-11-05 | 1 | -2/+18 |
| | | | | | | | | | 32bit memset-sse2.S assumes the cache size is a multiple of 128 bytes. If that isn't true, memset-sse2.S will fail. For example, a processor can have 24576 KB L3 cache and 20 cores. That is 2516582 bytes per core. Half of it is 1258291, which isn't helpful for vector instructions. This patch rounds cache sizes to a multiple of 256 bytes and adds "raw" cache sizes. | ||||
* | Fix x86-64 strchr propagation of search byte into all bytes of SSE register | Richard Li | 2010-10-25 | 1 | -2/+2 |
| | |||||
* | Provide FP_FAST_FMA{,F,L} definitions for x86/x86-64. | Ulrich Drepper | 2010-10-19 | 1 | -1/+15 |
| | |||||
* | Implement accurate fma. | Jakub Jelinek | 2010-10-13 | 1 | -2/+2 |
| | |||||
* | Correct implementation of fmaf. | Jakub Jelinek | 2010-10-11 | 3 | -4/+6 |
| | |||||
* | Fix handling of tail bytes of buffer in SSE2/SSSE3 x86-64 version strn{,case}cmp | Ulrich Drepper | 2010-10-03 | 1 | -16/+16 |
| | |||||
* | Re-enable all strncasecmp versions. | Ulrich Drepper | 2010-09-20 | 1 | -2/+0 |
| | |||||
* | Fix limit detection in x86-64 SSE2 strncasecmp. | Ulrich Drepper | 2010-09-20 | 2 | -1/+3 |
| | |||||
* | Update x86-64 mpn routines from GMP 5.0.1. | Ulrich Drepper | 2010-09-02 | 7 | -182/+467 |
| | |||||
* | Fix typo in last commit. | Ulrich Drepper | 2010-08-26 | 1 | -3/+3 |
| | |||||
* | Move slow Atom code to separate section. | Ulrich Drepper | 2010-08-26 | 1 | -1/+1 |
| | |||||
* | Shorten x86-64 strlen a bit. | Ulrich Drepper | 2010-08-26 | 1 | -7/+3 |
| | |||||
* | Unroll x86-64 strlen | H.J. Lu | 2010-08-26 | 5 | -90/+477 |
| | |||||
* | Missing comma in last commit. | H.J. Lu | 2010-08-26 | 1 | -1/+1 |
| | |||||
* | Clean up warnings in new x86_64/multiarch code. | Roland McGrath | 2010-08-25 | 5 | -21/+63 |
| | |||||
* | Unroll 32bit SSE strlen and handle slow bsf | H.J. Lu | 2010-08-25 | 2 | -0/+12 |
| | |||||
* | Add missing copyright year updates and pretty printing. | Ulrich Drepper | 2010-08-24 | 3 | -3/+3 |
| | |||||
* | Clean up SSE variable shifts | Richard Henderson | 2010-08-24 | 6 | -354/+73 |
| | |||||
* | Fix two typos in x86-64 SSE4.2 strncasecmp implementation. | Ulrich Drepper | 2010-08-19 | 1 | -2/+2 |
| | |||||
* | Fix fourth parameter of SSE4.2 strcmp for x86-64. | Ulrich Drepper | 2010-08-15 | 1 | -3/+3 |
| | |||||
* | Use correct register for fourth parameter of x86-64 strncasecmp_l. | Ulrich Drepper | 2010-08-15 | 1 | -3/+3 |
| | |||||
* | Undo incorrect change. | Ulrich Drepper | 2010-08-15 | 1 | -1/+1 |
| | |||||
* | Add optimized strncasecmp versions for x86-64. | Ulrich Drepper | 2010-08-14 | 10 | -283/+435 |
| | |||||
* | Fix x86-64 build without multiarch. | Ulrich Drepper | 2010-08-14 | 2 | -1/+1 |
| | |||||
* | x86: Add support for frame pointer less mcount | Andi Kleen | 2010-08-07 | 1 | -0/+38 |
| | |||||
* | Add support for SSSE3 and SSE4.2 versions of strcasecmp on x86-64. | Ulrich Drepper | 2010-07-31 | 5 | -22/+384 |
| | |||||
* | Pretty printing x86-64 SSE4.2 strcmp. | Ulrich Drepper | 2010-07-30 | 1 | -29/+29 |
| | |||||
* | Implement optimized strcasecmp for x86-64. | Ulrich Drepper | 2010-07-30 | 6 | -2/+160 |
| | |||||
* | Fix tolower operation in strcasestr. | Ulrich Drepper | 2010-07-30 | 1 | -1/+1 |
| | |||||
* | Avoid compiling unneeded file in ld.so. | Ulrich Drepper | 2010-07-27 | 1 | -3/+5 |
| | |||||
* | Add optimized x86-64 implementation of strnlen. | Ulrich Drepper | 2010-07-26 | 1 | -0/+64 |
| | | | | | While at it, beef up the test suite for strnlen and add performance tests for it, too. | ||||
* | Speed up x86-64 strcasestr a bit more. | Ulrich Drepper | 2010-07-24 | 1 | -5/+11 |
| | | | | | Using the new SSE4.2 instructions is cool but not really the fastest. Some older SSE instructions can do the trick faster. | ||||
* | Add strcasestr-nonascii to i386 build | Andreas Schwab | 2010-07-21 | 2 | -7/+10 |
| | |||||
* | Fix non-ASCII case of SSE4.2 strcasestr. | Ulrich Drepper | 2010-07-16 | 1 | -0/+2 |
| | |||||
* | Speed up SSE4.2 strcasestr by avoiding indirect function call. | Ulrich Drepper | 2010-07-16 | 4 | -49/+76 |
| | |||||
* | Improve 64bit memcpy/memmove for Atom, Core 2 and Core i7 | H.J. Lu | 2010-06-30 | 18 | -8/+6640 |
| | | | | | | | This patch includes optimized 64bit memcpy/memmove for Atom, Core 2 and Core i7. It improves memcpy by up to 3X on Atom, up to 4X on Core 2 and up to 1X on Core i7. It also improves memmove by up to 3X on Atom, up to 4X on Core 2 and up to 2X on Core i7. | ||||
* | Incorrect x86 CPU family and model check. | H.J. Lu | 2010-05-27 | 1 | -3/+3 |
| | |||||
* | Whitespace fix. | Ulrich Drepper | 2010-04-14 | 1 | -1/+0 |
| | |||||
* | Add x86-32 FMA support | H.J. Lu | 2010-04-14 | 2 | -36/+0 |
| | |||||
* | Check DATA_CACHE_SIZE_HALF | H.J. Lu | 2010-04-14 | 1 | -2/+2 |
| | |||||
* | Optimize x86-64 SSE4 memcmp for unaligned data. | Ulrich Drepper | 2010-04-14 | 1 | -6/+371 |
| | |||||
* | x86-64 SSE4 optimized memcmp | H.J. Lu | 2010-04-14 | 4 | -1/+1331 |
| | | | | | This is a 64bit SSE4 optimized memcmp. It improves memcmp by up to 3X on Intel Core i7. | ||||
* | Update x86-64 cpu multiarch selection header. | Ulrich Drepper | 2010-04-13 | 1 | -17/+21 |
| | |||||
* | Fix concurrent handling of __cpu_features. | Ulrich Drepper | 2010-04-04 | 2 | -14/+23 |
| | |||||
* | Don't define __strpbrk_sse42 in static library | H.J. Lu | 2010-03-24 | 1 | -4/+8 |
| | |||||
* | Fix R_X86_64_PC32 overflow detection | Richard Guenther | 2010-03-04 | 1 | -2/+2 |
| | |||||
* | We can use the 64-bit register versions of the double functions. | Ulrich Drepper | 2010-02-24 | 1 | -0/+1 |
| | |||||
* | Avoid PLT call to fegetenv on s390 | Andreas Schwab | 2010-02-09 | 1 | -0/+1 |
| | |||||
* | Prevent silent errors should x86-64 strncmp be needed outside libc. | Ulrich Drepper | 2010-01-14 | 1 | -204/+211 |
| |
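
The cache-size arithmetic in the 2010-11-05 memset-sse2.S entry can be checked with a small sketch. The helper name `round_down_256` is hypothetical, and rounding down (rather than up) is an assumption; the commit message only says sizes are rounded to a multiple of 256 bytes. The figures 2516582 and 1258291 are taken from that message.

```c
/* Hypothetical helper, not taken from the patch itself: round a byte
   count down to the nearest multiple of 256 by clearing the low 8 bits.
   Rounding down is an assumption; the commit message does not say which
   direction the rounding goes.  */
static long
round_down_256 (long bytes)
{
  return bytes & ~255L;
}

/* Half of the per-core figure quoted in the commit message.  The raw
   value is what the log calls the "raw" cache size.  */
static long
half_of_per_core_share (void)
{
  long per_core = 2516582;   /* bytes per core, as quoted */
  return per_core / 2;       /* 1258291, not a multiple of 128 */
}
```

With these definitions, `half_of_per_core_share () % 128` is 51, which is the uneven remainder the message describes, and `round_down_256 (half_of_per_core_share ())` gives 1258240, a multiple of both 128 and 256.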