Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Use p2align instead ALIGN | Ondřej Bílka | 2013-10-08 | 6 | -295/+274 |
| | |||||
* | Faster strrchr. | Ondřej Bílka | 2013-09-26 | 5 | -899/+2 |
| | |||||
* | Faster strchr implementation. | Ondřej Bílka | 2013-09-11 | 2 | -128/+0 |
| | |||||
* | Add unaligned strcmp. | Ondřej Bílka | 2013-09-03 | 4 | -2/+222 |
| | |||||
* | Fix typos. | Ondřej Bílka | 2013-08-30 | 1 | -1/+1 |
| | |||||
* | Fix rawmemchr regression on bulldozer. | Ondřej Bílka | 2013-08-30 | 2 | -109/+0 |
| | |||||
* | Fix typos. | Ondřej Bílka | 2013-08-21 | 1 | -2/+2 |
| | |||||
* | Skip SSE4.2 versions on Intel Silvermont | Liubov Dmitrieva | 2013-06-28 | 5 | -15/+37 |
| | | | | SSE2/SSSE3 versions are faster than SSE4.2 versions on Intel Silvermont. | ||||
* | Fix buffers overrun in x86_64 memcmp-ssse3.S | Liubov Dmitrieva | 2013-06-26 | 1 | -4/+2 |
| | |||||
* | Set fast unaligned load flag for new Intel microarchitecture | Liubov Dmitrieva | 2013-06-14 | 1 | -0/+7 |
| | | | | | | | | | I have small patch for new Intel Silvermont machines. http://newsroom.intel.com/community/intel_newsroom/blog/2013/05/06/intel-launches-low-power-high-performance-silvermont-microarchitecture I checked this on my machine and see that strcpy, ... unaligned versions are faster than ssse3 versions. | ||||
* | Faster memcpy on x64. | Ondrej Bilka | 2013-05-20 | 4 | -8/+185 |
| | | | | | | | | | We add new memcpy version that uses unaligned loads which are fast on modern processors. This allows second improvement which is avoiding computed jump which is relatively expensive operation. Tests available here: http://kam.mff.cuni.cz/~ondra/memcpy_profile_result27_04_13.tar.bz2 | ||||
* | Faster strlen on x64. | Ondrej Bilka | 2013-03-18 | 10 | -1179/+544 |
| | |||||
* | Remove Prefer_SSE_for_memop on x64 | Ondrej Bilka | 2013-03-11 | 8 | -197/+1 |
| | |||||
* | Revert " * sysdeps/x86_64/strlen.S: Replace with new SSE2 based implementation" | Ondrej Bilka | 2013-03-06 | 10 | -537/+1179 |
| | | | | This reverts commit b79188d71716b6286866e06add976fe84100595e. | ||||
* | * sysdeps/x86_64/strlen.S: Replace with new SSE2 based implementation | Ondrej Bilka | 2013-03-06 | 10 | -1179/+537 |
| | | | | | which is faster on all x86_64 architectures. Tested on AMD, Intel Nehalem, SNB, IVB. | ||||
* | Remove lots of inline keywords. | Roland McGrath | 2013-02-07 | 2 | -4/+5 |
| | |||||
* | Change __x86_64 prefix in cache size to __x86 | H.J. Lu | 2013-01-05 | 3 | -13/+13 |
| | |||||
* | Add HAS_RTM | H.J. Lu | 2013-01-03 | 2 | -0/+16 |
| | |||||
* | Update copyright notices with scripts/update-copyrights. | Joseph Myers | 2013-01-02 | 50 | -50/+50 |
| | |||||
* | test-multiarch: terminate printf output with newline | Pino Toscano | 2012-11-22 | 1 | -1/+1 |
| | |||||
* | Compile x86 rtld with -mno-sse -mno-mmx | H.J. Lu | 2012-11-02 | 1 | -1/+2 |
| | |||||
* | Add x86-64 __libc_ifunc_impl_list | H.J. Lu | 2012-10-11 | 33 | -24/+380 |
| | |||||
* | Use IFUNC memmove/memset in x86-64 bcopy/bzero | H.J. Lu | 2012-10-11 | 3 | -33/+11 |
| | | | | Also add separate tests for bcopy and bzero. | ||||
* | Define HAS_FMA with bit_FMA_Usable | H.J. Lu | 2012-10-02 | 2 | -2/+10 |
| | |||||
* | Don't define x86-64 __strncmp_ssse3 in libc.a | H.J. Lu | 2012-09-27 | 1 | -4/+6 |
| | |||||
* | Clean up x86_64/multiarch/strstr-c.c include order. | Roland McGrath | 2012-08-15 | 1 | -6/+26 |
| | |||||
* | Clean up x86_64/multiarch/memmove.c include order. | Roland McGrath | 2012-08-15 | 1 | -20/+18 |
| | |||||
* | Avoid DWARF definition DIE on ifunc symbols | H.J. Lu | 2012-08-09 | 2 | -10/+32 |
| | |||||
* | BZ#14059: Fix AVX and FMA4 detection. | Carlos O'Donell | 2012-05-17 | 5 | -30/+148 |
| | | | | | Fix AVX and FMA4 detection by following the guidelines set out by Intel and AMD for detecting these features. | ||||
* | Load pointers into RAX_LP in strcmp-sse42.S | H.J. Lu | 2012-05-15 | 1 | -6/+6 |
| | |||||
* | Load cache sizes into R*_LP in memcpy-ssse3.S | H.J. Lu | 2012-05-15 | 1 | -12/+12 |
| | |||||
* | Load cache sizes into R*_LP in memcpy-ssse3-back.S | H.J. Lu | 2012-05-15 | 1 | -10/+10 |
| | |||||
* | Load cache size into R8_LP | H.J. Lu | 2012-05-15 | 1 | -4/+4 |
| | |||||
* | Replace FSF snail mail address with URLs. | Paul Eggert | 2012-02-09 | 47 | -141/+94 |
| | |||||
* | Really fix AVX tests | Ulrich Drepper | 2012-01-26 | 2 | -20/+20 |
| | | | | | | There is no problem with strcmp, it doesn't use the YMM registers. The math routines might since gcc perhaps generates such code. Introduce bit_YMM_USBALE and use it in the math routines. | ||||
* | Reset bit_AVX in __cpu_features is OS support is missing | Ulrich Drepper | 2012-01-26 | 2 | -2/+15 |
| | |||||
* | Fix overrun in destination buffer | Liubov Dmitrieva | 2011-12-23 | 2 | -508/+323 |
| | |||||
* | WP fixes | Ulrich Drepper | 2011-12-17 | 1 | -1/+0 |
| | |||||
* | Optimized wcschr and wcscpy for x86-64 and x86-32 | Ulrich Drepper | 2011-12-17 | 4 | -1/+619 |
| | |||||
* | Fix more warnings | Ulrich Drepper | 2011-12-03 | 1 | -0/+4 |
| | |||||
* | Fix test of non-ASCII locales in x86-64 strcasecmp et.al. | Ulrich Drepper | 2011-11-01 | 1 | -2/+2 |
| | |||||
* | More cleanups of x86-64 strstr | Ulrich Drepper | 2011-10-28 | 1 | -13/+12 |
| | |||||
* | Clean up x86-64 strcasestr | Ulrich Drepper | 2011-10-28 | 1 | -13/+12 |
| | | | | Actually describe in the C code what is going on. | ||||
* | Add AVX optimized versions for some x86-64 math functions | Ulrich Drepper | 2011-10-25 | 1 | -14/+18 |
| | |||||
* | Optimize x86-64 SSE4.2+ strcmp a bit more | Ulrich Drepper | 2011-10-25 | 1 | -175/+130 |
| | |||||
* | Fix WS | Ulrich Drepper | 2011-10-23 | 1 | -1/+1 |
| | |||||
* | Optimized strnlen and wcscmp for x86-64 | Liubov Dmitrieva | 2011-10-23 | 7 | -317/+748 |
| | |||||
* | Fix compilation problems in x86-64 init-arch | Ulrich Drepper | 2011-10-21 | 1 | -1/+2 |
| | |||||
* | Check for FMA4 support and generate appropriate fma functions | Ulrich Drepper | 2011-10-20 | 2 | -1/+11 |
| | |||||
* | Move fma routines to right place | Ulrich Drepper | 2011-10-20 | 2 | -85/+0 |
| |