author | Ondrej Bilka <neleai@seznam.cz> | 2013-05-20 08:26:00 +0200 |
---|---|---|
committer | Ondrej Bilka <neleai@seznam.cz> | 2013-05-20 08:32:45 +0200 |
commit | b2b671b677d92429a3d41bf451668f476aa267ed (patch) | |
tree | a211523045be91beb43abbf7ebacb7a74ddb8adc /ChangeLog | |
parent | 2d48b41c8fa610067c4d664ac2339ae6ca43e78c (diff) | |
download | glibc-b2b671b677d92429a3d41bf451668f476aa267ed.tar.gz glibc-b2b671b677d92429a3d41bf451668f476aa267ed.tar.xz glibc-b2b671b677d92429a3d41bf451668f476aa267ed.zip |
Faster memset on x64
This implementation speeds up memset in several ways. The first is avoiding an expensive computed jump. The second is exploiting the fact that the arguments of memset are aligned to 8 bytes most of the time. Benchmark results are available at: kam.mff.cuni.cz/~ondra/benchmark_string/memset_profile_result27_04_13.tar.bz2
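The two ideas in the commit message can be illustrated with a small C sketch. This is not the assembly from sysdeps/x86_64/memset.S. It is an assumed, simplified illustration in which a few size comparisons and overlapping head/tail stores take the place of a computed jump into a table of per-size store sequences, and the bulk of the buffer is filled with 8-byte-aligned stores. The names `sketch_memset` and `sketch_matches_memset` are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical illustration only; not a glibc function. */
static void *sketch_memset(void *dst, int c, size_t n)
{
    unsigned char *p = dst;
    /* Broadcast the fill byte into all 8 bytes of a word. */
    uint64_t v = 0x0101010101010101ULL * (unsigned char)c;

    if (n >= 8) {
        /* Head and tail: two possibly unaligned 8-byte stores.
           They may overlap each other or the middle, which is harmless
           because every store writes the same byte value. */
        memcpy(p, &v, 8);
        memcpy(p + n - 8, &v, 8);

        /* Middle: 8-byte stores starting at the first 8-byte boundary
           that lies after p and at most 8 bytes past it. */
        unsigned char *q = (unsigned char *)(((uintptr_t)p + 8) & ~(uintptr_t)7);
        unsigned char *end = p + n - 8;
        for (; q < end; q += 8)
            memcpy(q, &v, 8);
        return dst;
    }
    if (n >= 4) {
        /* 4..7 bytes: two overlapping 4-byte stores. */
        uint32_t w = (uint32_t)v;
        memcpy(p, &w, 4);
        memcpy(p + n - 4, &w, 4);
        return dst;
    }
    if (n >= 2) {
        /* 2..3 bytes: two overlapping 2-byte stores. */
        uint16_t h = (uint16_t)v;
        memcpy(p, &h, 2);
        memcpy(p + n - 2, &h, 2);
        return dst;
    }
    if (n == 1)
        p[0] = (unsigned char)c;
    return dst;
}

/* Compare the sketch against the C library memset, with guard bytes
   on both sides of the filled region to catch out-of-bounds writes. */
static int sketch_matches_memset(size_t offset, size_t n, int c)
{
    unsigned char a[96], b[96];
    if (offset + n > sizeof a)
        return 0;
    memset(a, 0xEE, sizeof a);
    memset(b, 0xEE, sizeof b);
    sketch_memset(a + offset, c, n);
    memset(b + offset, c, n);
    return memcmp(a, b, sizeof a) == 0;
}
```

The fixed-size `memcpy` calls stand in for plain store instructions, and compilers typically lower them to single stores. Every size class is handled by a short chain of predictable comparisons, so no indirect jump is needed.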
Diffstat (limited to 'ChangeLog')
-rw-r--r-- | ChangeLog | 6 |
1 files changed, 6 insertions, 0 deletions
diff --git a/ChangeLog b/ChangeLog
index 6c540a7e23..67dfa33b14 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -1,5 +1,11 @@
 2013-05-20  Ondřej Bílka  <neleai@seznam.cz>
 
+	* sysdeps/x86_64/memset.S (memset): New implementation.
+	(__bzero): Likewise.
+	(__memset_tail): New function.
+
+2013-05-20  Ondřej Bílka  <neleai@seznam.cz>
+
 	* sysdeps/x86_64/multiarch/memcpy-sse2-unaligned.S: New file.
 	* sysdeps/x86_64/multiarch/memcpy.S (__new_memcpy): Add
 	__memcpy_sse2_unaligned ifunc selection.