diff options
author | Will Schmidt <will_schmidt@vnet.ibm.com> | 2012-08-22 11:04:42 -0500 |
---|---|---|
committer | Ryan S. Arnold <rsa@linux.vnet.ibm.com> | 2012-08-22 11:04:42 -0500 |
commit | 15d0da8cb36f5c96a3c649d29a2e2623e995cd49 (patch) | |
tree | 3c1ccdcbc8fb94fb795fc13e519e02957d02e626 /bits/stdio-lock.h | |
parent | bcca089526c6859e775243731037a469aec3065c (diff) | |
download | glibc-15d0da8cb36f5c96a3c649d29a2e2623e995cd49.tar.gz glibc-15d0da8cb36f5c96a3c649d29a2e2623e995cd49.tar.xz glibc-15d0da8cb36f5c96a3c649d29a2e2623e995cd49.zip |
Add versions of wcscpy, wcschr, wcsrchr for power6/power7.
Initially based on the versions found in wcsmbs/* ; these files have been changed by hand unrolling, and adding some additional variables to allow some read-ahead to occur, which then relieves some of the wait-for-increment/wait-for-load/wait-for-compare-results pressure that was slowing down every iteration through the while-loop. For 64-bit Power7, These changes give an approx 20% throughput boost for the wcschr and wcsrchr functions; and approx 40% boost for the wcscpy function. 32-bit improvements appear to be slightly better with ~ %30 and ~ %45 respectively. Results for Power6 closely match those for power7.
Diffstat (limited to 'bits/stdio-lock.h')
0 files changed, 0 insertions, 0 deletions