diff options
author | Adhemerval Zanella <azanella@linux.vnet.ibm.com> | 2014-12-23 05:59:44 -0600 |
---|---|---|
committer | Adhemerval Zanella <azanella@linux.vnet.ibm.com> | 2015-01-13 11:28:30 -0500 |
commit | 96d6fd6c4060d739abb1822e7ad633af749532b2 (patch) | |
tree | 3958c551f6b4e375b62cf08e3e30fd472a94acf7 /NEWS | |
parent | 0f9e585480edcdf1e30dc3d79e24b84aeee516fa (diff) | |
download | glibc-96d6fd6c4060d739abb1822e7ad633af749532b2.tar.gz glibc-96d6fd6c4060d739abb1822e7ad633af749532b2.tar.xz glibc-96d6fd6c4060d739abb1822e7ad633af749532b2.zip |
powerpc: Optimized st{r,p}cpy for POWER8/PPC64
This patch adds an optimized POWER8 strcpy using unaligned accesses. For strings up to 16 bytes the implementation first calculate the string size, like strlen, and issues a memcpy. For larger strings, source is first aligned to 16 bytes and then tested over a loop that reads 16 bytes am combine the cmpb results for speedup. Special case is added for page cross reads. It shows 30%-60% improvement over the optimized POWER7 one that uses only aligned accesses.
Diffstat (limited to 'NEWS')
-rw-r--r-- | NEWS | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/NEWS b/NEWS index fbf133eac0..e02091802e 100644 --- a/NEWS +++ b/NEWS @@ -19,6 +19,8 @@ Version 2.21 17744, 17745, 17746, 17747, 17748, 17775, 17777, 17780, 17781, 17782, 17791, 17793, 17796, 17797, 17803, 17806, 17834 +* Optimized strcpy and stpcpy implementations for powerpc64/powerpc64le. + * Added support for TSX lock elision of pthread mutexes on powerpc32, powerpc64 and powerpc64le. This may improve lock scaling of existing programs on HTM capable systems. The lock elision code is only enabled with |