author | Adhemerval Zanella <azanella@linux.vnet.ibm.com> | 2014-12-23 05:59:44 -0600 |
---|---|---|
committer | Adhemerval Zanella <azanella@linux.vnet.ibm.com> | 2015-01-13 11:28:30 -0500 |
commit | 96d6fd6c4060d739abb1822e7ad633af749532b2 (patch) | |
tree | 3958c551f6b4e375b62cf08e3e30fd472a94acf7 /NEWS | |
parent | 0f9e585480edcdf1e30dc3d79e24b84aeee516fa (diff) | |
powerpc: Optimized st{r,p}cpy for POWER8/PPC64
This patch adds an optimized POWER8 strcpy using unaligned accesses.
For strings up to 16 bytes, the implementation first calculates the
string size, like strlen, and issues a memcpy. For larger strings, the
source is first aligned to 16 bytes and then tested in a loop that
reads 16 bytes at a time and combines the cmpb results for speedup. A
special case is added for reads that cross a page boundary.
It shows a 30%-60% improvement over the optimized POWER7 implementation,
which uses only aligned accesses.
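
The strategy described above can be illustrated with a rough portable C sketch. This is not the POWER8 assembly from the patch: `sketch_strcpy` and `has_zero_byte` are hypothetical names, the `cmpb` instruction is replaced by a standard bit trick that detects a zero byte in a 64-bit word, and the page-cross special case is omitted because the short path here scans byte by byte instead of using an unaligned 16-byte load. Note that the aligned 16-byte reads can touch bytes after the terminator (never beyond the aligned block); the assembly can rely on that, but strict ISO C does not guarantee it.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Nonzero iff at least one byte of v is zero.
   Portable stand-in (an assumption of this sketch) for comparing
   every byte against zero with cmpb. */
static uint64_t has_zero_byte(uint64_t v)
{
    return (v - 0x0101010101010101ULL) & ~v & 0x8080808080808080ULL;
}

/* Hypothetical illustration of the two paths in the commit message. */
char *sketch_strcpy(char *dst, const char *src)
{
    char *ret = dst;
    size_t n = 0;

    /* Short path: find the length within the first 16 bytes,
       then copy the string and its terminator with one memcpy. */
    while (n < 16 && src[n] != '\0')
        n++;
    if (n < 16) {
        memcpy(dst, src, n + 1);
        return ret;
    }

    /* The first 16 bytes hold no terminator, so copy them all and
       advance to the next 16-byte boundary of the source. skip is
       in 1..16, so every skipped byte has already been copied. */
    memcpy(dst, src, 16);
    size_t skip = 16 - ((uintptr_t)src & 15);
    src += skip;
    dst += skip;

    /* Main loop: read one aligned 16-byte block as two 64-bit words
       and combine both zero tests into a single branch. */
    for (;;) {
        uint64_t lo, hi;
        memcpy(&lo, src, 8);
        memcpy(&hi, src + 8, 8);
        if (has_zero_byte(lo) | has_zero_byte(hi))
            break;
        memcpy(dst, src, 16);
        src += 16;
        dst += 16;
    }

    /* Tail: the terminator is inside this block; finish byte by byte. */
    while ((*dst++ = *src++) != '\0')
        ;
    return ret;
}
```

Because only the source is aligned, the stores to `dst` in the main loop are generally unaligned, which is the kind of access the commit message says POWER8 handles well.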
Diffstat (limited to 'NEWS')
-rw-r--r-- | NEWS | 2 |
1 files changed, 2 insertions, 0 deletions
@@ -19,6 +19,8 @@ Version 2.21
   17744, 17745, 17746, 17747, 17748, 17775, 17777, 17780, 17781, 17782,
   17791, 17793, 17796, 17797, 17803, 17806, 17834
 
+* Optimized strcpy and stpcpy implementations for powerpc64/powerpc64le.
+
 * Added support for TSX lock elision of pthread mutexes on powerpc32, powerpc64
   and powerpc64le. This may improve lock scaling of existing programs on HTM
   capable systems. The lock elision code is only enabled with