Age | Commit message (Collapse) | Author |
|
|
|
|
|
a byte loop. Performance
on bench-strncpy is 1.9-2.1x faster on average. I tried several variations, and using a tailcall and
calling memset conditionally gave the best overall results.
|
|
|
|
|
|
|
|
|
|
* sysdeps/generic/memcopy.h (reg_char): Delete.
* debug/strcat_chk.c: Use char, not reg_char.
* debug/strcpy_chk.c: Likewise.
* debug/strncat_chk.c: Likewise.
* debug/strncpy_chk.c: Likewise.
* string/memchr.c: Likewise.
* string/memrchr.c: Likewise.
* string/rawmemchr.c: Likewise.
* string/strcat.c: Likewise.
* string/strchr.c: Likewise.
* string/strchrnul.c: Likewise.
* string/strcmp.c: Likewise.
* string/strcpy.c: Likewise.
* string/strncat.c: Likewise.
* string/strncmp.c: Likewise.
* string/strncpy.c: Likewise.
|
|
|
|
This patch adds SSSE3 strcpy/stpcpy. I got up to 4X speed up on Core 2
and Core i7. I disabled it on Atom since SSSE3 version is slower for
shorter (<64byte) data.
|
|
|
|
|
|
|