path: root/sysdeps/x86_64/multiarch
Age  Commit message  Author
2014-01-01  Update copyright notices with scripts/update-copyrights  Allan McRae
2013-12-16  Update file name in x86_64 ifunc list  Allan McRae
    File name update missed in commit 584b18eb.
2013-12-14  Add strstr with unaligned loads. Fixes bug 12100.  Ondřej Bílka
    The SSE4.2 version of strstr used the pcmpistr instruction, which is quite inefficient. A faster approach is to look for pairs of characters: this needs only SSE2, is faster than pcmpistr, and for real strings the pairs we look for are relatively rare. To keep linear time complexity we use a buy-or-rent technique that switches to the two-way algorithm when superlinear behaviour is detected.
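The pair search and the buy-or-rent switch described in the strstr commit can be illustrated with a scalar sketch. This is not the committed SSE2 assembly: `pair_strstr`, `linear_fallback`, `BUDGET_FACTOR` and `BUDGET_SLACK` are hypothetical names and values chosen for illustration, and the fallback simply defers to the C library's `strstr` where the commit uses the two-way algorithm.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Assumed tuning constants, not taken from the commit. */
#define BUDGET_FACTOR 8
#define BUDGET_SLACK  256

/* Hypothetical stand-in for the linear-time two-way algorithm. */
static const char *linear_fallback(const char *hay, const char *needle)
{
    return strstr(hay, needle);
}

/* Find needle in hay by first locating its leading pair of characters. */
const char *pair_strstr(const char *hay, const char *needle)
{
    assert(hay != NULL && needle != NULL);
    size_t nlen = strlen(needle);
    if (nlen == 0)
        return hay;
    if (nlen == 1)
        return strchr(hay, needle[0]);

    const char *p = hay;
    const char *end = hay + strlen(hay);
    size_t wasted = 0;  /* upper bound on bytes compared in failed checks */

    while ((size_t)(end - p) >= nlen) {
        /* Scan only the positions where a full match could still start. */
        p = memchr(p, needle[0], (size_t)(end - p) - nlen + 1);
        if (p == NULL)
            return NULL;
        if (p[1] == needle[1]) {
            /* The pair matched, which is rare in real text: verify the rest. */
            if (memcmp(p + 2, needle + 2, nlen - 2) == 0)
                return p;
            wasted += nlen;
            /* "Rent" the cheap search while failed checks stay proportional
               to the distance scanned; otherwise "buy" the linear one. */
            if (wasted > BUDGET_FACTOR * (size_t)(p - hay) + BUDGET_SLACK)
                return linear_fallback(p + 1, needle);
        }
        p++;
    }
    return NULL;
}
```

Because every position up to `p` has already been ruled out when the budget is exceeded, restarting the fallback at `p + 1` cannot miss a match.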
2013-10-08  Use p2align instead of ALIGN  Ondřej Bílka
2013-09-26  Faster strrchr.  Ondřej Bílka
2013-09-11  Faster strchr implementation.  Ondřej Bílka
2013-09-03  Add unaligned strcmp.  Ondřej Bílka
2013-08-30  Fix typos.  Ondřej Bílka
2013-08-30  Fix rawmemchr regression on bulldozer.  Ondřej Bílka
2013-08-21  Fix typos.  Ondřej Bílka
2013-06-28  Skip SSE4.2 versions on Intel Silvermont  Liubov Dmitrieva
    SSE2/SSSE3 versions are faster than SSE4.2 versions on Intel Silvermont.
2013-06-26  Fix buffer overrun in x86_64 memcmp-ssse3.S  Liubov Dmitrieva
2013-06-14  Set fast unaligned load flag for new Intel microarchitecture  Liubov Dmitrieva
    I have a small patch for new Intel Silvermont machines: http://newsroom.intel.com/community/intel_newsroom/blog/2013/05/06/intel-launches-low-power-high-performance-silvermont-microarchitecture I checked this on my machine and saw that the strcpy, ... unaligned versions are faster than the ssse3 versions.
2013-05-20  Faster memcpy on x64.  Ondrej Bilka
    We add a new memcpy version that uses unaligned loads, which are fast on modern processors. This allows a second improvement: avoiding a computed jump, which is a relatively expensive operation. Tests are available here: http://kam.mff.cuni.cz/~ondra/memcpy_profile_result27_04_13.tar.bz2
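One common way unaligned loads remove the need for a computed jump is to copy a small block with two possibly overlapping accesses, one anchored at the start and one at the end. The sketch below shows that idea in portable C under the assumption that this is the kind of trick the commit relies on; `small_copy` is a hypothetical helper, not a function from the patch, and the fixed-size `memcpy` calls are expected to compile to single unaligned moves.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Copy n <= 16 bytes using a few size ranges instead of a 17-entry
   jump table.  Each range loads the first and last k bytes, which
   overlap whenever n < 2k, and then stores both. */
void small_copy(void *dst, const void *src, size_t n)
{
    assert(n <= 16);
    char *d = dst;
    const char *s = src;

    if (n >= 8) {
        uint64_t head, tail;
        memcpy(&head, s, 8);           /* unaligned load of the first 8 bytes */
        memcpy(&tail, s + n - 8, 8);   /* unaligned load of the last 8 bytes */
        memcpy(d, &head, 8);
        memcpy(d + n - 8, &tail, 8);
    } else if (n >= 4) {
        uint32_t head, tail;
        memcpy(&head, s, 4);
        memcpy(&tail, s + n - 4, 4);
        memcpy(d, &head, 4);
        memcpy(d + n - 4, &tail, 4);
    } else if (n >= 2) {
        uint16_t head, tail;
        memcpy(&head, s, 2);
        memcpy(&tail, s + n - 2, 2);
        memcpy(d, &head, 2);
        memcpy(d + n - 2, &tail, 2);
    } else if (n == 1) {
        d[0] = s[0];
    }
}
```

The branches are a handful of well-predicted range checks, whereas a jump table needs an indirect branch whose target depends on the exact length.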
2013-03-18  Faster strlen on x64.  Ondrej Bilka
2013-03-11  Remove Prefer_SSE_for_memop on x64  Ondrej Bilka
2013-03-06  Revert " * sysdeps/x86_64/strlen.S: Replace with new SSE2 based implementation"  Ondrej Bilka
    This reverts commit b79188d71716b6286866e06add976fe84100595e.
2013-03-06  * sysdeps/x86_64/strlen.S: Replace with new SSE2 based implementation  Ondrej Bilka
    which is faster on all x86_64 architectures. Tested on AMD, Intel Nehalem, SNB, IVB.
2013-02-07  Remove lots of inline keywords.  Roland McGrath
2013-01-05  Change __x86_64 prefix in cache size to __x86  H.J. Lu
2013-01-03  Add HAS_RTM  H.J. Lu
2013-01-02  Update copyright notices with scripts/update-copyrights.  Joseph Myers
2012-11-22  test-multiarch: terminate printf output with newline  Pino Toscano
2012-11-02  Compile x86 rtld with -mno-sse -mno-mmx  H.J. Lu
2012-10-11  Add x86-64 __libc_ifunc_impl_list  H.J. Lu
2012-10-11  Use IFUNC memmove/memset in x86-64 bcopy/bzero  H.J. Lu
    Also add separate tests for bcopy and bzero.
2012-10-02  Define HAS_FMA with bit_FMA_Usable  H.J. Lu
2012-09-27  Don't define x86-64 __strncmp_ssse3 in libc.a  H.J. Lu
2012-08-15  Clean up x86_64/multiarch/strstr-c.c include order.  Roland McGrath
2012-08-15  Clean up x86_64/multiarch/memmove.c include order.  Roland McGrath
2012-08-09  Avoid DWARF definition DIE on ifunc symbols  H.J. Lu
2012-05-17  BZ#14059: Fix AVX and FMA4 detection.  Carlos O'Donell
    Fix AVX and FMA4 detection by following the guidelines set out by Intel and AMD for detecting these features.
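The vendor guidelines referred to above require checking operating-system support as well as the CPU feature bits. The sketch below follows the documented sequence (CPUID leaf 1 for OSXSAVE, AVX and FMA; XGETBV for the XMM and YMM state bits in XCR0; CPUID leaf 0x80000001 for FMA4). It assumes GCC or Clang with `<cpuid.h>`, and `struct simd_usable` and `detect_simd` are hypothetical names; it is not the code from the glibc commit.

```c
#include <assert.h>
#if defined(__x86_64__) || defined(__i386__)
#include <cpuid.h>
#endif

/* Hypothetical result type: 1 means the feature is safe to use. */
struct simd_usable { int avx; int fma; int fma4; };

struct simd_usable detect_simd(void)
{
    struct simd_usable r = { 0, 0, 0 };
#if defined(__x86_64__) || defined(__i386__)
    unsigned eax, ebx, ecx, edx;

    if (!__get_cpuid(1, &eax, &ebx, &ecx, &edx))
        return r;
    int osxsave = (ecx >> 27) & 1;  /* OS has enabled XSAVE/XGETBV */
    int cpu_avx = (ecx >> 28) & 1;  /* CPU implements AVX */
    int cpu_fma = (ecx >> 12) & 1;  /* CPU implements FMA (FMA3) */
    if (!osxsave || !cpu_avx)
        return r;

    /* Read XCR0: bit 1 = XMM state, bit 2 = YMM state.  Both must be
       enabled by the OS before YMM registers may be used. */
    unsigned xcr0_lo, xcr0_hi;
    __asm__ volatile ("xgetbv" : "=a" (xcr0_lo), "=d" (xcr0_hi) : "c" (0));
    (void) xcr0_hi;
    if ((xcr0_lo & 0x6) != 0x6)
        return r;

    r.avx = 1;
    r.fma = cpu_fma;
    /* FMA4 is reported in the extended leaf, ECX bit 16. */
    if (__get_cpuid(0x80000001, &eax, &ebx, &ecx, &edx))
        r.fma4 = (ecx >> 16) & 1;
#endif
    return r;
}
```

Testing only the CPUID feature bit is not enough: if the operating system does not save and restore YMM state, AVX instructions fault even on a CPU that implements them. On non-x86 targets the sketch reports every feature as unavailable.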
2012-05-15  Load pointers into RAX_LP in strcmp-sse42.S  H.J. Lu
2012-05-15  Load cache sizes into R*_LP in memcpy-ssse3.S  H.J. Lu
2012-05-15  Load cache sizes into R*_LP in memcpy-ssse3-back.S  H.J. Lu
2012-05-15  Load cache size into R8_LP  H.J. Lu
2012-02-09  Replace FSF snail mail address with URLs.  Paul Eggert
2012-01-26  Really fix AVX tests  Ulrich Drepper
    There is no problem with strcmp; it doesn't use the YMM registers. The math routines might, since gcc may generate such code. Introduce bit_YMM_Usable and use it in the math routines.
2012-01-26  Reset bit_AVX in __cpu_features if OS support is missing  Ulrich Drepper
2011-12-23  Fix overrun in destination buffer  Liubov Dmitrieva
2011-12-17  WP fixes  Ulrich Drepper
2011-12-17  Optimized wcschr and wcscpy for x86-64 and x86-32  Ulrich Drepper
2011-12-03  Fix more warnings  Ulrich Drepper
2011-11-01  Fix test of non-ASCII locales in x86-64 strcasecmp et al.  Ulrich Drepper
2011-10-28  More cleanups of x86-64 strstr  Ulrich Drepper
2011-10-28  Clean up x86-64 strcasestr  Ulrich Drepper
    Actually describe in the C code what is going on.
2011-10-25  Add AVX optimized versions for some x86-64 math functions  Ulrich Drepper
2011-10-25  Optimize x86-64 SSE4.2+ strcmp a bit more  Ulrich Drepper
2011-10-23  Fix WS  Ulrich Drepper
2011-10-23  Optimized strnlen and wcscmp for x86-64  Liubov Dmitrieva