Age | Commit message (Collapse) | Author |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
There is no problem with strcmp, it doesn't use the YMM registers.
The math routines might since gcc perhaps generates such code.
Introduce bit_YMM_USBALE and use it in the math routines.
|
|
|
|
|
|
No more __const.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Actually describe in the C code what is going on.
|
|
Some of the AVX-specific code is not giving enough speed-up to
justify the extra code.
|
|
|
|
|
|
|
|
Move the FMA4 code into its own section. Avoid some of the duplication
of data resulting from the double use of source files.
|
|
It's better to use __builtin_fma if it works. Use it for gcc 4.6 and
higher. Move the x86-64 dla.h to the correct place.
|
|
|
|
|
|
|