author | levytamar82 <levytamar82@gmail.com> | 2013-12-29 01:23:50 -0700 |
---|---|---|
committer | levytamar82 <levytamar82@gmail.com> | 2014-01-08 12:05:53 -0700 |
commit | 357b65369f7d41a3c66eadb390224d260bd6dc63 (patch) | |
tree | 1c7716e5845112ada8b717c1a57b1fdd262e9518 /vp8/common/arm/neon | |
parent | cdc933ca0043b7944a4a45edb85d4c142f63fc56 (diff) | |
AVX2 Variance Optimization
Optimize the variance functions vp9_variance16x16, vp9_variance32x32,
vp9_variance64x64, vp9_variance32x16, vp9_variance64x32 and
vp9_mse16x16 by migrating them to AVX2.
Some of the functions were optimized by processing 32 elements per
iteration instead of 16.
Others were optimized by processing two loop strides of 16 elements
in a single 256-bit register.
This optimization gives a 2.4% to 2.7% user-level performance gain
and a 42% function-level gain.
Change-Id: I265ae08a2b0196057a224a86450153ef3aebd85d
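The "two loop strides of 16 elements in a single 256-bit register" idea from the message above can be illustrated with a scalar sketch. This is not the code from this commit: the function name variance16xh_two_rows and its signature are hypothetical, and the 32-iteration inner loop only stands in for the 32 byte lanes of one 256-bit register (lanes 0-15 taken from one row, lanes 16-31 from the next). It assumes variance is computed as SSE - sum^2 / N, the usual block-variance formula.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical scalar sketch, not the libvpx AVX2 implementation.
 * Computes the variance of a 16-wide, h-tall block, consuming two rows
 * (2 x 16 = 32 bytes, i.e. one 256-bit register's worth) per iteration. */
static unsigned int variance16xh_two_rows(const uint8_t *src, int src_stride,
                                          const uint8_t *ref, int ref_stride,
                                          int h, unsigned int *sse) {
  int sum = 0;         /* signed sum of differences */
  unsigned int sq = 0; /* sum of squared differences (SSE) */

  assert(h % 2 == 0);  /* two rows are consumed per outer iteration */

  for (int r = 0; r < h; r += 2) {
    /* Each pass over 'lane' models one 256-bit register of 32 bytes. */
    for (int lane = 0; lane < 32; ++lane) {
      const int row = r + (lane >> 4); /* lanes 16..31 map to the next row */
      const int col = lane & 15;
      const int d = src[row * src_stride + col] - ref[row * ref_stride + col];
      sum += d;
      sq += (unsigned int)(d * d);
    }
  }

  *sse = sq;
  /* variance = SSE - sum^2 / N, with N = 16 * h pixels */
  return sq - (unsigned int)(((int64_t)sum * sum) / (16 * h));
}
```

In a real AVX2 version the inner loop would be replaced by a handful of vector instructions, so a 16x16 block needs 8 iterations instead of 16. The scalar form is shown only to make the data layout explicit.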
Diffstat (limited to 'vp8/common/arm/neon')
0 files changed, 0 insertions, 0 deletions