diff options
author | Yunqing Wang <yunqingwang@google.com> | 2013-11-12 16:51:15 -0800 |
---|---|---|
committer | Frank Galligan <fgalligan@google.com> | 2013-11-15 16:18:43 -0800 |
commit | 64f728caef5d9f019222c6989a9c6df17464dd69 (patch) | |
tree | 5e5994eb65821008e78d7677d5abffbb3908f1fc /vp9/common/vp9_convolve.c | |
parent | 60d1a5299576649f6db38714319b5845683ff0ab (diff) | |
download | libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.tar libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.tar.gz libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.tar.bz2 libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.zip |
Do horizontal loopfiltering in parallel
This patch followed "Rewrite filter_selectively_horiz for parallel
loopfiltering" commit, and added x86 SSE2 optimization to do
16-pixel filtering in parallel. Also, corrected the declaration
of aligned arrays. For 8-pixel-in-parallel case, improved the
calculation of the masks and filters. Updated the threshold loading
since the thresholds were already duplicated. Updated neon C functions
to call neon loopfilters twice.
Using tulip clip, tests showed it gave a ~1.5% decoder speed gain.
Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35
Diffstat (limited to 'vp9/common/vp9_convolve.c')
0 files changed, 0 insertions, 0 deletions