libvpx.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Yunqing Wang <yunqingwang@google.com>	2013-11-12 16:51:15 -0800
committer	Frank Galligan <fgalligan@google.com>	2013-11-15 16:18:43 -0800
commit	64f728caef5d9f019222c6989a9c6df17464dd69 (patch)
tree	5e5994eb65821008e78d7677d5abffbb3908f1fc /vp9/common/vp9_convolve.c
parent	60d1a5299576649f6db38714319b5845683ff0ab (diff)
download	libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.tar libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.tar.gz libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.tar.bz2 libvpx-64f728caef5d9f019222c6989a9c6df17464dd69.zip

Do horizontal loopfiltering in parallel

This patch followed "Rewrite filter_selectively_horiz for parallel loopfiltering" commit, and added x86 SSE2 optimization to do 16-pixel filtering in parallel. Also, corrected the declaration of aligned arrays. For 8-pixel-in-parallel case, improved the calculation of the masks and filters. Updated the threshold loading since the thresholds were already duplicated. Updated neon C functions to call neon loopfilters twice. Using tulip clip, tests showed it gave a ~1.5% decoder speed gain. Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35

Diffstat (limited to 'vp9/common/vp9_convolve.c')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: