Age | Commit message (Collapse) | Author |
|
Change-Id: I695a990689c79d160227975116125b140875aed1
|
|
Allow selective masking of individual split modes rather than
just a single on / off flag.
For speed 2 recovers the large speed loss seen for some derf
clips in change Ie6bdfa0a370148dd60bd800961077f7e97e67dd4
and a small quality gain.
For speed 1 10 % speed increase observed locally on some derf clips
for minimal quality change.
Change-Id: If86191087b93cbc05351c26c60c7933e2149e485
|
|
In relation to change:
Refactor inter mode rate-distortion search
Ie6bdfa0a370148dd60bd800961077f7e97e67dd4
sf->thresh_mult_sub8x8[THR_INTRA] = INT_MAX missing;
Change-Id: Ia86b68a5073368a3e2ca124a27b632243b525c8b
|
|
|
|
|
|
Change-Id: I2cc3684f416a63dc99b9303109f9850f34a470d5
|
|
|
|
398ddafb629b7f49cf255bf09d3e38b4abd0bb95"
|
|
398ddafb629b7f49cf255bf09d3e38b4abd0bb95
This patch reverts old commit 398ddafb629b7f49cf255bf09d3e38b4abd0bb95
"New way of updating last frame segmentation map.".
Change-Id: Iba730f433c30ed7f5e5449d6768049cbf9a2b2c5
|
|
Change-Id: Ic23a705e48cadaa7151f2bd8536d56636cb973e3
|
|
This commit defines b_mode_info as a struct type. This will allow
us to further remove the use of PARTITION_INFO in the encoding process.
Change-Id: I975b0f7d557b5e0f66545a61b472def76b671cce
|
|
Remove redundant variable definition/use in rate-distortion search
loop for regular and sub8x8 blocks, respectively.
Change-Id: Ic0eb3660bb6851ba2eb8d702ba9fd11595000d01
|
|
|
|
|
|
|
|
This commit separates the rate-distortion optimization loop of
superblocks from that of sub8x8 blocks. This allows better design
rate-distortion optimization search loop for each setting. It also
removes the use of SPLITMV and I4X4_PRED therein.
No performance change in speed 0 settings. For bus@CIF at 2000kbps,
the speed 1 runtime goes from 48009ms to 43894ms (about 10% faster).
The overall compression performance on derf changed by -0.021%.
Speed 2 runtime goes from 27114ms to 28700ms (6% slower), while the
overall coding efficiency goes up by 1.629% for derf, 1.236% for yt.
Change-Id: Ie6bdfa0a370148dd60bd800961077f7e97e67dd4
|
|
|
|
Change-Id: I9a0d0e9c3459954aa7b9c68f92cc5d56385ebd18
|
|
Change-Id: I5ae659c1bfb132428a7272d094b5287d144ec7c8
|
|
|
|
|
|
|
|
|
|
In subpixel filters, prefetched source data, unrolled loops,
and interleaved instructions.
In HORIZx4, integrated the idea in Scott's CL (commit:
d22a504d11a15dc3eab666859db0046b5a7d75c5), which was suggested by
Erik/Tamar from Intel. Further tweaking was done to combine row 0,
2, and row 1, 3 in registers to do more 2-row-in-1 operations until
the last add.
Test showed a ~2% decoder speedup.
Change-Id: Ib53d04ede8166c38c3dc744da8c6f737ce26a0e3
|
|
Substantial reworking of the speed vs quality trade offs for
speed 1 and 2.
In this patch I am attempting to freeze the "quality" meaning of
speeds 1 and 2 relative to speed 0 so that in future we can
better evaluate progress.
I am targeting :
Speed 1 quality ~-5% vs speed 0.
Speed 2 quality ~-10% vs speed 0
It is inevitable that quality will still fluctuate a little as we adjust
settings and add new features, but we will attempt to keep as
close as possible to these values. Above speed 2 things will remain
a bit more fluid for now.
In this patch speed 1 is approximately 4-5x as fast as speed 0. This
is similar to before but the quality hit is a lot less. Likewise speed 2
is approximately 2x as fast as speed 1 but is similar in quality to the
previous speed 1 configuration.
Also slight change to behavior of FLAG_EARLY_TERMINATE to insure
all reference frames get at least one rd test. Important for very low
variance regions.
WIP :- Added a new speed level with old speed 4 becoming speed 5.
Speed 3 and 4 tradeoffs still WIP
Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264
|
|
This commit causes use last partition to consider whether a 64x64 has
motion that might make a new partitioning worth while.
Change-Id: I3a57bedef4f3cd961fadbfa96651c206fa36da4a
|
|
|
|
Adding appropriate test vector vp90-2-06-bilinear.webm.
Change-Id: Ia3bbf57318e0cc61a1b724fe751e3f9c7e11b337
|
|
Change-Id: I401536778e3c68ba2b3ae3955c689d005e1f1d59
|
|
|
|
|
|
|
|
|
|
Making appropriate memset inside decode_block now.
Change-Id: I8e944194668c830de08271c8fb6e413251c201d8
|
|
Remove mode_check_freq and mode_test_hit_counts from VP9_COMP.
Change-Id: Iabfd9f841444cd9bf19ac761a9795f140082ce0b
|
|
The codec should effectively run with motion vector of range (-2048, 2047)
in full pixels, for sequences of 1080p and below. Add assertions to clarify
this behavior.
Change-Id: Ia0cac28249f587d8f8882205228fa480263ab313
|
|
|
|
Change-Id: I4b1c6bb9ff615f5872b96ed07dbf0f5e18e63643
|
|
Change-Id: I92fd32476c472e54f52b8d7602a98262b25e6eaf
|
|
|
|
|
|
|
|
|
|
Moving out decode_tokens function calls and adding decode_blocks boolean
variable. We only have to decode if eobtotal > 0, i.e. we have at least one
non-zero coefficient. Also inlining and remove vp9_set_pred_flag_mbskip
function.
Change-Id: I7be38b12ee8206faf0beea2bbf4d52be42575b03
|
|
Change-Id: Ib19dabe697656e4d7e8403d91bedca7cd31d36bf
|
|
Change-Id: Icc6a76a5be77f3e19918155bab3998e0aa32ccf5
|
|
Change-Id: I6c4058aebe834e1a12b7a3fb10484b9ebe60b349
|
|
Change-Id: I84209d382ca5dfc537ee533cd792d8caa0e25cee
|
|
|
|
Interleaved the instructions, reduced register dependency, and
prefetched the source data. This improved the decoder speed
by 0.6% - 2%.
Change-Id: I568067aa0c629b2e58219326899c82aedf7eccca
|