Age | Commit message | Author |
|
This rebase improves on the previous implementations.
The modifications reduce the total clock cycle count.
Speedup: 1.341
Compiled with -O3
Tested with: park_joy_420_720p50.y4m
Change-Id: I940eaf283f60597ca0d9d2e13d518878d55ff02d
|
|
|
|
|
|
Commit a4a5a210 enabled lossless coding, but it incorrectly
disabled the use of skip in the encoder even when skip should be used.
This commit makes sure that skip is enabled even in lossless mode.
Change-Id: I276954f952c6ac68f17a316ebc72f09001228a08
|
|
Since they are used in the encoder only. This commit also reorders the
includes in the files that include vp9_extend.h
Change-Id: I929fc113f2135d3198cd1fc6a17434e5a2f8a459
|
|
|
|
Explicitly constrain the motion search range (in full-pixel units)
to [-1023, +1023]. This is intended to control
the effective motion search range for 4K sequences.
Change-Id: I645539c70885eec0f155781f439d97d333336e88
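
The range constraint described above can be sketched as a simple clamp.
This is a minimal illustration, not the code from the commit: the
SearchRange struct, the MAX_FULL_PEL_VAL name, and the helper functions
are hypothetical; only the [-1023, +1023] full-pixel bound comes from the
commit message.

```c
/* Hypothetical name; the value 1023 (full-pixel units) is from the
 * commit message. */
#define MAX_FULL_PEL_VAL 1023

/* Hypothetical container for a full-pixel search window. */
typedef struct {
  int col_min;
  int col_max;
  int row_min;
  int row_max;
} SearchRange;

/* Clamp v into [lo, hi]. */
static int clamp_int(int v, int lo, int hi) {
  return v < lo ? lo : (v > hi ? hi : v);
}

/* Force every bound of the window into [-1023, +1023]. */
static void limit_search_range(SearchRange *r) {
  r->col_min = clamp_int(r->col_min, -MAX_FULL_PEL_VAL, MAX_FULL_PEL_VAL);
  r->col_max = clamp_int(r->col_max, -MAX_FULL_PEL_VAL, MAX_FULL_PEL_VAL);
  r->row_min = clamp_int(r->row_min, -MAX_FULL_PEL_VAL, MAX_FULL_PEL_VAL);
  r->row_max = clamp_int(r->row_max, -MAX_FULL_PEL_VAL, MAX_FULL_PEL_VAL);
}
```

With a 4K-wide frame, a window derived purely from frame dimensions could
span several thousand pixels; after the clamp, no bound lies outside the
stated range, while windows already inside it are left unchanged.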
|
|
|
|
This removes a lot of operations in setting partition context...
Change-Id: I365e6f5607ece85190cb21443988816dfa510ce3
|
|
This patch followed "Rewrite filter_selectively_horiz for parallel
loopfiltering" commit, and added x86 SSE2 optimization to do
16-pixel filtering in parallel. Also, corrected the declaration
of aligned arrays. For 8-pixel-in-parallel case, improved the
calculation of the masks and filters. Updated the threshold loading
since the thresholds were already duplicated. Updated neon C functions
to call neon loopfilters twice.
Using tulip clip, tests showed it gave a ~1.5% decoder speed gain.
Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35
|
|
until we implement real vp9_idct32x32_34_add_neon."
|
|
|
|
on arm until we implement real vp9_idct32x32_34_add_neon.
This issue is due to commit 47665452f0da3c11427ecb4852535e1787bb0c5b
Merge "Add 32x32 idct function for eob<=34 case".
Change-Id: I56b5f0abc20e7dd1bba521f78a995e85d65ea296
|
|
from vp9_decodframe.{c,h} to vp9_decodeframe.{c,h}
Change-Id: I21ac4b14fc90246e3f16bd90c52c12d126d791f8
|
|
There was only one function in the *.c file, so moving it to vp9_encodemb.c.
Change-Id: I728859d08b3d6c05c33c1c5b21f0ea1d0e0f83af
|
|
|
|
|
|
Removes silly operations inside loop.
Change-Id: I9eeab1e914e715a887f86cf1089de508e2364165
|
|
|
|
|
|
Change-Id: If97ae16a4478717933345b6b9d5bc1b417b8dd84
|
|
Change-Id: Ib748eb287520c794631697204da6ebe19523ce95
|
|
Change-Id: Ic6770072f80dfb54d2725ed96370d4f243a9f474
|
|
Change-Id: I9d18f351abe7614107f34f47eeb38a234a9937c9
|
|
Change-Id: Ic1367d767705377402ebfec0705f9f553a834400
|
|
Change-Id: I4e2ad4b7342681e6ac236356ef3a4927a54f105b
|
|
|
|
|
|
Simplifies the code by implementing band mapping with static arrays.
A lot of the code complexity introduced in a previous patch
disappears.
Change-Id: Ia3fac36e594fb5ad2d55ae141c58bba4c55c2d28
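
As an illustration of the idea above, a band lookup can be expressed as a
constant table indexed by coefficient position instead of a chain of
comparisons. This is a sketch under assumptions: the table contents, the
16-entry size, and both function names are hypothetical and are not taken
from the patch.

```c
#include <stdint.h>

#define NUM_POSITIONS 16 /* hypothetical: one entry per 4x4 coefficient */

/* Illustrative position-to-band table; the values are assumptions. */
static const uint8_t band_table[NUM_POSITIONS] = {
  0, 1, 1, 2, 2, 2, 3, 3, 3, 3, 4, 4, 4, 5, 5, 5
};

/* Table-driven mapping: a single array read per coefficient.
 * The caller must pass 0 <= pos < NUM_POSITIONS. */
static int band_from_table(int pos) {
  return band_table[pos];
}

/* Equivalent branching version, shown only for comparison. */
static int band_from_branches(int pos) {
  if (pos < 1) return 0;
  if (pos < 3) return 1;
  if (pos < 6) return 2;
  if (pos < 10) return 3;
  if (pos < 13) return 4;
  return 5;
}
```

Both functions return the same band for every valid position; the table
version replaces the conditional logic with data, which is the kind of
simplification the message describes.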
|
|
Change-Id: I7d11f1b6075a1115cdc2dcd605225b9c9c9b39c7
|
|
|
|
Removing vp9_read_and_apply_sign macro which was used only once.
Change-Id: I6a1625b720d89fc1291c99deccd6638b705f9b06
|
|
The switch to the rate-correction damping factor
in https://gerrit.chromium.org/gerrit/#/c/67536/ was not conditioned on CBR mode.
Change-Id: I2326704e8ac030a4f7b592dd3fedb94c7dd0644d
|
|
|
|
The step that sums three input samples could cause the
intermediate result to go beyond the 16-bit limit when operating as the
second 1-D transform. This commit fixes the issue.
Change-Id: Iaf512449ac2d25ddd8a806d760afab362c62a516
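
The overflow risk described above can be shown with plain integer
arithmetic. This is a minimal sketch, not the fix from the commit: the
function names are hypothetical, and the example only demonstrates that
the sum of three 16-bit samples needs a wider intermediate type.

```c
#include <stdint.h>

/* Sum three 16-bit samples in a 32-bit intermediate so that the
 * result cannot wrap. */
static int32_t sum3_wide(int16_t a, int16_t b, int16_t c) {
  return (int32_t)a + (int32_t)b + (int32_t)c;
}

/* Report whether a value still fits in a signed 16-bit lane. */
static int fits_int16(int32_t v) {
  return v >= INT16_MIN && v <= INT16_MAX;
}
```

For example, three samples of 20000 each sum to 60000, which exceeds
INT16_MAX (32767). Inputs of that magnitude are plausible for a second
1-D pass because it consumes the already scaled output of the first pass.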
|
|
|
|
|
|
Overall change (using dual buffer scheme for superblocks of both inter
and intra modes) reduces speed 2 runtime:
bluesky_1080p at 6000kbps: 263553ms -> 257441ms
riverbed_1080p at 8000kbps: 233230ms -> 225308ms.
Change-Id: Idf8d70f768a4b0d97b2a8506372c57b7b4022119
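
The reported timings can be converted to relative savings with a small
helper. The function name is hypothetical; the inputs are the
millisecond figures quoted in the message.

```c
/* Percentage reduction in runtime going from before_ms to after_ms. */
static double runtime_reduction_pct(double before_ms, double after_ms) {
  return 100.0 * (before_ms - after_ms) / before_ms;
}
```

Applied to the numbers above, this gives roughly 2.3% for bluesky_1080p
(263553 ms to 257441 ms) and roughly 3.4% for riverbed_1080p (233230 ms
to 225308 ms).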
|
|
|
|
|
|
|
|
|
|
Change-Id: Ib27fc4f3dbe01fe8adfa04a61aaba21b3480e75c
|
|
Change-Id: Ia7f640ca395e8deaac5986f19d11ab18d85eec2d
|
|
Moving because q_index is used only by the encoder.
Change-Id: I0b96175614ed4fd3d76ee56a0ba36258e1e896f6
|
|
|
|
|
|
|
|
|
|
|