summaryrefslogtreecommitdiff
path: root/vp9
AgeCommit message (Collapse)Author
2013-11-18Improve vp9_iht4x4_16_add_sse2 (x1.341)Abo Talib Mahfoodh
This rebase is a better implementation of the previous ones. Modifications are done to reduce the total clock cycle. Speedup: 1.341 Compiled with -O3 Tested with: park_joy_420_720p50.y4m Change-Id: I940eaf283f60597ca0d9d2e13d518878d55ff02d
2013-11-18Merge "Fixed a bug in commit a4a5a210"Yaowu Xu
2013-11-18Merge "Move vp9_extend.{h,c} from common to encoder"Yaowu Xu
2013-11-18Fixed a bug in commit a4a5a210Yaowu Xu
Commit a4a5a210 enabled lossless coding, but the commit incorrectly disabled the usage of skip in encoder even when skip should be used. This commit make sure that skip is enabled even in lossless mode. Change-Id: I276954f952c6ac68f17a316ebc72f09001228a08
2013-11-18Move vp9_extend.{h,c} from common to encoderYaowu Xu
Since they used in encoder only. This commit also re-order includes for the files that include vp9_extend.h Change-Id: I929fc113f2135d3198cd1fc6a17434e5a2f8a459
2013-11-18Merge "Constrain encoder motion search range"Jingning Han
2013-11-18Constrain encoder motion search rangeJingning Han
Explicitly constrain the upper limit of motion search range (in the unit of full pixel) to be [-1023, +1023]. It is intended to control the effective motion search range for 4K sequences. Change-Id: I645539c70885eec0f155781f439d97d333336e88
2013-11-18Merge "Do horizontal loopfiltering in parallel"Yunqing Wang
2013-11-17partition context update speedupJim Bankoski
This removes a lot of operations in setting partition context... Change-Id: I365e6f5607ece85190cb21443988816dfa510ce3
2013-11-15Do horizontal loopfiltering in parallelYunqing Wang
This patch followed "Rewrite filter_selectively_horiz for parallel loopfiltering" commit, and added x86 SSE2 optimization to do 16-pixel filtering in parallel. Also, corrected the declaration of aligned arrays. For 8-pixel-in-parallel case, improved the calculation of the masks and filters. Updated the threshold loading since the thresholds were already duplicated. Updated neon C functions to call neon loopfilters twice. Using tulip clip, tests showed it gave a ~1.5% decoder speed gain. Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35
2013-11-15Merge "Let the idct vp9_idct32x32_34_add = vp9_idct32x32_1024_add on arm ↵hkuang
until we implenment real vp9_idct32x32_34_add_neon."
2013-11-15Merge "Renamed two files"Yaowu Xu
2013-11-15Let the idct vp9_idct32x32_34_add = vp9_idct32x32_1024_addhkuang
on arm until we implenment real vp9_idct32x32_34_add_neon. This issue is due to commit 47665452f0da3c11427ecb4852535e1787bb0c5b Merge "Add 32x32 idct function for eob<=34 case". Change-Id: I56b5f0abc20e7dd1bba521f78a995e85d65ea296
2013-11-15Renamed two filesYaowu Xu
from vp9_decodframe.{c,h} to vp9_decodeframe.{c,h} Change-Id: I21ac4b14fc90246e3f16bd90c52c12d126d791f8
2013-11-15Removing vp9_encodeintra.{h, c} files.Dmitry Kovalev
There was only one function in *.c file, so moving it to vp9_encodemb.c. Change-Id: I728859d08b3d6c05c33c1c5b21f0ea1d0e0f83af
2013-11-15Merge "Cleaning up vp9_loopfilter.c file."Dmitry Kovalev
2013-11-15Merge "Fix coding format in vp9_idct"Jingning Han
2013-11-15partition plane context speed upJim Bankoski
Removes silly operations inside loop. Change-Id: I9eeab1e914e715a887f86cf1089de508e2364165
2013-11-15Merge "loop filter assert cleanout"Jim Bankoski
2013-11-14Merge "Cleaning up vp9_tile_common.{h, c} files."Dmitry Kovalev
2013-11-14Fix coding format in vp9_idctJingning Han
Change-Id: If97ae16a4478717933345b6b9d5bc1b417b8dd84
2013-11-14fix scalling bug by buffer auto-reallocationAdrian Grange
Change-Id: Ib748eb287520c794631697204da6ebe19523ce95
2013-11-14Cleaning up vp9_loopfilter.c file.Dmitry Kovalev
Change-Id: Ic6770072f80dfb54d2725ed96370d4f243a9f474
2013-11-14Cleaning up vp9_tile_common.{h, c} files.Dmitry Kovalev
Change-Id: I9d18f351abe7614107f34f47eeb38a234a9937c9
2013-11-14Removing unused coefband_trans_8x8plus array from VP9Decompressor.Dmitry Kovalev
Change-Id: Ic1367d767705377402ebfec0705f9f553a834400
2013-11-14loop filter assert cleanoutJim Bankoski
Change-Id: I4e2ad4b7342681e6ac236356ef3a4927a54f105b
2013-11-14Merge "Eliminating usage of txfrm_block_to_raster_block() from encode_block()."Dmitry Kovalev
2013-11-14Merge "For CBR, keep rate-correction damping factor to 2."Marco Paniconi
2013-11-13Simplifies band-getting with a static arrayDeb Mukherjee
Simplifies the code by implementing band mapping with static arrays. A lot of the code complexity introduced in a previous patch disappears. Change-Id: Ia3fac36e594fb5ad2d55ae141c58bba4c55c2d28
2013-11-13Eliminating usage of txfrm_block_to_raster_block() from encode_block().Dmitry Kovalev
Change-Id: I7d11f1b6075a1115cdc2dcd605225b9c9c9b39c7
2013-11-13Merge "Cleaning up decode_coefs() function."Dmitry Kovalev
2013-11-13Cleaning up decode_coefs() function.Dmitry Kovalev
Removing vp9_read_and_apply_sign macro which was used only once. Change-Id: I6a1625b720d89fc1291c99deccd6638b705f9b06
2013-11-13For CBR, keep rate-correction damping factor to 2.Marco Paniconi
The switch to the rate-correction damping factor in https://gerrit.chromium.org/gerrit/#/c/67536/ was not conditioned on CBR mode. Change-Id: I2326704e8ac030a4f7b592dd3fedb94c7dd0644d
2013-11-13Merge "Dual buffer encoding for intra modes"Jingning Han
2013-11-13Fix an overflow issue in SSE2 forward ADSTJingning Han
The step that sums three input samples could potentially cause the intermediate result go beyond 16 bit limit, when operating as the second 1-D transform. This commit fixes the issue. Change-Id: Iaf512449ac2d25ddd8a806d760afab362c62a516
2013-11-13Merge "Replacing raster_block with block in the encoder."Dmitry Kovalev
2013-11-13Merge "Removing function pointers from inter prediction."Dmitry Kovalev
2013-11-13Dual buffer encoding for intra modesJingning Han
Overall change (using dual buffer scheme for superblocks of both inter and intra modes) reduces speed 2 runtime: bluesky_1080p at 6000kbps: 263553ms -> 257441ms riverbed_1080p at 8000kbps: 233230ms -> 225308ms. Change-Id: Idf8d70f768a4b0d97b2a8506372c57b7b4022119
2013-11-13Merge "Syncing write_modes_{b, sb} implementation with decode_modes_{b, sb}."Dmitry Kovalev
2013-11-13Merge "Optimizing set_contexts() function."Dmitry Kovalev
2013-11-13Merge "Use 1D array to store super block filter levels"Yunqing Wang
2013-11-13Merge "mips dsp-ase r2 vp9 decoder intra module optimizations (rebase)"Johann
2013-11-13mips dsp-ase r2 vp9 decoder intra module optimizations (rebase)Parag Salasakar
Change-Id: Ib27fc4f3dbe01fe8adfa04a61aaba21b3480e75c
2013-11-13mips dsp-ase r2 vp9 decoder loopfilter module optimizations (rebase)Parag Salasakar
Change-Id: Ia7f640ca395e8deaac5986f19d11ab18d85eec2d
2013-11-12Moving q_index from MACROBLOCKD to MACROBLOCK.Dmitry Kovalev
Moving because q_index is used only by encoder. Change-Id: I0b96175614ed4fd3d76ee56a0ba36258e1e896f6
2013-11-12Merge "Enable dual buffer rd search and encoding scheme"Jingning Han
2013-11-12Merge "Calculating transform block offsets (x and y) only once."Dmitry Kovalev
2013-11-12Merge "Using max_tx_size instead of bsize when possible."Dmitry Kovalev
2013-11-12Merge "Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK."Dmitry Kovalev
2013-11-12Merge "Adding const to tree pointer inside vp9_extra_bit struct."Dmitry Kovalev