summaryrefslogtreecommitdiff
path: root/vp9/encoder
AgeCommit message (Collapse)Author
2013-04-10Merge "Make SB coding size-independent." into experimentalRonald S. Bultje
2013-04-09Make SB coding size-independent.Ronald S. Bultje
Merge sb32x32 and sb64x64 functions; allow for rectangular sizes. Code gives identical encoder results before and after. There are a few macros for rectangular block sizes under the sbsegment experiment; this experiment is not yet functional and should not yet be used. Change-Id: I71f93b5d2a1596e99a6f01f29c3f0a456694d728
2013-04-08Clamp inferred motion vectors onlyJingning Han
Clamp only the motion vectors inferred from neighboring reference macroblocks. The motion vectors obtained through motion search in NEWMV mode are constrained during the search process, which allows a relatively larger referencing region than the inferred mvs. Hence further clamping the best mv provided by the motion search may affect the efficacy of NEWMV mode. Synchronized the decoding process. The decoded mvs in NEWMV modes should be guaranteed to fit in the effective range. Put a mv range clamping function there for security purpose. This improves the coding performance of high motion sequences, e.g., derf set: foreman 0.233% husky 0.175% icd 0.135% mother_daughter 0.337% pamphlet 0.561% stdhd set: blue_sky 0.408% city 0.455% also saw sunflower goes down by -0.469%. Change-Id: I3fcbba669e56dab779857a8126a91b926e899cb5
2013-04-05Merge changes Ibbfa68d6,Idb76a0e2 into experimentalJohn Koleszar
* changes: Move EOB to per-plane data Move qcoeff, dqcoeff from BLOCKD to per-plane data
2013-04-05Merge "Remove full-pixel-related code." into experimentalRonald S. Bultje
2013-04-05Remove full-pixel-related code.Ronald S. Bultje
This is a VP8-only feature (part of profile 3) that is unsupported in VP9. Change-Id: I78016eede8d9c834d44d4c517f3e8b8fc2a378b1
2013-04-05Removed a speed feature no longer usedYaowu Xu
Change-Id: Id0c2e44daa936f1d6fb76469fd1bd72a4d7c19fd
2013-04-04Move EOB to per-plane dataJohn Koleszar
Continue migrating data from BLOCKD/MACROBLOCKD to the per-plane structures. Change-Id: Ibbfa68d6da438d32dcbe8df68245ee28b0a2fa2c
2013-04-04Move qcoeff, dqcoeff from BLOCKD to per-plane dataJohn Koleszar
Start grouping data per-plane, as part of refactoring to support additional planes, and chroma planes with other-than 4:2:0 subsampling. Change-Id: Idb76a0e23ab239180c818025bae1f36f1608bb23
2013-04-04make one_shot_q an experimentYaowu Xu
so it is configurable to faciliate testings Change-Id: I247b62736c3a08ec2934793959d1ae605a05efa3
2013-04-04Bugfix in encode_inter_mb_segment_8x8Deb Mukherjee
Fixes an indexing bug. Looks like the bug has been there for a while. Change-Id: I9fc04b0c30754bcb47366ad94a08112925600c4d
2013-04-04Fixed incorrect use of compute_qdelta()Paul Wilkins
This function expects real Q values as inputs not index values. The use-age her impacts the Q chosen for force key frames. Though this is a bug fix I have not yet verified whether following the bug fix the q multiplier value used is correct. Change-Id: I49f6da894d90baeb1e86c820c335f02dc80d3b66
2013-04-03Modify vp9_setup_interp_filters functionYunqing Wang
Took vp9_setup_scale_factors_for_frame() out from vp9_setup_interp_filters(), so that it is only called once per frame instead of per macroblock. Decoder tests showed a 1.5% performance gain. Change-Id: I770cb09eb2140ab85132f82aed388ac0bdd3a0aa
2013-04-02Merge "Renaming sb32_coded and sb64_coded fields." into experimentalJohn Koleszar
2013-04-02Merge "Code cleanup in vp9_onyx_if.c." into experimentalJohn Koleszar
2013-04-02Renaming sb32_coded and sb64_coded fields.Dmitry Kovalev
Renaming sb32_coded to prob_sb32_coded and sb64_coded to prob_sb64_coded. Change-Id: I6de5cad00a57c3e066d53467f8c38cb6073dce11
2013-04-02Code cleanup in vp9_onyx_if.c.Dmitry Kovalev
Using clamp and MIN/MAX functions instead of plain C code. Lower case variable names. Removing redundant parenthesis. Change-Id: Ibf7cc5fbe4fbdb5029049a599af71534176e6f42
2013-04-01Merge "Code cleanup." into experimentalJohn Koleszar
2013-04-01Merge "Removing redundant function arguments." into experimentalJohn Koleszar
2013-04-01Code cleanup.Dmitry Kovalev
Adding multiple16 function, removing redundant code, better formatting. Change-Id: I50195b78ac8ab803e3d05c8fb05a7ca134fab386
2013-04-01Merge "Calculate SSIM over both reconstruction as well as postproc buffer." ↵Ronald S. Bultje
into experimental
2013-04-01Calculate SSIM over both reconstruction as well as postproc buffer.Ronald S. Bultje
We used to calculate SSIM only over the postproc buffer, whereas we calculate PSNR for both. Compared to postproc-SSIM, this is about 0.3% higher for derf, 1.4% lower for hd and 0.5% lower for stdhd, although it is highly variable on a per-clip basis. Change-Id: I8dd491f0f5b4201dedfb15d288c854d5d4caa10f
2013-03-29Merge "Framework changes in nzc to allow more flexibility" into experimentalDeb Mukherjee
2013-03-29Merge "General code cleanup." into experimentalJohn Koleszar
2013-03-28Merge "Adjust mv_ratio_accumulator threshold." into experimentalPaul Wilkins
2013-03-28Merge "Fix mix-up in pt token indexing." into experimentalRonald S. Bultje
2013-03-28Merge "Fix crash when --tune=ssim is selected." into experimentalYaowu Xu
2013-03-28Merge "Save nzcstats." into experimentalRonald S. Bultje
2013-03-28Framework changes in nzc to allow more flexibilityDeb Mukherjee
The patch adds the flexibility to use standard EOB based coding on smaller block sizes and nzc based coding on larger blocksizes. The tx-sizes that use nzc based coding and those that use EOB based coding are controlled by a function get_nzc_used(). By default, this function uses nzc based coding for 16x16 and 32x32 transform blocks, which seem to bridge the performance gap substantially. All sets are now lower by 0.5% to 0.7%, as opposed to ~1.8% before. Change-Id: I06abed3df57b52d241ea1f51b0d571c71e38fd0b
2013-03-28Fix mix-up in pt token indexing.Ronald S. Bultje
This fixes uninitialized reads in the trellis, and probably makes the trellis do something again. Change-Id: Ifac8dae9aa77574bde0954a71d4571c5c556df3c
2013-03-28Adjust mv_ratio_accumulator threshold.Paul Wilkins
This threshold effectively limits the amount of motion from one end of a GF/ARF group to the other. This patch makes the threshold depend on image size. Change-Id: Id45d1d7bced815f86ddd037be53164894b00b82f
2013-03-28Fix crash when --tune=ssim is selected.Paul Wilkins
Crash fix only. No functional change or testing. Change-Id: I0c6d114d024c29fc11ae61666f5938f11b01dd6a
2013-03-27Removing redundant function arguments.Dmitry Kovalev
Almost all arguments for vp9_build_inter32x32_predictors_sb and vp9_build_inter64x64_predictors_sb can be deduced from the first macroblock argument. Change-Id: I5d477a607586d05698d5b3b9b9bc03891dd3fe83
2013-03-27General code cleanup.Dmitry Kovalev
Removing redundant code, lower case variable names, better indentation, better parameter names, adding const to readonly parameters. Change-Id: Ibfdee00f60316fdc5b3f024028c7aaa76a627483
2013-03-27Merge "Cleaning up rate control code." into experimentalJohn Koleszar
2013-03-27Save nzcstats.Ronald S. Bultje
Change-Id: I4a3a9eb9f9d17218a0f0d7e148123d34dae879c2
2013-03-26Merge "Add col/row-based coefficient scanning patterns for 1D 8x8/16x16 ↵Ronald S. Bultje
ADSTs." into experimental
2013-03-26Merge "Redo banding for all transforms." into experimentalRonald S. Bultje
2013-03-26Merge "Use above/left (instead of previous in scan-order) as token context." ↵Ronald S. Bultje
into experimental
2013-03-26Implicit weighted prediction experimentDeb Mukherjee
Adds an experiment to use a weighted prediction of two INTER predictors, where the weight is one of (1/4, 3/4), (3/8, 5/8), (1/2, 1/2), (5/8, 3/8) or (3/4, 1/4), and is chosen implicitly based on consistency of the predictors to the already reconstructed pixels to the top and left of the current macroblock or superblock. Currently the weighting is not applied to SPLITMV modes, which default to the usual (1/2, 1/2) weighting. However the code is in place controlled by a macro. The same weighting is used for Y and UV components, where the weight is derived from analyzing the Y component only. Results (over compound inter-intra experiment) derf: +0.18% yt: +0.34% hd: +0.49% stdhd: +0.23% The experiment suggests bigger benefit for explicitly signaled weights. Change-Id: I5438539ff4485c5752874cd1eb078ff14bf5235a
2013-03-26Add col/row-based coefficient scanning patterns for 1D 8x8/16x16 ADSTs.Ronald S. Bultje
These are mostly just for experimental purposes. I saw small gains (in the 0.1% range) when playing with this on derf. Change-Id: Ib21eed477bbb46bddcd73b21c5c708a5b46abedc
2013-03-26Redo banding for all transforms.Ronald S. Bultje
Now that the first AC coefficient in both directions use the same DC as their context, there no longer is a purpose in letting both have their own band. Merging these two bands allows us to split bands for some of the very high-frequency AC bands. In addition, I'm redoing the banding for the 1D-ADST col/row scans. I don't think the old banding made any sense at all (it merged the last coefficient of the first row/col in the same band as the first two of the second row/col), which was clearly an oversight from the band being applied in scan-order (rather than in their actual position). Now, coefficients at the same position will be in the same band, regardless what scan order is used. I think this makes most sense for the purpose of banding, which is basically "predict energy for this coefficient depending on the energy of context coefficients" (i.e. pt). After full re-training, together with previous patch, derf gains about 1.2-1.3%, and hd/stdhd gain about 0.9-1.0%. Change-Id: I7a0cc12ba724e88b278034113cb4adaaebf87e0c
2013-03-26Use above/left (instead of previous in scan-order) as token context.Ronald S. Bultje
Pearson correlation for above or left is significantly higher than for previous-in-scan-order (absolute values depend on position in scan, but in general, we gain about 0.1-0.2 by using either above or left; using both basically just makes this even better). For eob branch skipping, we continue to use the previous token in scan order. This helps about 0.9% on derf after re-training on a limited data set. Full re-training and results on larger-resolution clips are pending. Note that this commit breaks trellis, so we can probably get further gains out of it by fixing trellis at some later point. Change-Id: Iead68e296fc3a105cca746b5e3da9555d6010cfe
2013-03-26Cleaning up rate control code.Dmitry Kovalev
Lower case variable names, declaration and initialization on the same line, removing redundant casts to double. Change-Id: I7ea3905bed827aa6faac11a78401b85e448b57f9
2013-03-26Add an in-loop deringing experimentJohn Koleszar
Adds a per-frame, strength adjustable, in loop deringing filter. Uses the existing vp9_post_proc_down_and_across 5 tap thresholded blur code, with a brute force search for the threshold. Results almost strictly positive on the YT HD set, either having no effect or helping PSNR in the range of 1-3% (overall average 0.8%). Results more mixed for the CIF set, (-0.5 min, 1.4 max, 0.1 avg). This has an almost strictly negative impact to SSIM, so examining a different filter or a more balanced search heuristic is in order. Other test set results pending. Change-Id: I5ca6ee8fe292dfa3f2eab7f65332423fa1710b58
2013-03-26Merge "Modeling default coef probs with distribution" into experimentalDeb Mukherjee
2013-03-25Modeling default coef probs with distributionDeb Mukherjee
Replaces the default tables for single coefficient magnitudes with those obtained from an appropriate distribution. The EOB node is left unchanged. The model is represeted as a 256-size codebook where the index corresponds to the probability of the Zero or the One node. Two variations are implemented corresponding to whether the Zero node or the One-node is used as the peg. The main advantage is that the default prob tables will become considerably smaller and manageable. Besides there is substantially less risk of over-fitting for a training set. Various distributions are tried and the one that gives the best results is the family of Generalized Gaussian distributions with shape parameter 0.75. The results are within about 0.2% of fully trained tables for the Zero peg variant, and within 0.1% of the One peg variant. The forward updates are optionally (controlled by a macro) model-based, i.e. restricted to only convey probabilities from the codebook. Backward updates can also be optionally (controlled by another macro) model-based, but is turned off by default. Currently model-based forward updates work about the same as unconstrained updates, but there is a drop in performance with backward-updates being model based. The model based approach also allows the probabilities for the key frames to be adjusted from the defaults based on the base_qindex of the frame. Currently the adjustment function is a placeholder that adjusts the prob of EOB and Zero node from the nominal one at higher quality (lower qindex) or lower quality (higher qindex) ends of the range. The rest of the probabilities are then derived based on the model from the adjusted prob of zero. Change-Id: Iae050f3cbcc6d8b3f204e8dc395ae47b3b2192c9
2013-03-22Merge "Disable zero bin mode boost." into experimentalPaul Wilkins
2013-03-22Merge "Minor code clean up" into experimentalPaul Wilkins
2013-03-22Disable zero bin mode boost.Paul Wilkins
As things stand the zero bin mode boost is hurting somewhat. In part this seems to be because the boost applied as is interferes with the rd mode selection loop. Average gains (derf 0.072, yt 0.243, ythd 0.179 std-hd 0.212%) Change-Id: Icaecea3908d9a7352370e49b8fa822f2c2c49dc1