summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2011-07-22Merge "fix sharpness bug and clean up"Johann
2011-07-22fix sharpness bug and clean upJohann
sharpness was not recalculated in vp8cx_pick_filter_level_fast remove last_filter_type. all values are calculated, don't need to update the lfi data when it changes. always use cm->sharpness_level. the extra indirection was annoying. don't track last frame_type or sharpness_level manually. frame type only matters for motion search and sharpness_level is taken care of in frame_init move function declarations to their proper header Change-Id: I7ef037bd4bf8cf5e37d2d36bd03b5e22a2ad91db
2011-07-22Merge "Preload reference area to an intermediate buffer in sub-pixel motion ↵Yunqing Wang
search"
2011-07-22Preload reference area to an intermediate buffer in sub-pixel motion searchYunqing Wang
In sub-pixel motion search, the search range is small(+/- 3 pixels). Preload whole search area from reference buffer into a 32-byte aligned buffer. Then in search, load reference data from this buffer instead. This keeps data in cache, and reduces the crossing cache- line penalty. For tulip clip, tests on Intel Core2 Quad machine(linux) showed encoder speed improvement: 3.4% at --rt --cpu-used =-4 2.8% at --rt --cpu-used =-3 2.3% at --rt --cpu-used =-2 2.2% at --rt --cpu-used =-1 Test on Atom notebook showed only 1.1% speed improvement(speed=-4). Test on Xeon machine also showed less improvement, since unaligned data access latency is greatly reduced in newer cores. Next, I will apply similar idea to other 2 sub-pixel search functions for encoding speed > 4. Make this change exclusively for x86 platforms. Change-Id: Ia7bb9f56169eac0f01009fe2b2f2ab5b61d2eb2f
2011-07-21Merge "Add .size directive to ARM asm functions."Johann
2011-07-21Merge "Mark ARM asm objects as allowing a non-executable stack."Johann
2011-07-21Add .size directive to ARM asm functions.Timothy B. Terriberry
This makes them show up properly in debugging tools like gdb and valgrind. Change-Id: I0c72548a1090de88ba226314e5efe63360b7e07f
2011-07-21Mark ARM asm objects as allowing a non-executable stack.Timothy B. Terriberry
This adds the magic .note.GNU-stack section at the end of each ARM asm file (when built with gas), indicating that a non-executable stack is allowed. Without this section, the linker will assume the object requires an executable stack by default, forcing an executable stack for the entire program. Change-Id: Ie86de6a449b52d392b9e5e0479833ed8c508ee65
2011-07-21Merge "Increase chrow row alignment to 16 bytes."John Koleszar
2011-07-21Merge "Add improvements made in good-quality mode to real-time mode"Yunqing Wang
2011-07-20Increase chrow row alignment to 16 bytes.Timothy B. Terriberry
This is done by expanding luma row to 32-byte alignment, since there is currently a bunch of code that assumes that uv_stride == y_stride/2 (see, for example, vp8/common/postproc.c, common/reconinter.c, common/arm/neon/recon16x16mb_neon.asm, encoder/temporal_filter.c, and possibly others; I haven't done a full audit). It also uses replaces the hardcoded border of 16 in a number of encoder buffers with VP8BORDERINPIXELS (currently 32), as the chroma rows start at an offset of border/2. Together, these two changes have the nice advantage that simply dumping the frame memory as a contiguous blob produces a valid, if padded, image. Change-Id: Iaf5ea722ae5c82d5daa50f6e2dade9de753f1003
2011-07-20encoder: don't set the fragment bit for the last partitionAttila Nagy
Change-Id: Icb4e4f0d7c3074a8507852178be87541a1cb5bac
2011-07-19Merge "Moved vp8_encode_bool into boolhuff.h"Scott LaVarnway
2011-07-19Revert "Disable __longjmp_chk protection"John Koleszar
This reverts commit b73a3693e581583e9ec676f4396d0c3d173e2462. This version of the check doesn't work with generic-gnu, and figuring out the correct symbol version at configure time is probably more work than this is worth. May revisit in the future. Change-Id: I6c75e88bd3bd82a4b21e09a25780fe53aacb7d70
2011-07-19remove old armv5 codeJohann
armv5 dequantizer is not referenced Change-Id: Id1cc617dcee35ebd6a406816ec6aaa26e8bbc8ad
2011-07-19Moved vp8_encode_bool into boolhuff.hScott LaVarnway
allowing the compiler to inline this function. For real-time encodes, this gave a boost of 1% to 2.5%, depending on the speed setting. Change-Id: I3929d176cca086b4261267b848419d5bcff21c02
2011-07-18Improved 1-pass CBR rate controlJohn Koleszar
This patch attempts to improve the handling of CBR streams with respect to the short term buffering requirements. The "buffer level" is changed to be an average over the rc buffer, rather than a long running average. Overshoot is also tracked over the same interval and the golden frame targets suppressed accordingly to correct for overly aggressive boosting. Testing shows that this is fairly consistently positive in one metric or another -- some clips that show significant decreases in quality have better buffering characteristics, others show improvenents in both. Change-Id: I924c89aa9bdb210271f2e03311e63de3f1f8f920
2011-07-18Merge "Disable __longjmp_chk protection"John Koleszar
2011-07-18Merge "Fixed rate histogram calculation"John Koleszar
2011-07-18Fixed rate histogram calculationTero Rintaluoma
Using small values for --buf-sz= in command line causes floating point exception due to division by zero. Change-Id: Ibfe2d44db922993a78ebc9a4a1087d9625de48ae
2011-07-15Merge "Tokenize MB optimized"Scott LaVarnway
2011-07-15Merge "Fix vpxenc encoding incorrect webm file header on big endian ↵Yunqing Wang
machines(Issue 331)"
2011-07-15Tokenize MB optimizedTero Rintaluoma
Optimized C-code of the following functions: - vp8_tokenize_mb - tokenize1st_order_b - tokenize2nd_order_b Gives ~1-5% speed-up for RT encoding on Cortex-A8/A9 depending on encoding parameters. Change-Id: I6be86104a589a06dcbc9ed3318e8bf264ef4176c
2011-07-14bug fix vpx_copy_and_extend_frame size issueJames Berry
vpx_copy_and_extend_frame could incorrectly resize uv frames which could result in a crash. Change-Id: Ie96f7078b1e328b3907a06eebeee44ca39a2e898
2011-07-14Remove unused speed featuresJohn Koleszar
min_fs_radius, max_fs_radius, full_freq were set but never read. Change-Id: I82657f4e7f2ba2acc3cbc3faa5ec0de5b9c6ec74
2011-07-13Merge "Better allocate yuv buffers."Fritz Koenig
2011-07-13Merge "Fix unnecessary casting of B_PREDICTION_MODE (issue 349)"Yunqing Wang
2011-07-13Disable __longjmp_chk protectionJohn Koleszar
glibc implements some checking on longjmp() calls by replacing it with an internal function __longjmp_chk(), when FORTIFY_SOURCE is defined. This can be problematic when compiling the library under one version of glibc and running it under another. Work around this issue for the one symbol affected for now, before taking out the undef hammer. Fixes http://code.google.com/p/webm/issues/detail?id=166 Change-Id: Ifc5e25cdec17915e394711f2185b3e9214572d10
2011-07-13Fix unnecessary casting of B_PREDICTION_MODE (issue 349)Yunqing Wang
Minor fix. Change-Id: Iaf93f6e47e882a33c479e57c7a0d0bf321e291c0
2011-07-13Add improvements made in good-quality mode to real-time modeYunqing Wang
Several improvements we made in good-quality mode can be added into real-time mode to speed up encoding in speed 1, 2, and 3 with small quality loss. Tests using tulip clip showed: --rt --cpu-used=-1 (before change) PSNR: 38.028 time: 1m33.195s (after change) PSNR: 38.014 time: 1m20.851s --rt --cpu-used=-2 (before change) PSNR: 37.773 time: 0m57.650s (after change) PSNR: 37.759 time: 0m54.594s --rt --cpu-used=-3 (before change) PSNR: 37.392 time: 0m42.865s (after change) PSNR: 37.375 time: 0m41.949s Change-Id: I76ab2a38d72bc5efc91f6fe20d332c472f6510c9
2011-07-13Better allocate yuv buffers.Fritz Koenig
Previously allocated more memory than necessary for yuv buffers. This makes it harder to track bugs with reading uninitialized data. Change-Id: I510f7b298d3c647c869be6e5d51608becc63cce9
2011-07-13Merge "Reduce motion vector search on alt-ref frame."Fritz Koenig
2011-07-13Merge "Remove rotting NDS_NITRO code."John Koleszar
2011-07-13Merge "update x86 asm for loopfilter"Johann
2011-07-13Merge "Update armv6 loopfilter to new interface"Johann
2011-07-13Merge "Update armv7 loopfilter to new interface"Johann
2011-07-13Merge "New loop filter interface"Johann
2011-07-12Remove rotting NDS_NITRO code.Fritz Koenig
Code has not been used and is no longer relevant. Change-Id: I38590513da7c7a436804ff8a1a3805d9697f575d
2011-07-12Fix vpxenc encoding incorrect webm file header on big endian machines(Issue 331)Yunqing Wang
As reported in issue 331, vpxenc encoded incorrect webm file header on big endian machines. This change fixed that. Change-Id: I31924ebd476a87f3e88b9b5424540bf781d2b86f
2011-07-12Update armv6 loopfilter to new interfaceAttila Nagy
Change-Id: I5fe581d797571a7a9432fbd17fc557591d0c1afa
2011-07-12Update armv7 loopfilter to new interfaceAttila Nagy
Change-Id: I65105a9c63832669237e6a6a7fcb4ea3ea683346
2011-07-11Reduce motion vector search on alt-ref frame.Fritz Koenig
Clamp mv search to accomodate subpixel filtering of UV mv. Change-Id: Iab3ed405993ef6bf779ad7cf60863153068fb7d1
2011-07-08Minor change in pick_inter_mode()Yunqing Wang
Scott suggested to move vp8_mv_pred() under "case NEWMV" to save extra checks. Change-Id: I09e69892f34a08dd425a4d81cfcc83674e344a20
2011-07-08Merge "Adjust full-pixel clamping and motion vector limit calculation"Yunqing Wang
2011-07-08Adjust full-pixel clamping and motion vector limit calculationYunqing Wang
Do mvp clamping in full-pixel precision instead of 1/8-pixel precision to avoid error caused by right shifting operation. Also, further fixed the motion vector limit calculation in change: b7480454706a6b15bf091e659cd6227ab373c1a6 Change-Id: Ied88a4f7ddfb0476eb9f7afc6ceeddbf209fffd7
2011-07-08update x86 asm for loopfilterJohann
Change-Id: I1ed739522db7c00c189851c7095c1b64ef6412ce
2011-07-08Merge "clean up warnings when building arm with rtcd"Johann
2011-07-08New loop filter interfaceAttila Nagy
Separate simple filter with reduced no. of parameters. MB filter level picking based on precalculated table. Level table updated for each frame. Inside and edge limits precalculated and updated just when sharpness changes. HEV threshhold is constant. ARM targets use scalars and others vectors. Change works only with --target=generic-gnu All other targets have to be updated! Change-Id: I6b73aca6b525075b20129a371699b2561bd4d51c
2011-07-07Merge "Set VPX_FRAME_IS_DROPPABLE"John Koleszar
2011-07-07Set VPX_FRAME_IS_DROPPABLEJohn Koleszar
Allow the encoder to inform the application that the encoded frame will not be used as a reference. Change-Id: I90e41962325ef73d44da03327deb340d6f7f4860