Age | Commit message | Author |
|
Runs about twice as fast as the C implementation.
BUG=webm:1027
Change-Id: I6760d99f4e22259439ca35d746194b12a81bfa71
|
|
BUG=webm:1027
Change-Id: Ibe58781905b372b9fe29dace39b4bfdd33fd0f83
|
|
the difference between src and dst will be signed; the error will be
unsigned.
quiets -fsanitize=integer:
unsigned integer overflow: 4294967295 * 4294967295
Change-Id: I580813093ee46284fde7954520dfcb1188f79268
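One way to read the message above, as a minimal sketch: if the per-pixel difference were held in an unsigned type, a difference of -1 would wrap to 4294967295, and squaring it would produce the sanitizer report quoted in the message. Keeping the difference signed and only the accumulated error unsigned avoids that. The function name and signature below are hypothetical and are not taken from the libvpx source.

```c
#include <stdint.h>

/* Hypothetical helper for illustration only; not the libvpx function. */
static uint64_t sum_squared_error(const uint8_t *src, const uint8_t *dst,
                                  int n) {
  uint64_t sse = 0; /* the accumulated error is unsigned */
  int i;
  for (i = 0; i < n; ++i) {
    /* Signed difference: the operands promote to int, range [-255, 255]. */
    const int diff = src[i] - dst[i];
    /* diff * diff is at most 65025, so it fits in int and is non-negative. */
    sse += (uint64_t)(diff * diff);
  }
  return sse;
}
```

With an unsigned `diff`, the same loop would compute 4294967295 * 4294967295 for any pixel where src is one less than dst, which is the pattern the sanitizer flags.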
|
|
the difference between src and dst will be signed; the error will be
unsigned.
quiets -fsanitize=integer:
unsigned integer overflow: 4294967295 * 4294967295
Change-Id: I502fd707823c4faaa7f587c9cc0312f057e04904
|
|
* changes:
vpx_dsp,add_noise: remove mmx implementation
vpx_dsp: remove mmx variance implementations
|
|
|
|
an sse2 version exists, and sse2 is a reasonable modern baseline.
Change-Id: If31d36c8412d25b53f41b4a93cf02f46802c0c33
|
|
there are sse2 equivalents for all remaining variance implementations
Change-Id: I10b947e73fc0067688181f819b59e47966bec3d2
|
|
Replaced vpx_d45_predictor_4x4_ssse3(), vpx_d45_predictor_8x8_ssse3()
and vpx_d207_predictor_4x4_ssse3() with the newly
created vpx_d45_predictor_4x4_sse2(), vpx_d45_predictor_8x8_sse2()
and vpx_d207_predictor_4x4_sse2() respectively.
It's mostly neutral or slightly worse than ssse3 in good cases and
better than ssse3 in the bad cases (but still worse than using the mmx
regs).
Change-Id: Ib0237ceb71d2c57b8a93fd3170330cfed9d56bdd
|
|
1. test/dct16x16_test.cc
2. test/dct32x32_test.cc
3. test/fdct8x8_test.cc
BUG=webm:1225
Change-Id: I9c9315fbd65ddb3b44f688e01ba265fd22192198
|
|
|
|
|
|
convert the random value to int16 before subtracting 256 from it; this
quiets a ubsan (sanitize=integer) warning
BUG=webm:1225
Change-Id: Ibc2c5a21f30e112bd6c180f7d6a033327c38d0df
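The cast order described above can be sketched as follows. This assumes the random value arrives as an unsigned 32-bit integer in the range [0, 512), which the message does not state; the function name is hypothetical. Subtracting 256 while the value is still unsigned would wrap for inputs below 256, which is the kind of arithmetic the integer sanitizer reports.

```c
#include <stdint.h>

/* Hypothetical stand-in for the test's random helper.
 * Assumption: value is in [0, 512). */
static int16_t to_signed9(uint32_t value) {
  /* Convert first, then subtract, so the arithmetic is done on a signed
   * type and never wraps. The result is in [-256, 255]. */
  return (int16_t)((int16_t)value - 256);
}
```

Writing `value - 256` and casting afterwards gives the same result on common platforms, but the intermediate unsigned wrap-around still happens and is still reported.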
|
|
A function-level timing test shows about a 27% time saving on
a Xeon E5-2680 v2 desktop.
Rename vp9_dct_sse2.c to vp9_dct_intrin_sse2.c for vp9 and
rename dct_sse2.c to dct_intrin_sse2.c for vp10 to avoid
duplicate basenames.
Actually vp9_fwht4x4_mmx/sse2() and vp10_fwht4x4_mmx/sse2()
are identical. TODO: They should be unified later if there is
no intention to keep a duplicate.
Change-Id: I3e537b7bbd9ba417c606cd7c68c4dbbfa583f77d
|
|
|
|
Followed the code style of other lpf functions.
These 2 functions put 2 rows of data in a single xmm register,
so they have similar but not identical filter operations,
and cannot share the same macros.
Change-Id: I3bab55a5d1a1232926ac8fd1f03251acc38302bc
|
|
|
|
Existing tests are only for CBR mode.
Change-Id: Ie3b2cd46236457748e2650901d1a347a730f38af
|
|
Replace MMX with SSE2.
Change-Id: Id8482d2589131f9427e7f36bc64413f058caf31f
|
|
|
|
This reverts commit 2468163e0770108f5216b65445ce05a8241bca21.
Causes valgrind errors for a buffer overread in SubpelVarianceTest.
Change-Id: I448e52c76f815ac199305b71f7d169f2bc167679
|
|
This reverts commit 6dd7f2b50a65373aa906d678cb5a29fb65531a55.
conversion warnings, crashes in 32-bit builds
Change-Id: I529ead34cd93c862dd07c9a29d8542dda2fc20ea
|
|
|
|
|
|
Change-Id: I73e1b9ea6f4c76ae539e2b3292ee4c751d9c7de4
|
|
|
|
Replace MMX with SSE2.
Change-Id: Ia8fcba755952804e347d7d7736f57d1f90c988a0
|
|
Runs about 30% faster than the C implementation.
BUG=webm:1021
Change-Id: I6809d6d84c3077ab619c53298296950e976bdaba
|
|
Change-Id: I99f50bdd5af3f64a029c2f5f6f5fb1ff45bad67e
|
|
Add a cast.
BUG=webm:1225
Change-Id: I34ea18ee816569485c1f1046a81fd2a0ce527ac8
|
|
|
|
|
|
|
|
- Add frame limit argument.
- Make all arguments required.
- Enable the VP9 simple encoder test.
Change-Id: I11d228b358ff90c60ea92e02760cb476434ea571
|
|
- Remove twopass_encoder test TODO re frame limit.
- Enable VP9 twopass_encoder test.
Change-Id: I0649f15aabef79a63891e997fd20b212af5672e6
|
|
there are sse2 equivalents, and sse2 is a reasonable modern baseline
Removed mmx variance functions:
vpx_get_mb_ss_mmx()
vpx_get8x8var_mmx()
vpx_get4x4var_mmx()
vpx_variance4x4_mmx()
vpx_variance8x8_mmx()
vpx_mse16x16_mmx()
vpx_variance16x16_mmx()
vpx_variance16x8_mmx()
vpx_variance8x16_mmx()
Change-Id: Iffaf85344c6676a3dd337c0645a2dd5deb2f86a1
|
|
there are sse2 equivalents, and sse2 is a reasonable modern baseline
Change-Id: Ibbe536a5ad1c2cccef6bdcc75c13b3dde35a56ba
|
|
|
|
Add a cast.
BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1216
Change-Id: I40627de387bc9cfba37860e7a0a4f2d4524f3431
|
|
|
|
This change makes the C code match the assembly and removes the TODOs
associated with getting this to work.
Change-Id: Ie32e9ebb584a9d60399662d8bcb71b74fbd19d1e
|
|
|
|
In so doing, this fixes a couple of bugs:
vpx_plane_add_noise.c needed to subtract a clamp instead of adding it,
and the assembly (mmx, sse) assumed that parameters were
contiguous in memory, which was not true.
Change-Id: I76f2c43cf54bfc838eb2edf8a443eaaa7565d7b5
|
|
this avoids the decoder test, which was only correct for vp9; vp10 was
missed in the earlier change
Change-Id: Ib789c906d440c0e4169052cf64c74d5e4b196caa
|
|
Brings f03e238f to master.
Change-Id: I7f7754e7d1288b103a4510303d10afc68a7d8ca8
|
|
|
|
ADL will look this up from the callsite namespace iff it is declared
before the callsite or from the parent namespace of the class type (the
global namespace).
This patch has been tested on MSVS 2015 and clang-3.8.
Change-Id: I00ba74712c9b617b9d81761abed1e14d8f25d8e3
|
|
* changes:
vp9_ethread_test: avoid crash when decoder is unavailable
resize_test: skip VP9 decode checks if it's disabled
|
|
|
|
Remove unused variable buffer_size and rename bytes_in_buffer to
buffer_size in webm_read_frame.
Change-Id: I18b6ccbb044cd366610a5e39d83883cc0b174051
|