- Mar 14, 2019
-
-
Henrik Gramner authored
-
- Mar 06, 2019
-
-
Anton Mitrofanov authored
This will break decoding with older versions of FFmpeg/Libav.
-
Anton Mitrofanov authored
-
-
Warn when the following are used without the appropriate cpuflag: * YMM and ZMM registers * 'pextrw' with a memory operand * GPR instruction set extensions
-
Allows for marking symbols as having limited global scope, similar to using 'hidden' symbol visibility on ELF.
-
-
-
-
-
Allows for automatic command line completion for both options and values. Options such as --input-csp and --input-fmt will dynamically retrieve supported values from libavformat when compiled with lavf support. Execute 'source tools/bash-autocomplete.sh' in bash to enable.
-
Progressive High, Constrained High, and Progressive High 10. Even in Main profile, constraint_set4_flag is now set to 1 if progressive, and constraint_set5_flag is set to 1 if no B-slices are present.
-
-
Around a ~1% speedup to the overall encoding for --slow.
-
No overall speedup, just tidier code.
-
Seems to give about a 1-2% overall speedup on --slow.
-
Around a ~2% speedup to the overall encoding for --slow.
-
Yet another use of xxpermdi, another 10% gain.
-
About 20% faster.
-
Now 4x4 is as slow as C and 4x8 is a 2% faster than before.
-
And use it on the other satd > 8. 5-10% faster depending on the size.
-
10% faster.
-
4x faster than C.
-
-
Most x86-64 operating systems nowadays doesn't even allow .text relocations in object files any more, and there is no measurable overall performance difference from using RIP-relative addressing in x264 asm. Enforcing PIC reduces complexity and simplifies testing.
-
- Mar 03, 2019
-
-
Henrik Gramner authored
-
Also check that CQP mode is not used with 2-pass.
-
Ensures that access is atomic and that other threads sees the actual value of the variable.
-
Bug report by Daniel Deptford.
-
-
-
Henrik Gramner authored
Also fix the string parsing in param_apply_tune() to correctly compare the entire string, not just the first N characters.
-
- Dec 23, 2018
-
-
Bug report by Dirk Fieldhouse.
-
--trellis 0 was missed for it during 8-bit and 10-bit unification. Bug report by Aleksey Vasenev.
-
Bug report by Koby Shina.
-
- Sep 25, 2018
-
-
Bug reported by Nicolas Gaullier
-
- Aug 22, 2018
-
-
Henrik Gramner authored
-
-
- Aug 06, 2018
-
-
1) using xxpermdi + merge instead of 2 merges improves quant_8x8 performance by 5% 2) use vec_splats instead of vec_splat checkasm timings when compiled with gcc: C: AltiVec: before: after: quant_2x2_dc: 57 163 46 quant_4x4_dc: 141 162 57 dequant_4x4_cmp: 104 101 45 dequant_4x4_flat: 104 106 46 dequant_8x8_cmp: 412 208 147 dequant_8x8_flat: 414 212 149
-
Increases overall encoding speed on POWER9 by 8%.
-