  1. Apr 02, 2024
    • checkasm: Add support for the private macOS kperf API for benchmarking · 5e31720b
      Martin Storsjö authored
      
      On AArch64, the performance counter registers usually are
      restricted and not accessible from user space.
      
      On macOS, we currently use mach_absolute_time() as timer on
      aarch64. This measures wallclock time but with a very coarse
      resolution.
      
      There is a private API, kperf, that one can use for getting
      high precision timers though. Unfortunately, it requires running
      the checkasm binary as root (e.g. with sudo).
      
      Also, as it is a private, undocumented API, it can potentially
      change at any time.
      
      This is handled by adding a new meson build option, for switching
      to this timer. If the timer source in checkasm could be changed
      at runtime with an option, this wouldn't need to be a build time
      option.
      
      This allows getting benchmarks like this:
      
      mc_8tap_regular_w16_hv_8bpc_c:              1522.1 ( 1.00x)
      mc_8tap_regular_w16_hv_8bpc_neon:            331.8 ( 4.59x)
      
      Instead of this:
      
      mc_8tap_regular_w16_hv_8bpc_c:                 9.0 ( 1.00x)
      mc_8tap_regular_w16_hv_8bpc_neon:              1.9 ( 4.76x)
      
      Co-authored-by: J. Dekker <jdek@itanimul.li>
      5e31720b
  2. Jan 24, 2024
  3. Jan 23, 2024
  4. Jul 06, 2022
    • Eliminate unused C DSP functions at compile time · bd046635
      Henrik Gramner authored and Henrik Gramner committed
      When compiling with asm enabled there's no point in compiling
      C versions of DSP functions that have asm implementations using
      instruction sets that the compiler can unconditionally use.
      
      E.g. when compiling with -mssse3 we can remove the C version
      of all functions with SSSE3 implementations.
      
      This is accomplished using the compiler's dead code elimination
      functionality.
      
      Can be configured using the new 'trim_dsp' meson option, which
      by default is enabled when compiling in release mode.
      bd046635
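      The dead code elimination described in this commit can be illustrated with a minimal sketch. All names here (`ALWAYS_HAVE_SSSE3`, `add_c`, `add_ssse3`, `select_add`) are hypothetical and are not taken from the project; the "SSSE3" routine is plain C so the sketch builds anywhere. The idea is that once a compile-time constant makes the C fallback unreachable, the compiler can drop the unreferenced static function from the binary.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical macro: 1 when the compiler may use SSSE3 unconditionally
 * (GCC and Clang define __SSSE3__ when building with -mssse3). */
#if defined(__SSSE3__)
#define ALWAYS_HAVE_SSSE3 1
#else
#define ALWAYS_HAVE_SSSE3 0
#endif

typedef void (*add_fn)(int *dst, const int *src, size_t n);

/* Portable C fallback. Because it is static, the compiler is free to
 * discard it when no reachable code refers to it. */
static void add_c(int *dst, const int *src, size_t n) {
    assert(n == 0 || (dst && src));
    for (size_t i = 0; i < n; i++)
        dst[i] += src[i];
}

/* Stand-in for a hand-written SSSE3 routine (plain C in this sketch). */
static void add_ssse3(int *dst, const int *src, size_t n) {
    assert(n == 0 || (dst && src));
    for (size_t i = 0; i < n; i++)
        dst[i] += src[i];
}

/* Chooses an implementation. When ALWAYS_HAVE_SSSE3 is 1 the condition
 * is a compile-time constant, so the final return is dead code and
 * add_c is no longer referenced. */
static add_fn select_add(int cpu_has_ssse3) {
    if (ALWAYS_HAVE_SSSE3 || cpu_has_ssse3)
        return add_ssse3;
    return add_c;
}
```

      Calling `select_add(cpu_flag)` once at init time and storing the result in a function table mirrors the usual DSP dispatch pattern; whether the fallback is actually removed depends on the compiler and optimization level.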
  5. Dec 29, 2021
  6. Oct 18, 2021
  7. Feb 08, 2021
    • tools: add optional xxh3 based muxer · e6168525
      Janne Grunau authored and Jean-Baptiste Kempf committed
      The required 'xxhash.h' header can either be in the system include directory
      or be copied to 'tools/output'.
      
      The xxh3_128bits based muxer shows no significant slowdown compared to
      the null muxer. Decoding times for Chimera-AV1-8bit-1920x1080-6736kbps.ivf
      with 4 frame and 4 tile threads on a Core i7-8550U (turbo boost disabled):
      
      null:  72.5 s
      md5:   99.8 s
      xxh3:  73.8 s
      
      Decoding times for Chimera-AV1-10bit-1920x1080-6191kbps.ivf with 6 frame and 4 tile
      threads on an M1 Mac mini:
      
      null:  27.8 s
      md5:  105.9 s
      xxh3:  28.3 s
      e6168525
  8. Mar 27, 2020
  9. Aug 10, 2019
  10. Jul 02, 2019
  11. May 24, 2019
  12. Feb 19, 2019
  13. Jan 28, 2019
  14. Dec 05, 2018
  15. Oct 30, 2018
  16. Oct 23, 2018
    • fuzzer: add a standalone fuzzing engine 'none' · f8e918a9
      Janne Grunau authored and Ronald S. Bultje committed
      Replaces the boolean 'build_libfuzzer' meson option with 'fuzzing_engine'.
      This allows reproducing fuzzing test cases on systems without libfuzzer.
      Also prevents regressions in the fuzzing test target since it will be
      built by default.
      f8e918a9
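      A standalone engine of the kind this commit describes can be sketched as a small driver that replays input files through the fuzz target without libFuzzer. `LLVMFuzzerTestOneInput` is the standard libFuzzer entry point; everything else (`last_input_size`, `run_one_file`, `replay_files`) is a hypothetical name for illustration, and the target body is a placeholder rather than the project's real fuzz target.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical bookkeeping so the sketch has an observable effect. */
static size_t last_input_size;

/* Standard libFuzzer entry point. This placeholder only records the
 * input size; a real target would pass the bytes to the decoder. */
int LLVMFuzzerTestOneInput(const uint8_t *data, size_t size) {
    (void)data;
    last_input_size = size;
    return 0;
}

/* Reads one file into memory and hands it to the fuzz target.
 * Returns 0 on success, -1 on I/O or allocation failure. */
static int run_one_file(const char *path) {
    assert(path != NULL);
    FILE *f = fopen(path, "rb");
    if (!f)
        return -1;
    if (fseek(f, 0, SEEK_END) != 0) { fclose(f); return -1; }
    long len = ftell(f);
    if (len < 0 || fseek(f, 0, SEEK_SET) != 0) { fclose(f); return -1; }

    uint8_t *buf = malloc(len > 0 ? (size_t)len : 1);
    if (!buf) { fclose(f); return -1; }

    size_t got = fread(buf, 1, (size_t)len, f);
    fclose(f);

    int ret = (got == (size_t)len) ? 0 : -1;
    if (ret == 0)
        LLVMFuzzerTestOneInput(buf, got);
    free(buf);
    return ret;
}

/* What a standalone driver's main() would do: replay every file named
 * on the command line and report failure if any could not be read. */
static int replay_files(int argc, char **argv) {
    int failures = 0;
    for (int i = 1; i < argc; i++)
        if (run_one_file(argv[i]) != 0)
            failures++;
    return failures ? EXIT_FAILURE : EXIT_SUCCESS;
}
```

      In an actual driver, `main(argc, argv)` would simply return `replay_files(argc, argv)`, which is enough to reproduce a saved crash input on a system that has no libFuzzer installed.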
  17. Oct 03, 2018
    • tests: add libfuzzer test target · 858689e1
      Janne Grunau authored
      Disabled by default, enable with `meson -Dbuild_libfuzzer=true -Db_lundef=false ...`.
      Fuzz target improved by the parallel work by Thierry Foucu in !138.
      858689e1
  18. Sep 29, 2018
  19. Sep 27, 2018
  20. Sep 22, 2018
    • Initial decoder implementation. · e2892ffa
      Ronald S. Bultje authored
      With minor contributions from:
       - Jean-Baptiste Kempf <jb@videolan.org>
       - Marvin Scholz <epirat07@gmail.com>
       - Hugo Beauzée-Luyssen <hugo@videolan.org>
      e2892ffa