  1. Apr 08, 2023
    • x86: add AVX512-IceLake implementation of HBD 16x64 DCT^2 · 5aa3b38f
      Ronald S. Bultje authored
      nop:                                              39.4
      inv_txfm_add_16x64_dct_dct_0_10bpc_c:           2208.0 ( 1.00x)
      inv_txfm_add_16x64_dct_dct_0_10bpc_sse4:         133.5 (16.54x)
      inv_txfm_add_16x64_dct_dct_0_10bpc_avx2:          71.3 (30.98x)
      inv_txfm_add_16x64_dct_dct_0_10bpc_avx512icl:    102.0 (21.66x)
      inv_txfm_add_16x64_dct_dct_1_10bpc_c:          25757.0 ( 1.00x)
      inv_txfm_add_16x64_dct_dct_1_10bpc_sse4:        1366.1 (18.85x)
      inv_txfm_add_16x64_dct_dct_1_10bpc_avx2:         657.6 (39.17x)
      inv_txfm_add_16x64_dct_dct_1_10bpc_avx512icl:    378.9 (67.98x)
      inv_txfm_add_16x64_dct_dct_2_10bpc_c:          25771.0 ( 1.00x)
      inv_txfm_add_16x64_dct_dct_2_10bpc_sse4:        1739.7 (14.81x)
      inv_txfm_add_16x64_dct_dct_2_10bpc_avx2:         772.1 (33.38x)
      inv_txfm_add_16x64_dct_dct_2_10bpc_avx512icl:    469.3 (54.92x)
      inv_txfm_add_16x64_dct_dct_3_10bpc_c:          25775.7 ( 1.00x)
      inv_txfm_add_16x64_dct_dct_3_10bpc_sse4:        1968.1 (13.10x)
      inv_txfm_add_16x64_dct_dct_3_10bpc_avx2:         886.5 (29.08x)
      inv_txfm_add_16x64_dct_dct_3_10bpc_avx512icl:    662.6 (38.90x)
      inv_txfm_add_16x64_dct_dct_4_10bpc_c:          25745.9 ( 1.00x)
      inv_txfm_add_16x64_dct_dct_4_10bpc_sse4:        2330.9 (11.05x)
      inv_txfm_add_16x64_dct_dct_4_10bpc_avx2:        1008.5 (25.53x)
      inv_txfm_add_16x64_dct_dct_4_10bpc_avx512icl:    662.3 (38.88x)
  2. Apr 06, 2023
  3. Mar 31, 2023
    • 0207e0fe
      Matthias Dressel authored
    • x86/itx: Add 32x32 12bpc AVX2 idtx · f6d4c0c4
      Matthias Dressel authored
      inv_txfm_add_32x32_identity_identity_0_12bpc_c:      5785.8 ( 1.00x)
      inv_txfm_add_32x32_identity_identity_0_12bpc_avx2:     20.7 (279.65x)
      inv_txfm_add_32x32_identity_identity_1_12bpc_c:      5896.9 ( 1.00x)
      inv_txfm_add_32x32_identity_identity_1_12bpc_avx2:     20.7 (285.01x)
      inv_txfm_add_32x32_identity_identity_2_12bpc_c:      5799.5 ( 1.00x)
      inv_txfm_add_32x32_identity_identity_2_12bpc_avx2:     68.9 (84.20x)
      inv_txfm_add_32x32_identity_identity_3_12bpc_c:      5798.1 ( 1.00x)
      inv_txfm_add_32x32_identity_identity_3_12bpc_avx2:    140.6 (41.25x)
      inv_txfm_add_32x32_identity_identity_4_12bpc_c:      5803.3 ( 1.00x)
      inv_txfm_add_32x32_identity_identity_4_12bpc_avx2:    308.2 (18.83x)
    • x86/itx: Add 32x16 12bpc AVX2 idtx · 1e602b8b
      Matthias Dressel authored
      inv_txfm_add_32x16_identity_identity_0_12bpc_c:      4138.7 ( 1.00x)
      inv_txfm_add_32x16_identity_identity_0_12bpc_avx2:     30.4 (136.26x)
      inv_txfm_add_32x16_identity_identity_1_12bpc_c:      4147.5 ( 1.00x)
      inv_txfm_add_32x16_identity_identity_1_12bpc_avx2:     30.7 (135.25x)
      inv_txfm_add_32x16_identity_identity_2_12bpc_c:      4138.2 ( 1.00x)
      inv_txfm_add_32x16_identity_identity_2_12bpc_avx2:     98.9 (41.84x)
      inv_txfm_add_32x16_identity_identity_3_12bpc_c:      4136.6 ( 1.00x)
      inv_txfm_add_32x16_identity_identity_3_12bpc_avx2:    167.7 (24.67x)
      inv_txfm_add_32x16_identity_identity_4_12bpc_c:      4156.3 ( 1.00x)
      inv_txfm_add_32x16_identity_identity_4_12bpc_avx2:    242.1 (17.17x)
    • x86/itx: Add 16x32 12bpc AVX2 idtx · e6b194e7
      Matthias Dressel authored
      inv_txfm_add_16x32_identity_identity_0_12bpc_c:      4287.9 ( 1.00x)
      inv_txfm_add_16x32_identity_identity_0_12bpc_avx2:     31.4 (136.66x)
      inv_txfm_add_16x32_identity_identity_1_12bpc_c:      4293.7 ( 1.00x)
      inv_txfm_add_16x32_identity_identity_1_12bpc_avx2:     30.9 (139.07x)
      inv_txfm_add_16x32_identity_identity_2_12bpc_c:      4273.8 ( 1.00x)
      inv_txfm_add_16x32_identity_identity_2_12bpc_avx2:     97.3 (43.92x)
      inv_txfm_add_16x32_identity_identity_3_12bpc_c:      4269.0 ( 1.00x)
      inv_txfm_add_16x32_identity_identity_3_12bpc_avx2:    165.2 (25.83x)
      inv_txfm_add_16x32_identity_identity_4_12bpc_c:      4284.4 ( 1.00x)
      inv_txfm_add_16x32_identity_identity_4_12bpc_avx2:    235.2 (18.22x)
  4. Mar 25, 2023
  5. Mar 23, 2023
  6. Mar 21, 2023
  7. Mar 16, 2023
  8. Mar 13, 2023
  9. Mar 07, 2023
  10. Mar 06, 2023
  11. Mar 03, 2023
  12. Mar 01, 2023
  13. Feb 28, 2023
  14. Feb 27, 2023
  15. Feb 26, 2023
    • Fix building with MSVC after recent commit · ef0fb0b6
      Martin Storsjö authored
      98b0c96d added an include of src/ref.h in src/fg_apply_tmpl.c.
      That template source file is included in
      tests/checkasm/filmgrain.c.
      
      src/ref.h includes <stdatomic.h>. Including this file requires
      declaring a dependency on stdatomic_dependencies in meson, which
      provides the fallback implementation of stdatomic.h when building
      with MSVC.
  16. Feb 25, 2023
  17. Feb 23, 2023
  18. Feb 14, 2023
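
The checkasm figures quoted in the commit messages above pair a cycle count with a ratio in parentheses, and the ratios are consistent with the C baseline's count divided by each implementation's count. A minimal Python sketch of that arithmetic follows, assuming the `name_impl: cycles (ratio x)` line layout seen above. The helper names `parse_checkasm`, `speedups`, and `fastest` are hypothetical and are not part of checkasm or dav1d; the sample lines are copied from the 16x64 commit message.

```python
import re

# Sample lines copied from the 16x64 DCT commit message in this log.
SAMPLE = """\
nop:                                              39.4
inv_txfm_add_16x64_dct_dct_0_10bpc_c:           2208.0 ( 1.00x)
inv_txfm_add_16x64_dct_dct_0_10bpc_avx2:          71.3 (30.98x)
inv_txfm_add_16x64_dct_dct_0_10bpc_avx512icl:    102.0 (21.66x)
inv_txfm_add_16x64_dct_dct_1_10bpc_c:          25757.0 ( 1.00x)
inv_txfm_add_16x64_dct_dct_1_10bpc_avx2:         657.6 (39.17x)
inv_txfm_add_16x64_dct_dct_1_10bpc_avx512icl:    378.9 (67.98x)
"""

# Assumed layout: "<test>_<impl>: <cycles> (<ratio>x)".
# The implementation suffix is whatever follows the last underscore.
LINE_RE = re.compile(r"^\s*(\w+)_([a-z0-9]+):\s+([\d.]+)\s+\(\s*([\d.]+)x\)\s*$")


def parse_checkasm(text):
    """Return {test: {impl: (cycles, reported_ratio)}} for matching lines."""
    table = {}
    for line in text.splitlines():
        m = LINE_RE.match(line)
        if m is None:
            continue  # skips lines without a ratio column, such as "nop:"
        test, impl, cycles, ratio = m.groups()
        table.setdefault(test, {})[impl] = (float(cycles), float(ratio))
    return table


def speedups(table, baseline="c"):
    """Recompute each ratio as baseline cycles / implementation cycles."""
    out = {}
    for test, impls in table.items():
        if baseline not in impls:
            continue
        base_cycles = impls[baseline][0]
        out[test] = {impl: base_cycles / cycles
                     for impl, (cycles, _) in impls.items()}
    return out


def fastest(table, test):
    """Return the implementation with the lowest cycle count for a test."""
    impls = table[test]
    return min(impls, key=lambda impl: impls[impl][0])


if __name__ == "__main__":
    table = parse_checkasm(SAMPLE)
    for test, ratios in speedups(table).items():
        for impl, ratio in ratios.items():
            reported = table[test][impl][1]
            print(f"{test} {impl}: computed {ratio:.2f}x, reported {reported:.2f}x")
```

Because the cycle counts are printed to one decimal place, a recomputed ratio can differ from the reported one in the last digit. On the sample, `fastest` picks avx2 for the `_0` case (71.3 against 102.0 for avx512icl) and avx512icl for the `_1` case (378.9 against 657.6 for avx2).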