Skip to content
Snippets Groups Projects
  1. Nov 15, 2021
  2. Nov 13, 2021
    • Matthias Dressel's avatar
      x86/itx: Add 8x8 12bpc AVX2 transforms · 31820a5e
      Matthias Dressel authored
      inv_txfm_add_8x8_adst_adst_0_12bpc_c: 1997.9
      inv_txfm_add_8x8_adst_adst_0_12bpc_avx2: 185.7
      inv_txfm_add_8x8_adst_adst_1_12bpc_c: 2009.8
      inv_txfm_add_8x8_adst_adst_1_12bpc_avx2: 185.7
      inv_txfm_add_8x8_adst_dct_0_12bpc_c: 1991.0
      inv_txfm_add_8x8_adst_dct_0_12bpc_avx2: 161.3
      inv_txfm_add_8x8_adst_dct_1_12bpc_c: 1977.0
      inv_txfm_add_8x8_adst_dct_1_12bpc_avx2: 161.4
      inv_txfm_add_8x8_adst_flipadst_0_12bpc_c: 2017.6
      inv_txfm_add_8x8_adst_flipadst_0_12bpc_avx2: 184.2
      inv_txfm_add_8x8_adst_flipadst_1_12bpc_c: 2018.9
      inv_txfm_add_8x8_adst_flipadst_1_12bpc_avx2: 184.2
      inv_txfm_add_8x8_adst_identity_0_12bpc_c: 1407.2
      inv_txfm_add_8x8_adst_identity_0_12bpc_avx2: 95.7
      inv_txfm_add_8x8_adst_identity_1_12bpc_c: 1405.9
      inv_txfm_add_8x8_adst_identity_1_12bpc_avx2: 95.8
      inv_txfm_add_8x8_dct_adst_0_12bpc_c: 2024.2
      inv_txfm_add_8x8_dct_adst_0_12bpc_avx2: 156.9
      inv_txfm_add_8x8_dct_adst_1_12bpc_c: 2018.8
      inv_txfm_add_8x8_dct_adst_1_12bpc_avx2: 160.1
      inv_txfm_add_8x8_dct_dct_0_12bpc_c: 213.0
      inv_txfm_add_8x8_dct_dct_0_12bpc_avx2: 24.8
      inv_txfm_add_8x8_dct_dct_1_12bpc_c: 2008.6
      inv_txfm_add_8x8_dct_dct_1_12bpc_avx2: 139.0
      inv_txfm_add_8x8_dct_flipadst_0_12bpc_c: 2012.3
      inv_txfm_add_8x8_dct_flipadst_0_12bpc_avx2: 159.2
      inv_txfm_add_8x8_dct_flipadst_1_12bpc_c: 2005.1
      inv_txfm_add_8x8_dct_flipadst_1_12bpc_avx2: 158.7
      inv_txfm_add_8x8_dct_identity_0_12bpc_c: 1470.4
      inv_txfm_add_8x8_dct_identity_0_12bpc_avx2: 71.7
      inv_txfm_add_8x8_dct_identity_1_12bpc_c: 1477.8
      inv_txfm_add_8x8_dct_identity_1_12bpc_avx2: 70.7
      inv_txfm_add_8x8_flipadst_adst_0_12bpc_c: 2006.1
      inv_txfm_add_8x8_flipadst_adst_0_12bpc_avx2: 183.6
      inv_txfm_add_8x8_flipadst_adst_1_12bpc_c: 1987.6
      inv_txfm_add_8x8_flipadst_adst_1_12bpc_avx2: 183.6
      inv_txfm_add_8x8_flipadst_dct_0_12bpc_c: 1986.6
      inv_txfm_add_8x8_flipadst_dct_0_12bpc_avx2: 163.0
      inv_txfm_add_8x8_flipadst_dct_1_12bpc_c: 1979.3
      inv_txfm_add_8x8_flipadst_dct_1_12bpc_avx2: 163.1
      inv_txfm_add_8x8_flipadst_flipadst_0_12bpc_c: 2004.0
      inv_txfm_add_8x8_flipadst_flipadst_0_12bpc_avx2: 184.3
      inv_txfm_add_8x8_flipadst_flipadst_1_12bpc_c: 2003.9
      inv_txfm_add_8x8_flipadst_flipadst_1_12bpc_avx2: 184.3
      inv_txfm_add_8x8_flipadst_identity_0_12bpc_c: 1433.5
      inv_txfm_add_8x8_flipadst_identity_0_12bpc_avx2: 95.3
      inv_txfm_add_8x8_flipadst_identity_1_12bpc_c: 1425.4
      inv_txfm_add_8x8_flipadst_identity_1_12bpc_avx2: 96.3
      inv_txfm_add_8x8_identity_adst_0_12bpc_c: 1456.5
      inv_txfm_add_8x8_identity_adst_0_12bpc_avx2: 115.8
      inv_txfm_add_8x8_identity_adst_1_12bpc_c: 1453.5
      inv_txfm_add_8x8_identity_adst_1_12bpc_avx2: 115.8
      inv_txfm_add_8x8_identity_dct_0_12bpc_c: 1450.0
      inv_txfm_add_8x8_identity_dct_0_12bpc_avx2: 93.5
      inv_txfm_add_8x8_identity_dct_1_12bpc_c: 1447.5
      inv_txfm_add_8x8_identity_dct_1_12bpc_avx2: 94.3
      inv_txfm_add_8x8_identity_flipadst_0_12bpc_c: 1451.7
      inv_txfm_add_8x8_identity_flipadst_0_12bpc_avx2: 114.0
      inv_txfm_add_8x8_identity_flipadst_1_12bpc_c: 1456.4
      inv_txfm_add_8x8_identity_flipadst_1_12bpc_avx2: 114.0
      inv_txfm_add_8x8_identity_identity_0_12bpc_c: 892.3
      inv_txfm_add_8x8_identity_identity_0_12bpc_avx2: 33.7
      inv_txfm_add_8x8_identity_identity_1_12bpc_c: 897.2
      inv_txfm_add_8x8_identity_identity_1_12bpc_avx2: 33.1
      31820a5e
    • Matthias Dressel's avatar
      x86/itx: Add 8x4 12bpc AVX2 transforms · 53cf6a3b
      Matthias Dressel authored
      inv_txfm_add_8x4_adst_adst_0_12bpc_c: 882.1
      inv_txfm_add_8x4_adst_adst_0_12bpc_avx2: 113.7
      inv_txfm_add_8x4_adst_adst_1_12bpc_c: 882.5
      inv_txfm_add_8x4_adst_adst_1_12bpc_avx2: 113.8
      inv_txfm_add_8x4_adst_dct_0_12bpc_c: 928.0
      inv_txfm_add_8x4_adst_dct_0_12bpc_avx2: 109.2
      inv_txfm_add_8x4_adst_dct_1_12bpc_c: 924.9
      inv_txfm_add_8x4_adst_dct_1_12bpc_avx2: 109.2
      inv_txfm_add_8x4_adst_flipadst_0_12bpc_c: 889.9
      inv_txfm_add_8x4_adst_flipadst_0_12bpc_avx2: 114.3
      inv_txfm_add_8x4_adst_flipadst_1_12bpc_c: 886.0
      inv_txfm_add_8x4_adst_flipadst_1_12bpc_avx2: 114.8
      inv_txfm_add_8x4_adst_identity_0_12bpc_c: 832.2
      inv_txfm_add_8x4_adst_identity_0_12bpc_avx2: 88.8
      inv_txfm_add_8x4_adst_identity_1_12bpc_c: 834.6
      inv_txfm_add_8x4_adst_identity_1_12bpc_avx2: 89.0
      inv_txfm_add_8x4_dct_adst_0_12bpc_c: 870.3
      inv_txfm_add_8x4_dct_adst_0_12bpc_avx2: 96.3
      inv_txfm_add_8x4_dct_adst_1_12bpc_c: 884.6
      inv_txfm_add_8x4_dct_adst_1_12bpc_avx2: 96.3
      inv_txfm_add_8x4_dct_dct_0_12bpc_c: 116.1
      inv_txfm_add_8x4_dct_dct_0_12bpc_avx2: 24.5
      inv_txfm_add_8x4_dct_dct_1_12bpc_c: 925.1
      inv_txfm_add_8x4_dct_dct_1_12bpc_avx2: 92.3
      inv_txfm_add_8x4_dct_flipadst_0_12bpc_c: 882.7
      inv_txfm_add_8x4_dct_flipadst_0_12bpc_avx2: 97.0
      inv_txfm_add_8x4_dct_flipadst_1_12bpc_c: 882.1
      inv_txfm_add_8x4_dct_flipadst_1_12bpc_avx2: 97.0
      inv_txfm_add_8x4_dct_identity_0_12bpc_c: 827.5
      inv_txfm_add_8x4_dct_identity_0_12bpc_avx2: 72.4
      inv_txfm_add_8x4_dct_identity_1_12bpc_c: 827.8
      inv_txfm_add_8x4_dct_identity_1_12bpc_avx2: 73.8
      inv_txfm_add_8x4_flipadst_adst_0_12bpc_c: 899.5
      inv_txfm_add_8x4_flipadst_adst_0_12bpc_avx2: 113.2
      inv_txfm_add_8x4_flipadst_adst_1_12bpc_c: 898.8
      inv_txfm_add_8x4_flipadst_adst_1_12bpc_avx2: 113.3
      inv_txfm_add_8x4_flipadst_dct_0_12bpc_c: 945.7
      inv_txfm_add_8x4_flipadst_dct_0_12bpc_avx2: 108.3
      inv_txfm_add_8x4_flipadst_dct_1_12bpc_c: 945.6
      inv_txfm_add_8x4_flipadst_dct_1_12bpc_avx2: 108.3
      inv_txfm_add_8x4_flipadst_flipadst_0_12bpc_c: 903.6
      inv_txfm_add_8x4_flipadst_flipadst_0_12bpc_avx2: 113.9
      inv_txfm_add_8x4_flipadst_flipadst_1_12bpc_c: 902.8
      inv_txfm_add_8x4_flipadst_flipadst_1_12bpc_avx2: 114.2
      inv_txfm_add_8x4_flipadst_identity_0_12bpc_c: 856.6
      inv_txfm_add_8x4_flipadst_identity_0_12bpc_avx2: 88.3
      inv_txfm_add_8x4_flipadst_identity_1_12bpc_c: 848.8
      inv_txfm_add_8x4_flipadst_identity_1_12bpc_avx2: 87.4
      inv_txfm_add_8x4_identity_adst_0_12bpc_c: 583.2
      inv_txfm_add_8x4_identity_adst_0_12bpc_avx2: 69.6
      inv_txfm_add_8x4_identity_adst_1_12bpc_c: 584.3
      inv_txfm_add_8x4_identity_adst_1_12bpc_avx2: 69.6
      inv_txfm_add_8x4_identity_dct_0_12bpc_c: 632.9
      inv_txfm_add_8x4_identity_dct_0_12bpc_avx2: 65.3
      inv_txfm_add_8x4_identity_dct_1_12bpc_c: 629.6
      inv_txfm_add_8x4_identity_dct_1_12bpc_avx2: 65.8
      inv_txfm_add_8x4_identity_flipadst_0_12bpc_c: 587.0
      inv_txfm_add_8x4_identity_flipadst_0_12bpc_avx2: 71.0
      inv_txfm_add_8x4_identity_flipadst_1_12bpc_c: 586.9
      inv_txfm_add_8x4_identity_flipadst_1_12bpc_avx2: 71.0
      inv_txfm_add_8x4_identity_identity_0_12bpc_c: 533.0
      inv_txfm_add_8x4_identity_identity_0_12bpc_avx2: 45.3
      inv_txfm_add_8x4_identity_identity_1_12bpc_c: 539.7
      inv_txfm_add_8x4_identity_identity_1_12bpc_avx2: 45.9
      53cf6a3b
    • Matthias Dressel's avatar
      x86/itx: Add 4x8 12bpc AVX2 transforms · 241753f5
      Matthias Dressel authored
      inv_txfm_add_4x8_adst_adst_0_12bpc_c: 900.8
      inv_txfm_add_4x8_adst_adst_0_12bpc_avx2: 118.8
      inv_txfm_add_4x8_adst_adst_1_12bpc_c: 893.7
      inv_txfm_add_4x8_adst_adst_1_12bpc_avx2: 118.8
      inv_txfm_add_4x8_adst_dct_0_12bpc_c: 890.2
      inv_txfm_add_4x8_adst_dct_0_12bpc_avx2: 104.8
      inv_txfm_add_4x8_adst_dct_1_12bpc_c: 887.4
      inv_txfm_add_4x8_adst_dct_1_12bpc_avx2: 104.8
      inv_txfm_add_4x8_adst_flipadst_0_12bpc_c: 919.6
      inv_txfm_add_4x8_adst_flipadst_0_12bpc_avx2: 116.6
      inv_txfm_add_4x8_adst_flipadst_1_12bpc_c: 912.1
      inv_txfm_add_4x8_adst_flipadst_1_12bpc_avx2: 116.6
      inv_txfm_add_4x8_adst_identity_0_12bpc_c: 613.5
      inv_txfm_add_4x8_adst_identity_0_12bpc_avx2: 42.8
      inv_txfm_add_4x8_adst_identity_1_12bpc_c: 608.7
      inv_txfm_add_4x8_adst_identity_1_12bpc_avx2: 43.3
      inv_txfm_add_4x8_dct_adst_0_12bpc_c: 951.7
      inv_txfm_add_4x8_dct_adst_0_12bpc_avx2: 113.8
      inv_txfm_add_4x8_dct_adst_1_12bpc_c: 949.0
      inv_txfm_add_4x8_dct_adst_1_12bpc_avx2: 113.1
      inv_txfm_add_4x8_dct_dct_0_12bpc_c: 118.6
      inv_txfm_add_4x8_dct_dct_0_12bpc_avx2: 24.5
      inv_txfm_add_4x8_dct_dct_1_12bpc_c: 942.4
      inv_txfm_add_4x8_dct_dct_1_12bpc_avx2: 99.2
      inv_txfm_add_4x8_dct_flipadst_0_12bpc_c: 959.3
      inv_txfm_add_4x8_dct_flipadst_0_12bpc_avx2: 113.9
      inv_txfm_add_4x8_dct_flipadst_1_12bpc_c: 964.1
      inv_txfm_add_4x8_dct_flipadst_1_12bpc_avx2: 114.3
      inv_txfm_add_4x8_dct_identity_0_12bpc_c: 659.9
      inv_txfm_add_4x8_dct_identity_0_12bpc_avx2: 41.9
      inv_txfm_add_4x8_dct_identity_1_12bpc_c: 658.6
      inv_txfm_add_4x8_dct_identity_1_12bpc_avx2: 41.6
      inv_txfm_add_4x8_flipadst_adst_0_12bpc_c: 906.6
      inv_txfm_add_4x8_flipadst_adst_0_12bpc_avx2: 117.3
      inv_txfm_add_4x8_flipadst_adst_1_12bpc_c: 907.7
      inv_txfm_add_4x8_flipadst_adst_1_12bpc_avx2: 117.3
      inv_txfm_add_4x8_flipadst_dct_0_12bpc_c: 890.3
      inv_txfm_add_4x8_flipadst_dct_0_12bpc_avx2: 104.6
      inv_txfm_add_4x8_flipadst_dct_1_12bpc_c: 895.6
      inv_txfm_add_4x8_flipadst_dct_1_12bpc_avx2: 104.6
      inv_txfm_add_4x8_flipadst_flipadst_0_12bpc_c: 902.9
      inv_txfm_add_4x8_flipadst_flipadst_0_12bpc_avx2: 116.5
      inv_txfm_add_4x8_flipadst_flipadst_1_12bpc_c: 915.0
      inv_txfm_add_4x8_flipadst_flipadst_1_12bpc_avx2: 116.4
      inv_txfm_add_4x8_flipadst_identity_0_12bpc_c: 618.6
      inv_txfm_add_4x8_flipadst_identity_0_12bpc_avx2: 45.3
      inv_txfm_add_4x8_flipadst_identity_1_12bpc_c: 618.1
      inv_txfm_add_4x8_flipadst_identity_1_12bpc_avx2: 44.0
      inv_txfm_add_4x8_identity_adst_0_12bpc_c: 829.7
      inv_txfm_add_4x8_identity_adst_0_12bpc_avx2: 107.4
      inv_txfm_add_4x8_identity_adst_1_12bpc_c: 831.7
      inv_txfm_add_4x8_identity_adst_1_12bpc_avx2: 107.8
      inv_txfm_add_4x8_identity_dct_0_12bpc_c: 823.2
      inv_txfm_add_4x8_identity_dct_0_12bpc_avx2: 90.7
      inv_txfm_add_4x8_identity_dct_1_12bpc_c: 824.1
      inv_txfm_add_4x8_identity_dct_1_12bpc_avx2: 90.7
      inv_txfm_add_4x8_identity_flipadst_0_12bpc_c: 853.4
      inv_txfm_add_4x8_identity_flipadst_0_12bpc_avx2: 106.8
      inv_txfm_add_4x8_identity_flipadst_1_12bpc_c: 852.2
      inv_txfm_add_4x8_identity_flipadst_1_12bpc_avx2: 106.8
      inv_txfm_add_4x8_identity_identity_0_12bpc_c: 543.2
      inv_txfm_add_4x8_identity_identity_0_12bpc_avx2: 36.4
      inv_txfm_add_4x8_identity_identity_1_12bpc_c: 544.8
      inv_txfm_add_4x8_identity_identity_1_12bpc_avx2: 36.6
      241753f5
  3. Nov 12, 2021
  4. Nov 11, 2021
  5. Nov 10, 2021
  6. Nov 05, 2021
  7. Nov 02, 2021
  8. Nov 01, 2021
  9. Oct 31, 2021
  10. Oct 29, 2021
  11. Oct 28, 2021
    • Martin Storsjö's avatar
      meson: Check for the pthread_getaffinity_np function before deciding to use it · 8c94f95c
      Martin Storsjö authored
      Use the check result instead of hardcoding what OSes have got the
      function.
      
      This also requires checking for the pthread_np.h header and including
      it while testing for functions in meson, but allows getting rid of the
      hardcoded OS conditions in the source.
      
      This fixes building for Android, if _GNU_SOURCE happens to be defined.
      (It gets defined if building with a slightly nonstandard cross file
      that defines "system = 'linux'", but it could also have been set by the
      caller.)
      8c94f95c
  12. Oct 27, 2021
  13. Oct 18, 2021
  14. Oct 06, 2021
Loading