Skip to content
Snippets Groups Projects

WIP: Perform cdef on ~64x64 chunks.

Closed Kyle Siefring requested to merge KyleSiefring/dav1d:cdef_scratch_review into master

Merge request reports

Pipeline #6843 passed

Pipeline passed for 67375f8b on KyleSiefring:cdef_scratch_review

Approval is optional

Closed by Jean-Baptiste KempfJean-Baptiste Kempf 5 years ago (Apr 7, 2020 5:33pm UTC)

Merge details

  • The changes were not merged into 3.0.
  • Auto-merge enabled

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
52 db 1, 2, 1, 2, 0, -1, 0, 0
53 db 1, 2, 1, 2, 1, 2, 0, 0
54 db 1, 2, 0, 1, 1, 2, 0, 0
55 db 1, 2, 0, 0, 1, 2, 0, 0
56 db 0, 1, 0, -1, 1, 2, 0, 0
57 db 0, 0, 1, 2, 1, 2, 0, 0
58 db 0, -1, 1, 2, 0, 1, 0, 0
59 pw_128: times 2 dw 128
60 pw_2048: times 2 dw 2048
58 61
59 62 SECTION .text
60 63
64 %macro cdef_setup_fn 1 ; w
65 INIT_YMM avx2
66 ; TODO: correct number of registers used
67 cglobal cdef_setup_%1xh, 4, 15, 16, -4, \
  • Kyle Siefring added 1 commit

    added 1 commit

    • d2c99ea0 - Perform cdef on ~64x64 chunks

    Compare with previous version

  • Kyle Siefring added 1 commit

    added 1 commit

    • 275f253a - Perform cdef on ~64x64 chunks

    Compare with previous version

  • Kyle Siefring added 5 commits

    added 5 commits

    Compare with previous version

  • Mixed results:

    decoder dav1d dav1d
    Build 8bbcd3f7 69cc839a
    Build date 2019-04-08 2019-04-08
    ISA AVX2 AVX2
    Morocco MT 172,41 171,30 99,4%
    Morocco MT 172,13 170,93 99,3%
    Dua Lipa MT 142,86 143,50 100,5%
    Dua Lipa MT 142,71 143,14 100,3%
    Chimera MT 192,76 194,05 100,7%
    Chimera MT 192,17 193,87 100,9%
    Edited by Ewout ter Hoeven
  • Kyle Siefring added 1 commit

    added 1 commit

    • 807d4208 - Perform cdef on ~64x64 chunks

    Compare with previous version

  • Kyle Siefring added 1 commit

    added 1 commit

    • 3a05ac5e - Perform cdef on ~64x64 chunks

    Compare with previous version

  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Please register or sign in to reply
    Loading