arm64: Add NEON implementation of fgy_32x32xn
Relative speedup over C code:
                       Cortex A53   A72   A73   Apple M1
fgy_32x32xn_8bpc_neon:       4.48  2.84  3.73       5.64
Showing 5 changed files:
- src/arm/64/film_grain.S: 299 additions, 0 deletions
- src/arm/film_grain_init_tmpl.c: 133 additions, 0 deletions
- src/film_grain.h: 1 addition, 0 deletions
- src/film_grain_tmpl.c: 5 additions, 1 deletion
- src/meson.build: 2 additions, 0 deletions