Small optimizations in prep_neon function
Changed ldp to ld1, since they have 2x throughput in equal conditions according to arm docs Before Cortex A53 A73 21.028 fps 62.67 fps After 20.99 fps 63.25 fps
Loading
Please register or sign in to comment