aarch64: mc16: Optimize the BTI landing pads in put/prep_neon

Don't include the BTI landing pad instruction in the loops.
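If built with BTI enabled, AARCH64_VALID_JUMP_TARGET expands to a no-op instruction that marks a location where indirect jumps are allowed to land. Only the initial indirect jump into each block needs it; the loop back-edges are direct branches, which BTI doesn't check, so there's no need for the loops to execute that instruction on every iteration.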
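As a rough sketch of the pattern (not the actual diff; the label numbers, registers and loop body here are made up for illustration), the landing pad gets its own label for the jump table, and the loop branches back to a second label placed right after it:

```asm
        // Before: the landing pad is executed on every iteration.
160:
        AARCH64_VALID_JUMP_TARGET
        ld1             {v0.8h, v1.8h}, [x2], x3
        subs            w5,  w5,  #1
        st1             {v0.8h, v1.8h}, [x0], x1
        b.gt            160b
        ret

        // After: the indirect jump still lands on 160, but the loop
        // branches back to 16, skipping the landing pad.
160:
        AARCH64_VALID_JUMP_TARGET
16:
        ld1             {v0.8h, v1.8h}, [x2], x3
        subs            w5,  w5,  #1
        st1             {v0.8h, v1.8h}, [x0], x1
        b.gt            16b
        ret
```

The jump table entries keep pointing at the label that carries the landing pad, so behaviour with BTI enabled is unchanged; the saving is one instruction per loop iteration.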