I am trying to translate C code to inline assembly to check whether I get a (timing) performance improvement.
However, I am having problems to use my variables from C code into inluine assembly code.
For example, I have:
u8x8_out = vqmovn_u16(u16x8_tmp);
and the following assembly code:
And the error message I get is:
Error: Neon quad precision register expected -- `vqmovn.u16 d22,d22'
I have seen other examples but they always seem to show "r" inputs/outputs. Is it that I always need load data via registers? Or though an intermediate quadword/doubleword?
Thanks in advance for the help.