Hi.
I'm trying to run my neural network with different runtimes on SD870, but I found that the performance for gpu is faster than AIP runtime (76709us vs 186290us). I have quantized my network and enable_hta.I think the benchmark result is not correct, but I don't know why(No errors were reported when I run benchmark).
What can i do with this?