In the context of SNPE and QNN, is there a difference between the HTP and the DSP on Hexagon architectures v68 and later?
I ask because in the UDO code provided by the SNPE SDK, the DSP_v68 implementation of Conv2D includes HTP header files:
#include "HTP/core/simple_reg.h"
Another question:
When running snpe-dlc-quantize with the --enable_htp flag, I noticed that it quantizes the model to 16-bit rather than 8-bit. Is this expected behavior?