Hi,
I am running MobilenetSSD object detection model on GPU on Snapdragin 820 platform. My application is based on "examples/NativeCpp/SampleCode/" sample application provided with SDK. I have converted coco model into a DLC file and running network on GPU. I have also enabled the CPU fallback option as suggested in docs provide with SDK.
Network execution takes around 90ms per frame to execute on GPU which is little high for the usecase I want to achieve. Is there any way to improve the performance on GPU for MobilenetSSD model ? Can I have multiple instance of SNPE, have multiple threads running snpe execution simultaneously ? Does it help improve the performance ?
Any help, pointers would be appreciated.
Thanks,
Shabbir