Dear QC engineer,
I'm having trouble loading a Llama2 3B model with QNN on an S23 (Snapdragon 8 Gen 2, SM8550). We are trying to load it from a context binary and get the error messages below. Any pointers on how to proceed?
Thanks!

[+2]:[DEBUG] Calling QNN function: contextCreateFromBinary
[+0]:[ERROR] <E> fastrpc memory map for fd: 45 with length: 450887680 failed with error: 1
[+0]:[ERROR] <E> Failed to map weights buffer to device!
[+0]:[ERROR] <E> Could not allocate persistent weights buffer!
[+0]:[ERROR] <E> Failed to initialize graph memory
[+0]:[ERROR] <E> Failed to initialize graph with id 257 context 4 deviceId 0 coreId 0 pdId 0 with err 1002
[+0]:[ERROR] <E> Context create from binary failed for deviceId 0 coreId 0 pdId 0 err 1002
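In case it helps narrow things down, the first error is the fastrpc mapping of the weights buffer, and the length it reports works out to exactly 430 MiB. Below is a small Python sketch for pulling that figure out of a log. The regular expression is only an assumption based on the one line in the log above (other QNN versions may word it differently), and `failed_mappings` is a hypothetical helper name, not part of the QNN SDK.

```python
import re

# The failing line, copied from the log in this post.
LOG = ("[+0]:[ERROR] <E> fastrpc memory map for fd: 45 "
       "with length: 450887680 failed with error: 1")

# Assumed pattern, derived only from the single log line above.
MAP_FAIL = re.compile(
    r"fastrpc memory map for fd: (\d+) with length: (\d+) failed with error: (\d+)"
)

def failed_mappings(log_text):
    """Return (fd, length_bytes, length_mib, error) for each failed fastrpc map."""
    results = []
    for match in MAP_FAIL.finditer(log_text):
        fd, length, err = (int(group) for group in match.groups())
        results.append((fd, length, length / (1024 * 1024), err))
    return results

for fd, length, mib, err in failed_mappings(LOG):
    print(f"fd={fd} length={length} bytes ({mib:.1f} MiB) error={err}")
```

So the failure is a single mapping of about 430 MiB, not the whole model at once.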
Cannot load a 3B model with QNN
Posted: Fri, 2023-09-29 04:06