Forums - QML run faster than arm-neon-vfp?

1 post / 0 new
QML run faster than arm-neon-vfp?
Join Date: 19 Jun 17
Posts: 1
Posted: Wed, 2019-02-27 20:42

Hi, QML Team,

We work to optimize LSTM algorithm on Android devices.

We have some parallel operators with arm-neon-vfp intrinsics like 'vmull_s8' and 'vaddq_f32' ...

Now, we want to refactor with QML. The premise is  the operators with QML have better performance than arm-neon.

So could you introduce more details about QML parallel implementations

if QML run faster than common SIMD instructions on SnapDragon platform.


  • Up0
  • Down0

Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries (“Qualcomm”). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.