Results for: Jay Yun
Matrix Multiply on Adreno GPUs – Part 2: Host Code and Kernel
This is the second and final part of a guest post by Vladislav Shimanskiy, one of our Adreno™ engineers. His previous post explained the concepts behind an optimized implementation of device-side...
https://developer.qualcomm.com/blog/matrix-multiply-adreno-gpus-part-2-host-code-and-kernel
Matrix Multiply on Adreno GPUs – Part 1: OpenCL Optimization
The matrix multiply (MM) operation has become very popular on GPUs thanks to recent interest in deep learning, which depends on convolutions. We’ve been hearing from developers who want to accelerate...
https://developer.qualcomm.com/blog/matrix-multiply-adreno-gpus-part-1-opencl-optimization
Better OpenCL Performance on Qualcomm Adreno GPU – Memory Optimization
When you want to accelerate an image processing or computer vision application using a graphics processing unit (GPU) like Adreno™, OpenCL™ offers a convenient way to write your program since it’s...
https://developer.qualcomm.com/blog/better-opencl-performance-qualcomm-adreno-gpu-memory-optimization