OpenCL Basics: Running multiple kernels in OpenCL

Reading Time: 1 minute

This series “Basic concepts” is based on GPGPU-questions we get via email more than once, or when the question is not clearly explained in the books. For one it is obvious, for the other just what they’re missing.

They say that learning a new technique is best done by playing around with working code and then try to combine it. The idea is that when you have Stackoverflowed and Githubed code together, you’ve created so many bugs by design that you’ll learn a lot if you make it work. When applying this to OpenCL, you quickly get to a situation that you want to run one.cl file and then another.cl file. Almost all beginner’s material discuss a single OpenCL-file, so how to do this elegantly?

The answer is in clCreateProgramWithSource, which takes one or more kernels. If multiple cl-files are compiled (or one big concatenated file), one Program-object is created. But how to use this?

We first need to talk about public and private as we know from C++, as this is in OpenCL cl files too. From the perspective of the host the ‘__kernel void‘ functions are public and the normal functions are private. After a Kernel-object for each kernel-function is created, all kernel-functions are callable. Now you can get an output C by running kernel1(A, B) and then kernel2(B, C).

 

Related Posts

nvidia logo

Academic hackatons for Nvidia GPUs

...  left where you can work on any language, from OpenMP to OpenCL and from OpenACC to CUDA. Python + CUDA-libraries is also no problem, ...

IWOCL-logo

IWOCL 2019

...  that might be spontaneously organized. This is the biggest OpenCL-focused event that discusses everything that would make ...

cuda

Question: do we work with CUDA?

...  Yes, actually a lot!The company was built on OpenCL and we are still work with the language a lot - from embedded GPUs and FPGAs to ...

screenshot-twitter

The 12 latest Twitter Poll Results of 2018

...  votes for Nvidia.Out of scope: SYCL is the answer from OpenCL to make a real alternative to CUDA.Intel discontinues ...