Altera has just released their optimisation guide for OpenCL-on-FPGAs. It does not go into the howto’s of OpenCL, but assumes you have knowledge of the technology. Niether does it provide any information on the basics of Altera’s Stratix V or other FPGA.
It is the first public optimisation document, so it is appreciated to send feedback directly. Not aware what OpenCL can do on an FPGA? Watch the below video.
Subjects
The following subjects and optimisation tricks are discussed:
- FPGA Overview
- Pipelines
- Good Design Practices
- Avoid Pointer Aliasing
- Avoid Expensive Functions
- Avoid Work-Item ID-Dependent Backward Branching
- Aligned Memory Allocation
- Ensure 4-Byte Alignment for All Data Structures
- Maintain Similar Structures for Vector Type Elements
- Optimization of Data Processing Efficiency
- Specify a Maximum Work-Group Size or a Required Work-Group Size
- Loop Unrolling
- Resource Sharing
- Kernel Vectorization
- Multiple Compute Units
- Combination of Compute Unit Replication and Kernel SIMD Vectorization
- Resource-Driven Optimization
- Floating-Point Operations
- Optimization of Memory Access Efficiency
- General Guidelines on Optimizing Memory Accesses
- Optimize Global Memory Accesses
- Perform Kernel Computations Using Constant, Local or Private Memory
- Single Work-Item Execution
Carefully compare these with CPU and GPU optimisation guides to be able to write more generic OpenCL code.
Download
You can download the document here.
If you have any question on OpenCL-on-FPGAs, OpenCL, generic optimisations or Altera FPGAs, feel welcomed to contact us.
Related Posts
Improving FinanceBench for GPUs Part II – low hanging fruit
... original repository. They’ve even invited us to update their repository too. For now it will be on our repository only. We also ...
The Art of Benchmarking
... series, which will be linked at the end of the post when published. The questions to ask Even when it depends on various ...
Birthday present! Free 1-day Online GPGPU crash course: CUDA / HIP / OpenCL
... goal is hands on. We'll provide templates for CUDA/HIP and OpenCL, where you focus on making the core compute part. The third goal ...
Problem solving tactic: making black boxes smaller
We are a problem solving company first, specialised in HPC - building software close to the processor. The more projects we finish, the more it's clea ...