
Texas Instruments (TI) has a fully conformant OpenCL 1.1 implementation.

The table below is taken from http://downloads.ti.com/mctools/esd/docs/opencl/intro.html and shows which DSP-equipped SoCs have OpenCL support.

| SoC    | System     | Khronos Conformance           | Installation Instructions                             |
|--------|------------|-------------------------------|-------------------------------------------------------|
| AM572  | AM572 EVM  | OpenCL v1.1 Conformant        | Processor SDK for AM57x                               |
| DRA75x | DRA75x EVM | OpenCL v1.1 Conformant        | Processor SDK for DRA7x (Enabling OpenCL on DRA75x)   |
| AM571  | AM572 EVM  | OpenCL v1.1 Conformant        | Processor SDK for AM57x                               |
| 66AK2H | 66AK2H EVM | OpenCL v1.1 Conformant        | Processor SDK for K2H                                 |
| 66AK2L | 66AK2L EVM | Not submitted for conformance | Processor SDK for K2L                                 |
| 66AK2E | 66AK2E EVM | Not submitted for conformance | Processor SDK for K2E                                 |
| 66AK2G | 66AK2G EVM | Not submitted for conformance | Processor SDK for K2G                                 |

Theoretical Performance of the C66x

  • Fixed point 16×16 MACs per cycle: 32
  • Fixed point 32×32 MACs per cycle: 8
  • Floating point single precision MACs per cycle: 8
  • Arithmetic floating point operations per cycle: 16 (2-way SIMD on the .L and .S units, i.e. 8 SP operations across the A and B sides, plus 4 SP multiplies on each .M unit, i.e. another 8 SP operations across the A and B sides)
  • Load/store width: 2 x 64-bit
  • Vector size (SIMD capability): 128-bit (4 x 32-bit, 4 x 16-bit, 4 x 8-bit)

GFLOPs

2 FLOPs – 2-way SIMD on .L1 (A side) such as DADDSP or DSUBSP
2 FLOPs – 2-way SIMD on .L2 (B side) such as DADDSP or DSUBSP
2 FLOPs – 2-way SIMD on .S1 (A side) such as DADDSP or DSUBSP
2 FLOPs – 2-way SIMD on .S2 (B side) such as DADDSP or DSUBSP
4 FLOPs – 4-way SIMD on .M1 (A side) such as QMPYSP (or CMPYSP, maybe not 4-way SIMD)
4 FLOPs – 4-way SIMD on .M2 (B side) such as QMPYSP (or CMPYSP, maybe not 4-way SIMD)
========================
16 FLOPs total per cycle per C66x CorePac (source)
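
The tally above can be reproduced in a few lines of Python. This is only a sketch: the dictionary restates the per-unit figures from the list, and the helper names (`flops_per_cycle`, `flops_from_macs`) are made up for illustration. It also cross-checks the total against the 8 single precision MACs per cycle quoted earlier, assuming the usual convention that one multiply-accumulate counts as two floating point operations.

```python
# FLOPs issued per cycle by each C66x functional unit, as listed above.
FLOPS_PER_UNIT = {
    ".L1": 2,  # 2-way SIMD add/subtract on the A side (e.g. DADDSP, DSUBSP)
    ".L2": 2,  # 2-way SIMD add/subtract on the B side
    ".S1": 2,  # 2-way SIMD add/subtract on the A side
    ".S2": 2,  # 2-way SIMD add/subtract on the B side
    ".M1": 4,  # 4-way SIMD multiply on the A side (e.g. QMPYSP)
    ".M2": 4,  # 4-way SIMD multiply on the B side
}

# Single precision MACs per cycle, from the theoretical performance list.
SP_MACS_PER_CYCLE = 8


def flops_per_cycle(units):
    """Sum the per-unit FLOPs to get the per-core total per cycle."""
    return sum(units.values())


def flops_from_macs(macs):
    """Assumed convention: one MAC is one multiply plus one add."""
    return 2 * macs


if __name__ == "__main__":
    total = flops_per_cycle(FLOPS_PER_UNIT)
    print("FLOPs per cycle per CorePac:", total)
    print("Matches MAC figure:", total == flops_from_macs(SP_MACS_PER_CYCLE))
```

Both routes arrive at the same per-cycle figure: the .M units supply the multiplies and the .L and .S units supply the additions.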

Boards

A good starter board is the BeagleBoard-X15, which comes with OpenCL drivers. It has two C66x DSPs and two 1.5 GHz ARM Cortex-A15 cores.
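
To turn the per-cycle figure into a peak GFLOPS estimate for such a board, multiply it by the number of DSP cores and the DSP clock. The sketch below does that. Note that the DSP clock is not stated in this article (the 1.5 GHz figure applies to the ARM cores), so the 0.75 GHz used here is an assumption that should be checked against the SoC datasheet, and `peak_sp_gflops` is a hypothetical helper name.

```python
# Per-core figure from the C66x tally earlier in the article.
FLOPS_PER_CYCLE_PER_CORE = 16

# Number of C66x DSPs on the board, as stated above.
DSP_CORES = 2

# ASSUMPTION: DSP clock in GHz. Not given in the article; verify it
# against the datasheet of the SoC on your board before relying on it.
ASSUMED_DSP_CLOCK_GHZ = 0.75


def peak_sp_gflops(cores, clock_ghz, flops_per_cycle=FLOPS_PER_CYCLE_PER_CORE):
    """Theoretical single precision peak: cores x clock (GHz) x FLOPs/cycle."""
    return cores * clock_ghz * flops_per_cycle


if __name__ == "__main__":
    peak = peak_sp_gflops(DSP_CORES, ASSUMED_DSP_CLOCK_GHZ)
    print(f"Theoretical DSP peak: {peak} GFLOPS (single precision)")
```

This is an upper bound that assumes every functional unit issues a SIMD instruction on every cycle; real kernels will land below it.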

