Research Article

Implementing and Evaluating an Heterogeneous, Scalable, Tridiagonal Linear System Solver with OpenCL to Target FPGAs, GPUs, and CPUs

Table 5

Maximum observed average (n = 16) compute performance and estimated energy efficiency of oclspkt for devices and heterogeneous combinations.

DeviceCompute performanceEstimated energy efficiency

CPU2791.39
GPU50112.9
FPGA28028.6
CPU + GPU3961.67
CPU + FPGA3651.81
GPU + FPGA69115.6
CPU + GPU + FPGA4312.06