Research Article

Effective SIMD Vectorization for Intel Xeon Phi Coprocessors

Algorithm 3

Small matrix multiplication summation.
real, dimension(4,4):: A, B, C
real sum
integer j, l, i
do j = 1, 4
do l = 1, 4
sum = 0.0
do i = 1, 4
sum = sum + A(i,l)   B(i,j)
enddo
C(l,j) = sum
enddo
enddo