There is a SIMD vectorization opportunity within a multi-threaded region.
SIMD vectorization is performed at the lowest level in hardware and is usually compatible with higher forms of parallelization. In this case, it could potentially be used to further increase the performance of a multi-threaded computation.
Add vectorization directives to instruct the compiler to vectorize the loop
Building performance into the code from day one with Codee