There are worlds where performance matters. For example, the HPC world: faster software means less waiting time for scientists. The embedded world: faster software means we can use cheaper silicon to build our product. The game world: faster software means that our game will run on slower CPUs, thus making our game more interesting to people with […]
OpenMP
Case Study: How we made the Canny edge detector run faster? (part 2)
In the previous post we talked about the performance improvement we obtained for the Canny edge detection algorithm. Most of the changes we made there focused on hot loops: making memory accesses sequential and making the loops longer, which in turn allowed the compiler to autovectorize them. This made our program run significantly faster […]
Parallelizing a shallow-water flow model with variable pressure
Executive Summary: A team from the University of Malaga, Spain, attended the second Appentra-led hackathon, CESGAHACK-18, and used Appentra’s unique parallelization methodology to parallelize their shallow-water model, which models salinity concentrations and temperature to understand natural flows. The model uses a computational approach that is faster than the direct simulation of flows using […]