Publications & Events
Chapter 2 - Numerical Weather Prediction Optimization High Performance Parallelism Pearls Volume Two: Multicore and Many-core Programming Approaches
Abstract
This chapter describes the efforts to improve performance of the widely used WSM6 microphysics package used in numerical weather prediction via a variety of techniques including threading, vectorization, array alignment, improving data locality, and optional use of compile-time constants for loop and array index bounds. Code examples are included to illustrate each technique along with its performance benefits on Intel Xeon processors and Intel Xeon Phi coprocessors. Use of tools such as the Intel Thread Inspector to speed the performance tuning process are also described.

