From BoostCon 2011 -- Bulk Synchronous Parallelism using Hybrid MPI+OpenMP
An interesting approach to implementing structured parallel algorithms in C++ presented today by Joel Falcou. The performance appears to be close to hand-coded algorithms and it should be reasonably easy to use. However, it currently needs at an external analysis stage based on a plug-in to clang.
Code is available at: https://github.com/jfalcou/bsppp