MPI is a directory of C++ programs which illustrate the use of the Message Passing Interface for parallel programming.
MPI is a library of message passing routines. The library allows a user to write a program in a familiar language, such as C, C++, FORTRAN77 or FORTRAN90, and carry out a computation in parallel on an arbitrary number of cooperating computers.
A remarkable feature of MPI is that the user writes a single program which runs on all the computers. However, because each computer is assigned a unique identifying number, it is possible for different actions to occur on different machines, even though they run the same program:
if ( I am processor A ) then
add a bunch of numbers
else if ( I am processor B ) then
multipy a matrix times a vector
end
Another feature of MPI is that the data stored on each computer is entirely separate from that stored on other computers. If one computer needs data from another, or wants to send a particular value to all the other computers, it must explicitly call the appropriate library routine requesting a data transfer. Depending on the library routine called, it may be necessary for both sender and receiver to be "on the line" at the same time (which means that one will probably have to wait for the other to show up), or it is possible for the sender to send the message to a buffer, for later delivery, allowing the sender to proceed immediately to further computation.
Here is a simple example of what a piece of the program would look like, in which the number X is presumed to have been computed by processor A and needed by processor B:
if ( I am processor A ) then
call MPI_Send ( X )
else if ( I am processor B ) then
call MPI_Recv ( X )
end
Often, an MPI program is written so that one computer supervises the work, creating data, issuing it to the worker computers, and gathering and printing the results at the end. Other models are also possible.
It should be clear that a program using MPI to execute in parallel will look much different from a corresponding sequential version. The user must divide the problem data among the different processes, rewrite the algorithm to divide up work among the processes, and add explicit calls to transfer values as needed from the process where a data item "lives" to a process that needs that value.
A C++ program, subroutine or function that calls any MPI function, or uses an MPI-defined variable, must include the line
include "mpi.h"
so that the types of the MPI variables are defined.
You probably compile and link your program with a single command, as in
g++ myprog.C
Depending on the computer that you are using, you may be able
to compile an MPI program with a similar command, which automatically
locates the include file and the compiled libraries that you will
need. This command is likely to be:
mpiCC myprog.c
Some systems allow users to run an MPI program interactively. You do this with the mpirun command:
mpirun -np 4 a.out
This command requests that the executable program a.out
be run, right now, using 4 processors.
The mpirun command may be a convenience for beginners, with very small jobs, but this is not the way to go once you have a large lengthy program to run! Also, what actually happens can vary from machine to machine. When you ask for 4 processors, for instance,
The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.
HEAT_MPI is a C++ program which solves the 1D Time Dependent Heat Equation using MPI.
LAMMPS is a FORTRAN77 program which uses MPI in molecular dynamics simulation. A serial version can be created by linking the program with the MPI_STUBS library.
METIS is a C library which can partition the nodes of a graph or the elements of a finite element mesh in a way suitable for further treatment by parallel processing.
MPI examples are available in a C version and a C++ version and a FORTRAN77 version and a FORTRAN90 version.
MPI_INTRODUCTION is a one page introduction to MPI.
MPI_MORE_INFO contains a list of references, web sites, examples and tutorials on MPI.
MPI_STUBS is a C++ library which contains "stub" MPI routines which allow a user to compile, load, and possibly run an MPI program on a serial machine.
MPI_SYSX is a directory of PBS scripts for running C++ programs using MPI on System X.
OPEN_MP is a simple system for parallel programming which relies on the use of shared memory.
PETSC is a scientific programming library for parallel programming, which requires MPI in order to run.
PRIME_NUMBER_MPI is a C++ program which counts the number of primes between 1 and N, using MPI for parallel execution.
PTHREADS is a set of C examples which illustrate the use of the POSIX thread library to carry out parallel program execution.
QUAD_MPI is a C++ program which approximates an integral using a quadrature rule, and carries out the computation in parallel using MPI.
RANDOM_MPI, a C++ program which demonstrates one way to generate the same sequence of random numbers for both sequential execution and parallel execution under MPI.
SATISFIABILITY_MPI is a C++ program which demonstrates, for a particular circuit, an exhaustive search for solutions of the circuit satisfiability problem, using MPI to carry out the calculation in parallel.
SGE is the Sun Grid Engine, a remote queueing system.
BONES passes a vector of real data from one process to another. It was used as an example in an introductory MPI workshop.
BUFFON_LAPLACE demonstrates how parallel Monte Carlo processes can set up distinct random number streams.
DAY1_EX3 works out exercise #3 assigned after day 1 of a workshop on MPI. The instructions were to have process 1 generate some integers, send them to process 3 which used some of those values to generate some real numbers which were then sent back to process 1.
HELLO simply has each process say hello:
INTERVALS estimates an integral by dividing an interval into subintervals, and having the servant processes estimate the integral over each subinterval.
MATVEC computes a matrix-vector product c = A * b, giving each process a copy of the vector b, and using self-scheduling to let any process have the next row of A to work on when it is ready. Arrays are allocated dynamically. The "math.h" include file is needed, as is the run-time math library.
MONTE CARLO computes PI by the Monte Carlo method, testing whether points in the unit square are in the unit circle.
POISSON_SERIAL solves the Poisson equation on a 2D grid. This version of the program does not use MPI, and is provided for comparison.
QUADRATURE integrates a function f(x) over an interval;
SEARCH searches a list of numbers for all occurrences of a target value.
SUM adds a list of numbers.
TASK_DIVISION is a serial code that simply suggests how T tasks could automatically and evenly be divided among P processors.
TYPE sets up a user-defined datatype, and sends and receives data in this form.
You can go up one level to the C++ source codes.