Parallel Programming with MPI

Masao Fujinaga
Academic Information and Communication Technology
University of Alberta

Message Passing
• Parallel computation occurs through a number of processes, each with its own local data
• Sharing of data is achieved by message passing, i.e. by explicitly sending and receiving data between processes

What is MPI?
• MPI
  – Specified by a committee of experts from research and industry
  – Standard message-passing specification for all the Massively Parallel Processor (MPP) vendors involved

A simple MPI program
• Fortran

      INCLUDE 'mpif.h'
      INTEGER error, rank
      CALL MPI_Init(error)
      CALL MPI_Comm_rank(MPI_COMM_WORLD, rank, error)
      PRINT *, "Hello world from ", rank
      CALL MPI_Finalize(error)
      STOP
      END

• C

      #include <mpi.h>
      #include <stdio.h>
      int main(int argc, char *argv[])
      {
          int rank;
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          printf("Hello world from %d\n", rank);
          MPI_Finalize();
          return 0;
      }

Compiling and running (IBM POE):

      mpxlf_r -o hello hello.f
      ./hello -procs 4
      Hello world from 0
      Hello world from 2
      Hello world from 1
      Hello world from 3
      ./hello -procs 4
      Hello world from 0
      Hello world from 1
      Hello world from 2
      Hello world from 3

Note that the order of the output can vary from run to run.

Serial program

      do i = 1, n
         y(i) = x(i)**2.3
      enddo

Master-slave program

      call MPI_Comm_rank(MPI_COMM_WORLD, rank, error)
      call MPI_Comm_size(MPI_COMM_WORLD, size, error)

[Diagram: the data array divided among Processor 0 through Processor 3]

      if(rank .eq. 0)then
         ! master code:
         !   send data to slaves
         !   calculate its share of results
         !   receive results from slaves
      else
         ! slave code:
         !   receive data from master
         !   calculate results
         !   send results to master
      endif

MPI_Send/MPI_Recv
• MPI_Send

      MPI_Send(buf, count, type, dest, tag, comm, ierr)

• MPI_Recv

      MPI_Recv(buf, count, type, source, tag, comm, status, ierr)

MPI Data Type          Fortran Data Type
MPI_INTEGER            integer
MPI_REAL               real
MPI_DOUBLE_PRECISION   double precision
MPI_COMPLEX            complex
MPI_CHARACTER          character(1)
MPI_LOGICAL            logical
MPI_BYTE               (none)
MPI_PACKED             (none)

Wildcards
• MPI_ANY_SOURCE, MPI_ANY_TAG
• Fortran: status(MPI_SOURCE), status(MPI_TAG), status(MPI_ERROR)
• C: status.MPI_SOURCE, status.MPI_TAG, status.MPI_ERROR
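A short C sketch of how the wildcards and the status fields fit together; the payload and tag values here are made up for illustration:

      #include <mpi.h>
      #include <stdio.h>
      int main(int argc, char *argv[])
      {
          int rank, size, n, i;
          MPI_Status status;
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          MPI_Comm_size(MPI_COMM_WORLD, &size);
          if (rank == 0) {
              /* accept messages from the workers in whatever order they arrive */
              for (i = 1; i < size; i++) {
                  MPI_Recv(&n, 1, MPI_INT, MPI_ANY_SOURCE, MPI_ANY_TAG,
                           MPI_COMM_WORLD, &status);
                  printf("got %d from rank %d with tag %d\n",
                         n, status.MPI_SOURCE, status.MPI_TAG);
              }
          } else {
              n = rank * 10;                    /* arbitrary payload */
              MPI_Send(&n, 1, MPI_INT, 0, rank, MPI_COMM_WORLD);
          }
          MPI_Finalize();
          return 0;
      }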

MPI Data Type        C Data Type
MPI_INT              int
MPI_LONG             long
MPI_FLOAT            float
MPI_DOUBLE           double
MPI_UNSIGNED_CHAR    unsigned char
MPI_UNSIGNED_SHORT   unsigned short
MPI_UNSIGNED         unsigned int
MPI_UNSIGNED_LONG    unsigned long

      integer status(MPI_STATUS_SIZE)
      call MPI_Comm_rank(MPI_COMM_WORLD, rank, error)
      call MPI_Comm_size(MPI_COMM_WORLD, size, error)
      npart = nmax/size
      if(rank .eq. 0)then
         ! master: send each slave its share of x
         do iproc = 1, size-1
            index = iproc*npart+1
            call MPI_Send(x(index), npart, MPI_REAL, iproc, 1,
     &                    MPI_COMM_WORLD, error)
         enddo
         ! compute the master's own share
         do i = 1, npart
            y(i) = x(i)**2.3
         enddo
         ! collect the slaves' results
         do iproc = 1, size-1
            index = iproc*npart+1
            call MPI_Recv(y(index), npart, MPI_REAL, iproc, 2,
     &                    MPI_COMM_WORLD, status, error)
         enddo
      else
         ! slave: receive data, compute, return the results
         call MPI_Recv(x, npart, MPI_REAL, 0, 1, MPI_COMM_WORLD,
     &                 status, error)
         do i = 1, npart
            y(i) = x(i)**2.3
         enddo
         call MPI_Send(y, npart, MPI_REAL, 0, 2, MPI_COMM_WORLD,
     &                 error)
      endif

Basic commands
• Include file
• MPI_Init
• MPI_Comm_rank
• MPI_Comm_size
• MPI_Send
• MPI_Recv
• MPI_Finalize

A simpler way

      call MPI_Scatter(x, npart, MPI_REAL, x, npart, MPI_REAL,
     &                 0, MPI_COMM_WORLD, error)
      do i = 1, npart
         y(i) = x(i)**2.3
      enddo
      call MPI_Gather(y, npart, MPI_REAL, y, npart, MPI_REAL,
     &                0, MPI_COMM_WORLD, error)
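The example above reuses x and y as both the send and the receive buffer, which the MPI standard does not strictly allow; a C sketch with separate buffers, using an illustrative array size, might look like this:

      #include <mpi.h>
      #include <math.h>
      #define NMAX 1024
      int main(int argc, char *argv[])
      {
          int rank, size, npart, i;
          float x[NMAX], xpart[NMAX], ypart[NMAX], y[NMAX];
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          MPI_Comm_size(MPI_COMM_WORLD, &size);
          npart = NMAX / size;             /* assumes size divides NMAX */
          if (rank == 0)
              for (i = 0; i < NMAX; i++) x[i] = (float)i;
          /* hand npart elements of x to every process ... */
          MPI_Scatter(x, npart, MPI_FLOAT, xpart, npart, MPI_FLOAT,
                      0, MPI_COMM_WORLD);
          for (i = 0; i < npart; i++)
              ypart[i] = powf(xpart[i], 2.3f);
          /* ... and collect the partial results back on rank 0 */
          MPI_Gather(ypart, npart, MPI_FLOAT, y, npart, MPI_FLOAT,
                     0, MPI_COMM_WORLD);
          MPI_Finalize();
          return 0;
      }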

Broadcast

      MPI_Bcast(buffer, count, type, root, comm, ierr)

[Diagram: the buffer on Processor 0 is copied to Processors 1, 2, and 3]
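For example, a minimal C program that broadcasts a single integer from rank 0 (the value 100 is arbitrary):

      #include <mpi.h>
      #include <stdio.h>
      int main(int argc, char *argv[])
      {
          int rank, n = 0;
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          if (rank == 0) n = 100;          /* only the root has the value */
          /* after the call, every rank's n equals the root's n */
          MPI_Bcast(&n, 1, MPI_INT, 0, MPI_COMM_WORLD);
          printf("rank %d has n = %d\n", rank, n);
          MPI_Finalize();
          return 0;
      }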

Reduction

      integer subsum, sum, x(n)
      subsum = 0
      do i = 1, n
         subsum = subsum + x(i)
      enddo
      call MPI_Reduce(subsum, sum, 1, MPI_INTEGER, MPI_SUM, 0,
     &                MPI_COMM_WORLD, ierr)

Performance
• For best performance, minimize communication.
  – Minimize the amount of data transferred and the number of calls to message-passing routines.
• Next best thing: minimize communication time relative to computation time.
  – Overlap communication with calculation, as in the sketch below.
• Avoid synchronization steps.
• Make sure that all processes are busy (load balancing).
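One way to overlap communication with calculation is the nonblocking pair MPI_Isend/MPI_Irecv; the ring exchange below is an illustrative sketch, not an example from the slides:

      #include <mpi.h>
      #define N 1000
      int main(int argc, char *argv[])
      {
          int rank, size, right, left, i;
          float work[N], recvval = 0.0f, sum = 0.0f;
          MPI_Request req[2];
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          MPI_Comm_size(MPI_COMM_WORLD, &size);
          right = (rank + 1) % size;          /* neighbours on a ring */
          left  = (rank - 1 + size) % size;
          for (i = 0; i < N; i++) work[i] = (float)i;
          /* post the receive and the send first ... */
          MPI_Irecv(&recvval, 1, MPI_FLOAT, left, 1, MPI_COMM_WORLD, &req[0]);
          MPI_Isend(&work[0], 1, MPI_FLOAT, right, 1, MPI_COMM_WORLD, &req[1]);
          /* ... then do work that does not need the incoming data */
          for (i = 0; i < N; i++) sum += work[i] * work[i];
          /* wait only when the communication result is actually needed */
          MPI_Waitall(2, req, MPI_STATUSES_IGNORE);
          sum += recvval;
          MPI_Finalize();
          return 0;
      }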

Performance analysis with IBM MPI/MPE
• Link with the MPE library:

      mpxlf_r -o executable source.f -L/usr/global/ibm/mpe2-32/lib -lmpe_f2cmpi -llmpe -lmpe

• Reset PATH:

      setenv PATH /usr/global/ibm/mpe2-32/bin:$PATH

• Run the program:

      executable -procs 4

• View the results:

      jumpshot Unknown.clog2

  or

      clog2TOslog2 Unknown.clog2
      jumpshot Unknown.slog2

Jumpshot
[Screenshot: Jumpshot display of the MPE logfile]

MPI_Wtime()
• Returns elapsed (wall-clock) time on the calling processor
  – Time in seconds since an arbitrary time in the past

      real*8 time
      time = MPI_Wtime()
      ! ... calculation ...
      write(*,*) ' elapsed time =', MPI_Wtime() - time

• The predefined attribute MPI_WTIME_IS_GLOBAL is true when MPI_Wtime is synchronized across all processes in MPI_COMM_WORLD
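MPI_WTIME_IS_GLOBAL can be queried as a communicator attribute; a C sketch using the MPI-2 call MPI_Comm_get_attr (older codes used MPI_Attr_get):

      #include <mpi.h>
      #include <stdio.h>
      int main(int argc, char *argv[])
      {
          int *global, flag;
          MPI_Init(&argc, &argv);
          /* the attribute value comes back as a pointer to an int */
          MPI_Comm_get_attr(MPI_COMM_WORLD, MPI_WTIME_IS_GLOBAL,
                            &global, &flag);
          if (flag && *global)
              printf("MPI_Wtime is synchronized across all processes\n");
          else
              printf("clocks are local to each process\n");
          MPI_Finalize();
          return 0;
      }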

Profiling
• mpxlf_r -o myProg myProg.f -pg -g
• ./myProg -procs 4
• xprofiler myProg gmon*out

Performance analysis with MPICH
• Use the "-mpilog" flag during compilation:

      mpif77 -o executable source.f -mpilog

• Set MPE_LOG_FORMAT:

      setenv MPE_LOG_FORMAT ALOG

• Run:

      mpirun -np 4 executable

• Use logviewer to analyze the logfile:

      logviewer executable.alog

Blocking and Completion
• MPI_Send and MPI_Recv block the calling process, i.e. they do not return until the communication operation is complete.
• MPI_Recv is complete when the message has been copied into the receive buffer.
• MPI_Send is complete when the message has been handed off to MPI, i.e. when the send buffer is safe to reuse; the message may not have been delivered yet.

Deadlock
• When two or more blocked processes are waiting for each other and cannot make progress.

      ! both ranks block in MPI_Recv and never reach their MPI_Send
      if( rank .eq. 0 )then
         call MPI_Recv(x, 10, MPI_REAL, 1, 1, MPI_COMM_WORLD,
     &                 status, ierr)
         call MPI_Send(y, 10, MPI_REAL, 1, 2, MPI_COMM_WORLD, ierr)
      else if ( rank .eq. 1 )then
         call MPI_Recv(y, 10, MPI_REAL, 0, 2, MPI_COMM_WORLD,
     &                 status, ierr)
         call MPI_Send(x, 10, MPI_REAL, 0, 1, MPI_COMM_WORLD, ierr)
      endif

Deadlock - solution 1

      if( rank .eq. 0 )then
         call MPI_Send(y, 10, MPI_REAL, 1, 2, MPI_COMM_WORLD, ierr)
         call MPI_Recv(x, 10, MPI_REAL, 1, 1, MPI_COMM_WORLD,
     &                 status, ierr)
      else if ( rank .eq. 1 )then
         call MPI_Send(x, 10, MPI_REAL, 0, 1, MPI_COMM_WORLD, ierr)
         call MPI_Recv(y, 10, MPI_REAL, 0, 2, MPI_COMM_WORLD,
     &                 status, ierr)
      endif

• Both ranks send first; this works only as long as MPI can buffer the outgoing messages.

Deadlock - solution 2

      if( rank .eq. 0 )then
         call MPI_Recv(x, 10, MPI_REAL, 1, 1, MPI_COMM_WORLD,
     &                 status, ierr)
         call MPI_Send(y, 10, MPI_REAL, 1, 2, MPI_COMM_WORLD, ierr)
      else if ( rank .eq. 1 )then
         call MPI_Send(x, 10, MPI_REAL, 0, 1, MPI_COMM_WORLD, ierr)
         call MPI_Recv(y, 10, MPI_REAL, 0, 2, MPI_COMM_WORLD,
     &                 status, ierr)
      endif

• The sends and receives are now matched pairwise, so no buffering is required.
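A third approach, not shown on the original slides, is MPI_Sendrecv, which performs the matched send and receive in a single call and cannot deadlock against another MPI_Sendrecv:

      #include <mpi.h>
      int main(int argc, char *argv[])
      {
          int rank;
          float xs[10] = {0}, xr[10];
          MPI_Status status;
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          if (rank < 2) {
              int other = 1 - rank;   /* exchange partner: 0 <-> 1 */
              /* send xs and receive xr in one call; MPI orders the
                 transfers internally, so neither side can deadlock */
              MPI_Sendrecv(xs, 10, MPI_FLOAT, other, 1,
                           xr, 10, MPI_FLOAT, other, 1,
                           MPI_COMM_WORLD, &status);
          }
          MPI_Finalize();
          return 0;
      }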

MPI_Pack

      parameter (bufsize=1000)
      integer nx, ny, iy(ny)
      real x(nx)
      character buffer(bufsize)
      ! count tracks the current position in buffer, starting at 0
      count = 0
      call MPI_Pack(nx, 1, MPI_INTEGER, buffer, bufsize, count,
     &              MPI_COMM_WORLD, ierr)
      call MPI_Pack(ny, 1, MPI_INTEGER, buffer, bufsize, count,
     &              MPI_COMM_WORLD, ierr)
      call MPI_Pack(x, nx, MPI_REAL, buffer, bufsize, count,
     &              MPI_COMM_WORLD, ierr)
      call MPI_Pack(iy, ny, MPI_INTEGER, buffer, bufsize, count,
     &              MPI_COMM_WORLD, ierr)
      ! send the packed size, then the packed buffer itself
      call MPI_Send(count, 1, MPI_INTEGER, dest, tag,
     &              MPI_COMM_WORLD, ierr)
      call MPI_Send(buffer, count, MPI_PACKED, dest, tag,
     &              MPI_COMM_WORLD, ierr)

MPI_Unpack

      call MPI_Recv(count, 1, MPI_INTEGER, source, tag,
     &              MPI_COMM_WORLD, status, ierr)
      call MPI_Recv(buffer, count, MPI_PACKED, source, tag,
     &              MPI_COMM_WORLD, status, ierr)
      ! unpack in the same order the data was packed
      count = 0
      call MPI_Unpack(buffer, bufsize, count, nx, 1, MPI_INTEGER,
     &                MPI_COMM_WORLD, ierr)
      call MPI_Unpack(buffer, bufsize, count, ny, 1, MPI_INTEGER,
     &                MPI_COMM_WORLD, ierr)
      call MPI_Unpack(buffer, bufsize, count, x, nx, MPI_REAL,
     &                MPI_COMM_WORLD, ierr)
      call MPI_Unpack(buffer, bufsize, count, iy, ny, MPI_INTEGER,
     &                MPI_COMM_WORLD, ierr)

MPI derived types

      len(1) = 1
      len(2) = 1
      len(3) = nx
      len(4) = ny
      type(1) = MPI_INTEGER
      type(2) = MPI_INTEGER
      type(3) = MPI_REAL
      type(4) = MPI_INTEGER
      call MPI_Address(nx, loc(1), ierr)
      call MPI_Address(ny, loc(2), ierr)
      call MPI_Address(x, loc(3), ierr)
      call MPI_Address(iy, loc(4), ierr)

MPI derived types (continued)

      call MPI_Type_struct(4, len, loc, type, MY_MPI_TYPE, ierr)
      call MPI_Type_commit(MY_MPI_TYPE, ierr)
      call MPI_Send(MPI_BOTTOM, 1, MY_MPI_TYPE, dest, tag,
     &              MPI_COMM_WORLD, ierr)
      call MPI_Type_free(MY_MPI_TYPE, ierr)

Load balancing
• Make sure that each process is busy.
• Example - calculating distances:

      do j = 1, n-1
         do i = j+1, n
            dist(i,j) = sqrt((x(i)-x(j))**2)
         enddo
      enddo

• The inner loop gets shorter as j grows, so giving each process an equal contiguous block of j values leaves the later blocks with much less work; a sketch of a cyclic distribution follows.
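A C sketch of the cyclic distribution; the array size and contents are illustrative:

      #include <mpi.h>
      #include <math.h>
      #define N 512
      int main(int argc, char *argv[])
      {
          int rank, size, i, j;
          static float x[N], dist[N][N];
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          MPI_Comm_size(MPI_COMM_WORLD, &size);
          for (i = 0; i < N; i++) x[i] = (float)i;
          /* round-robin: rank r takes columns r, r+size, r+2*size, ...
             so long and short columns are spread evenly over the ranks */
          for (j = rank; j < N-1; j += size)
              for (i = j+1; i < N; i++)
                  dist[i][j] = sqrtf((x[i]-x[j])*(x[i]-x[j]));
          MPI_Finalize();
          return 0;
      }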


Dynamic scheduling
• Small chunks of work are given to each process
• As each process finishes its chunk of work, it gets the next chunk (see the sketch below)
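A minimal master-worker sketch of dynamic scheduling in C; the chunk count, the tags, and the do_chunk work routine are all hypothetical. It assumes at least two processes and more chunks than workers:

      #include <mpi.h>
      #define NCHUNKS 100
      #define TAG_WORK 1
      #define TAG_DONE 2

      /* hypothetical stand-in for the real computation on one chunk */
      static void do_chunk(int chunk) { (void)chunk; }

      int main(int argc, char *argv[])
      {
          int rank, size;
          MPI_Status status;
          MPI_Init(&argc, &argv);
          MPI_Comm_rank(MPI_COMM_WORLD, &rank);
          MPI_Comm_size(MPI_COMM_WORLD, &size);
          if (rank == 0) {                    /* master: deal out chunks */
              int next = 0, dummy, w;
              for (w = 1; w < size; w++) {    /* one chunk per worker to start */
                  MPI_Send(&next, 1, MPI_INT, w, TAG_WORK, MPI_COMM_WORLD);
                  next++;
              }
              while (next < NCHUNKS) {        /* refill whoever finishes first */
                  MPI_Recv(&dummy, 1, MPI_INT, MPI_ANY_SOURCE, MPI_ANY_TAG,
                           MPI_COMM_WORLD, &status);
                  MPI_Send(&next, 1, MPI_INT, status.MPI_SOURCE, TAG_WORK,
                           MPI_COMM_WORLD);
                  next++;
              }
              for (w = 1; w < size; w++) {    /* drain and shut down workers */
                  MPI_Recv(&dummy, 1, MPI_INT, MPI_ANY_SOURCE, MPI_ANY_TAG,
                           MPI_COMM_WORLD, &status);
                  MPI_Send(&dummy, 1, MPI_INT, status.MPI_SOURCE, TAG_DONE,
                           MPI_COMM_WORLD);
              }
          } else {                            /* worker: receive, work, repeat */
              int chunk;
              for (;;) {
                  MPI_Recv(&chunk, 1, MPI_INT, 0, MPI_ANY_TAG,
                           MPI_COMM_WORLD, &status);
                  if (status.MPI_TAG == TAG_DONE) break;
                  do_chunk(chunk);
                  MPI_Send(&chunk, 1, MPI_INT, 0, TAG_DONE, MPI_COMM_WORLD);
              }
          }
          MPI_Finalize();
          return 0;
      }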

Debugging
• pdbx
  – Compile with -g
  – pdbx -procs 4 ./myProg
  – http://www.ncsa.uiuc.edu/UserInfo/Resources/Hardware/IBMp690/Doc/poe_pdbx.html
• dbx (or gdb)
  – Add a wait loop to the program:

      integer debugwait
      debugwait = 1
      do while(debugwait .ne. 0)
         continue
      enddo

  – Compile with -g
  – Run the program
  – Find the process numbers (ps)
  – dbx -p procnum
    • assign debugwait=0
    • debug as usual

Debugging continued
• Check for deadlock.
• Make use of tags to make sure that the correct message is received.
• Add write statements to make sure that the contents of messages are correct.
• Add MPI_Barrier calls to synchronize processes.

References
• Books
  – Using MPI: Portable Parallel Programming with the Message Passing Interface
    • William Gropp, Ewing Lusk, and Anthony Skjellum
  – Parallel Programming with MPI
    • Peter Pacheco
• Websites
  – MPI: The Complete Reference
    • http://www.netlib.org/utk/papers/mpi-book/mpi-book.html
  – Introduction to MPI
    • http://webct.ncsa.uiuc.edu:8900/webct/public/show_courses.pl
  – MPI and MPE routines
    • http://www-unix.mcs.anl.gov/mpi/www
  – PVM
    • http://www.epm.ornl.gov/pvm/pvm_home.html
  – Westgrid
    • http://www.westgrid.ca/support/programming/#parallel