FFT_OPENMP is a C program which demonstrates the computation of a Fast Fourier Transform in parallel, using OpenMP.
On an Apple PowerPC G5 with two processors, the following results were observed:
Threads | N | Time | MegaFLOPS |
---|---|---|---|
1 | 33,554,432 | 32.65 | 128.4 |
2 | 33,554,432 | 19.87 | 211.0 |
4 | 33,554,432 | 19.80 | 211.7 |
In the BASH shell, the program could be run with 2 threads using the commands:
export OMP_NUM_THREADS=2 ./fft_openmp
The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.
FFT_OPENMP is available in a C version and a C++ version and a FORTRAN77 version and a FORTRAN90 version.
DIJKSTRA_OPENMP, a C program which uses OpenMP to parallelize a simple example of Dijkstra's minimum distance algorithm for graphs.
FFT_SERIAL, a C program which demonstrates the computation of a Fast Fourier Transform, and is intended as a starting point for implementing a parallel version using OpenMP.
FFTPACK4, a C library which implements the Fast Fourier Transform (FFT) by Paul Swarztrauber and Dick Valent, translated by Andrew Fernandes;
FFTW3, C programs which illustrate the use of the FFTW3 library for Fast Fourier Transforms, by Matteo Frigo and Steven Johnson.
HEATED_PLATE_OPENMP, a C program which solves the steady (time independent) heat equation in a 2D rectangular region, using OpenMP to run in parallel.
HELLO_OPENMP, a C program which prints out "Hello, world!" using the OpenMP parallel programming environment.
MD_OPENMP, a C program which carries out a molecular dynamics simulation using OpenMP.
MULTITASK_OPENMP, a C program which demonstrates how to "multitask", that is, to execute several unrelated and distinct tasks simultaneously, using OpenMP for parallel execution.
MXM_OPENMP, a C program which computes a dense matrix product C=A*B, using OpenMP for parallel execution.
OPENMP, C programs which illustrate the use of the OpenMP application program interface for carrying out parallel computations in a shared memory environment.
OPENMP_RCC, C programs which illustrate how a C program, using OpenMP, can be compiled and run in batch mode on the FSU High Performance Computing (HPC) cluster operated by the Research Computing Center (RCC).
POISSON_OPENMP, a C program which computes an approximate solution to the Poisson equation in a rectangle, using the Jacobi iteration to solve the linear system, and OpenMP to carry out the Jacobi iteration in parallel.
PRIME_OPENMP, a C program which counts the number of primes between 1 and N, using OpenMP for parallel execution.
QUAD_OPENMP, a C program which approximates an integral using a quadrature rule, and carries out the computation in parallel using OpenMP.
RANDOM_OPENMP, a C program which illustrates how a parallel program using OpenMP can generate multiple distinct streams of random numbers.
SATISFY_OPENMP, a C program which demonstrates, for a particular circuit, an exhaustive search for solutions of the circuit satisfy problem, using OpenMP for parallel execution.
SCHEDULE_OPENMP, a C program which demonstrates the default, static, and dynamic methods of "scheduling" loop iterations in OpenMP to avoid work imbalance.
SFTPACK, a C library which implements the "slow" Fourier transform, intended as a teaching tool and comparison with the fast Fourier transform.
SGEFA_OPENMP, a C program which solves a linear system by Gaussian elimination, using OpenMP.
ZIGGURAT_OPENMP, a C program which demonstrates how the ZIGGURAT library can be used to generate random numbers in an OpenMP parallel program.
FFT_LOCAL_GCC compiles, links, loads and runs the program using 1, 2 and 4 threads, the local interactive environment, and the GCC C compiler.
FFT_LOCAL_ICC compiles, links, loads and runs the program using 1, 2 and 4 threads, the local interactive environment, and the ICC C compiler.
FFT_ITHACA_GCC compiles, links, loads and runs the program using 1, 2 and 4 threads, using the PBS queueing system on Virginia Tech's Ithaca cluster, and the GCC C compiler.
FFT_ITHACA_ICC compiles, links, loads and runs the program using 1, 2 and 4 threads, using the PBS queueing system on Virginia Tech's Ithaca cluster, and the ICC C compiler.
You can go up one level to the C source codes.