Exploit Low CPU Usage
How can a threaded implementation of MPI put the spare CPU capacity to use?
A. Overlapping communication/computation
Paper Idea: Quantify this
B. Virtual Machines: Creating large virtual testbeds
C. ?
Read more!
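One way to start quantifying idea A is a toy model. The sketch below is not MPI code: it assumes a background thread stands in for a communication thread, and that a fixed sleep stands in for a message transfer during which the CPU would otherwise sit idle. The names fake_communication, compute, run_serial, and run_overlapped are hypothetical, chosen only for illustration.

```python
import threading
import time


def fake_communication(duration_s, status):
    """Stand-in for a blocking message transfer (hypothetical, not an MPI call)."""
    time.sleep(duration_s)  # the CPU is idle while the simulated network works
    status["sent"] = True


def compute(n):
    """CPU-bound work that does not depend on the message in flight."""
    return sum(i * i for i in range(n))


def run_serial(comm_s, n):
    """Communicate first, then compute: no overlap."""
    status = {}
    start = time.perf_counter()
    fake_communication(comm_s, status)
    result = compute(n)
    return result, time.perf_counter() - start


def run_overlapped(comm_s, n):
    """Hand the transfer to a helper thread and compute while it is in flight."""
    status = {}
    start = time.perf_counter()
    helper = threading.Thread(target=fake_communication, args=(comm_s, status))
    helper.start()
    result = compute(n)
    helper.join()  # do not continue until the simulated transfer has completed
    return result, time.perf_counter() - start


if __name__ == "__main__":
    serial_result, serial_s = run_serial(0.2, 1_000_000)
    overlap_result, overlap_s = run_overlapped(0.2, 1_000_000)
    print(f"serial:     {serial_s:.3f} s")
    print(f"overlapped: {overlap_s:.3f} s")
    print(f"time hidden by overlap: {serial_s - overlap_s:.3f} s")
```

In this model the serial time is roughly the sum of the transfer and compute times, while the overlapped time is roughly the larger of the two. A real measurement would replace the sleep with actual MPI transfers, where progress threads, thread-safety overhead, and contention for cores can shrink the gain.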
Assuming GPUs can improve the price/performance curve of HPC machines, what is the best way to use them?