K*****N 发帖数: 117 | 1 i have a 3 nodes (each nodes with 32 processors) machine.
but why if i run the following:
*****************************************************
/* Starts MPI processes ... */
MPI_Init(&argc,&argv); /* starts MPI */
MPI_Comm_rank(MPI_COMM_WORLD, &myid); /* get current process id */
MPI_Comm_size(MPI_COMM_WORLD, &p); /* get number of processes */
*****************************************************
p value is always 1?
how to access all 32 processes?
thanks. |
h***o 发帖数: 539 | 2 问个问题先...
你怎么跑的这个程序呀...
【在 K*****N 的大作中提到】 : i have a 3 nodes (each nodes with 32 processors) machine. : but why if i run the following: : ***************************************************** : /* Starts MPI processes ... */ : MPI_Init(&argc,&argv); /* starts MPI */ : MPI_Comm_rank(MPI_COMM_WORLD, &myid); /* get current process id */ : MPI_Comm_size(MPI_COMM_WORLD, &p); /* get number of processes */ : ***************************************************** : p value is always 1? : how to access all 32 processes?
|
K*****N 发帖数: 117 | 3 sorry. i am very new to this field. i just downloaded a piece of source
code. and upload to that supercomputer and mpcc my file.
it could pass debug.
i suppose "p" value in that program equals to the number of the processes,
what 's wrong with me? i might have some very silly problem. please let me
know.
【在 h***o 的大作中提到】 : 问个问题先... : 你怎么跑的这个程序呀...
|
K*****N 发帖数: 117 | 4 got it all. not bother to reply.
【在 K*****N 的大作中提到】 : sorry. i am very new to this field. i just downloaded a piece of source : code. and upload to that supercomputer and mpcc my file. : it could pass debug. : i suppose "p" value in that program equals to the number of the processes, : what 's wrong with me? i might have some very silly problem. please let me : know.
|
h***o 发帖数: 539 | 5 forgot to assign node number in queue script or command line ba
hoho
【在 K*****N 的大作中提到】 : got it all. not bother to reply.
|
K*****N 发帖数: 117 | 6 hehe. no. just much more silly. i didn't use script. hehe.
【在 h***o 的大作中提到】 : forgot to assign node number in queue script or command line ba : hoho
|
s*****l 发帖数: 167 | 7 Is it possible to detect how much a node is available?
suppose I have to use 10 nodes to do computation, and at
the same time someone else may submit a job to some of the
processors, and I want to send the computations in my program
that cannot be parallelized to a processors that has more
space. For instance I want to have a prog like this:
call mpi_init
...
mpi_scatter
...
mpi_gather
if this processor is fully available
do .....
end if
mpi_finalize |
b**g 发帖数: 335 | 8
Sure, use "uptime" command in UNIX. It shows the system load (actually
it is avg length of job queue.)
If you're running on Linux, check the file /proc/loadavg
【在 s*****l 的大作中提到】 : Is it possible to detect how much a node is available? : suppose I have to use 10 nodes to do computation, and at : the same time someone else may submit a job to some of the : processors, and I want to send the computations in my program : that cannot be parallelized to a processors that has more : space. For instance I want to have a prog like this: : call mpi_init : ... : mpi_scatter : ...
|
s*****l 发帖数: 167 | 9 That I know. I meant during the running, in the program unit.
【在 b**g 的大作中提到】 : : Sure, use "uptime" command in UNIX. It shows the system load (actually : it is avg length of job queue.) : If you're running on Linux, check the file /proc/loadavg
|
l******v 发帖数: 12 | 10 very interesting question. you basically want the slave node doesn't do its
job after the master node has assigning them one.
alternatively, you may want the slave node to check it's load average, respond
to the master node, and let the master decide if to allocate some job to that
node.
you may want to let the master node sort the load average. because you may
have to compete with other programs on all the nodes.
i don't know if it's implementable. if you did that, it would be good to
know.
【在 s*****l 的大作中提到】 : That I know. I meant during the running, in the program unit.
|