Computation board - A question about DQS 3.3.2
L***i
Posts: 11
1
On our PC cluster we use DQS 3.3.2 to manage all jobs. There are several
classes of queues, for example X1000, X2000, and X3000. I have run into a
problem: jobs cannot be submitted to one queue, X2000. For instance, if you
run 'qsub file.job' and then 'qstat', no job for queue X2000 appears in either
the queued list or the running list. Yet sometimes the jobs do get picked up.
It seems very strange to me.

Does anyone know what is going on? Any help would be greatly appreciated.
d*****w
Posts: 124
2
Are you sure the queue is okay?
1. Is dqs_execd running on the node?
2. Have the dead jobs in the queue been cleared?
Try qstat -f to find the problem.
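
As a rough way to script check 1 on a Linux node, here is a minimal Python sketch that scans /proc for processes whose name contains "dqs_execd". The helper name find_processes is hypothetical and not part of DQS, and the sketch assumes a Linux /proc filesystem that exposes a per-process comm file.

```python
import os


def find_processes(name_fragment, proc_root="/proc"):
    """Return sorted (pid, command name) pairs whose name contains name_fragment.

    Assumption: proc_root is a Linux-style /proc tree with <pid>/comm files.
    """
    matches = []
    if not os.path.isdir(proc_root):
        return matches
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "meminfo"
        try:
            with open(os.path.join(proc_root, entry, "comm")) as fh:
                comm = fh.read().strip()
        except OSError:
            continue  # the process exited or is not readable
        if name_fragment in comm:
            matches.append((int(entry), comm))
    return sorted(matches)


if __name__ == "__main__":
    found = find_processes("dqs_execd")
    if found:
        for pid, comm in found:
            print(pid, comm)
    else:
        print("no dqs_execd process found on this node")
```

A running daemon only shows that the process exists; it does not show that the daemon can reach the qmaster.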


d*****w
Posts: 124
3

It seems okay. Perhaps file.job is running on other nodes.
If qstat -f shows X2000 as UP, then it should be normal.



L***i
Posts: 11
4
I checked all queues with "qstat -f"; every machine is UP. But the machines
with the X2000 queue cannot pick up jobs, even though dqs_execd is running on
them. The following messages are listed in err_file (host033 runs the qmaster
daemon):
time=1058023801 DQS_WARNING_0257 dqs_open_tcp: cannot connect to peer host033 errno= 111 ../SRC/dqs_io.c 212 /usr/local/DQS_332/bin/dqs_execd332 host067
time=1058023801 DQS_ERROR_0458 unable to connect to host "host033" ../SRC/dqs_send_receive.c 170 /usr/
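
For anyone reading the log above: on Linux, errno 111 is ECONNREFUSED, meaning the TCP connection from host067 to host033 was actively refused. The sketch below is one possible way to pull the errno out of such lines and to test from an execution node whether a port on the qmaster host accepts connections. It assumes the nodes run Linux. The function names, the regular expressions, and the port argument are hypothetical; the actual qmaster port has to come from your own DQS configuration.

```python
import re
import socket

# Linux errno values (assumption: the cluster nodes run Linux).
LINUX_ERRNO = {
    110: "ETIMEDOUT (connection timed out)",
    111: "ECONNREFUSED (connection refused)",
    113: "EHOSTUNREACH (no route to host)",
}

# Pattern inferred from the two log lines quoted in this thread only.
LINE_RE = re.compile(
    r"time=(?P<time>\d+)\s+(?P<code>DQS_(?:WARNING|ERROR)_\d+)\s+(?P<rest>.*)"
)
ERRNO_RE = re.compile(r"errno=\s*(?P<errno>\d+)")


def parse_err_line(line):
    """Split one err_file line into time, message code, text, and errno."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    record = {
        "time": int(m.group("time")),
        "code": m.group("code"),
        "message": m.group("rest"),
        "errno": None,
        "meaning": None,
    }
    e = ERRNO_RE.search(m.group("rest"))
    if e is not None:
        number = int(e.group("errno"))
        record["errno"] = number
        record["meaning"] = LINUX_ERRNO.get(number, "not in this table")
    return record


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    sample = (
        "time=1058023801 DQS_WARNING_0257 dqs_open_tcp: cannot connect to "
        "peer host033 errno= 111 ../SRC/dqs_io.c 212 "
        "/usr/local/DQS_332/bin/dqs_execd332 host067"
    )
    print(parse_err_line(sample))
```

If can_connect("host033", port) returns False from a node with an X2000 queue but True from a node whose queue works, the difference is likely in networking or name resolution between those nodes and the qmaster host rather than in the queue definition.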

