X86的shared-memory 机器还是不行阿？ (转载) - Computation版 - 未名存档

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

Computation版 - X86的shared-memory 机器还是不行阿？ (转载)

相关主题
● 怎么拟合这样的曲线	● Matlab problem
● Newton's method最后收敛速度很慢，求解释	● MATLAB 并行计算问题
● who know what is the advantage of UMS to MPI or SMP?	● any body have some experience using 64bit Linux?
● High Performance Computing software engineer wanted @ Cary NC	● 有没有sun unix的高手呀？
● 请教1万个变量的线性方程组怎么解啊?	● fortran里矩阵传递有维数限制吗?
● 我们碰到的大麻烦——急寻计算机工作和博后机会 (转载)	● 做计算买什么样的机器最好？
● 想买一台计算功能强的机子 (转载)	● 准备干两台dual opteron的机器
● 发现一个有趣的事情，关于fortran IMSL library	● 买机器建议

相关话题的讨论汇总
话题: x86话题: memory话题: 机器话题: shared话题: opteron

进入Computation版参与讨论

1

(共1页)

s**i 发帖数: 381	1 【以下文字转载自 Hardware 讨论区】发信人: seki (瓜金), 信区: Hardware 标题: X86的shared-memory 机器还是不行阿？发信站: BBS 未名空间站 (Wed Feb 28 19:54:48 2007), 站内发信人: seki (瓜金), 信区: Linux 标题: X86的shared-memory 机器还是不行阿？发信站: BBS 未名空间站 (Wed Feb 28 19:54:41 2007), 转信最近用了一台Sunfire X4600 8个dual core 的AMD Opteron 855 cpu MHz : 2613.696 cache size : 1024 KB 60GB RAM，号称memory bandwidth up to 6.4GB/sec 在上面运行一个MPI code（主要就是解大型稀疏矩阵）,parallel scaling 远远不如另外一台 Opteron cluster,而且cluster的机器的CPU还要慢些今天又试了试一台IBM 595: 64 Pow
b***e 发帖数: 38	2 Can you tell how large the sparse matrix is and how fast the solver runs? 【在 s**i 的大作中提到】 : 【以下文字转载自 Hardware 讨论区】 : 发信人: seki (瓜金), 信区: Hardware : 标题: X86的shared-memory 机器还是不行阿？ : 发信站: BBS 未名空间站 (Wed Feb 28 19:54:48 2007), 站内 : 发信人: seki (瓜金), 信区: Linux : 标题: X86的shared-memory 机器还是不行阿？ : 发信站: BBS 未名空间站 (Wed Feb 28 19:54:41 2007), 转信 : 最近用了一台Sunfire X4600 : 8个dual core 的AMD Opteron 855 : cpu MHz : 2613.696
s**i 发帖数: 381	3 For example, one matrix is 2,170,159 X 2,170,159 it has 58,400,311 nonzeros using preconditioned conjugate gradient method it converges in 7 iterations. If I use 32 processes, it only takes 0.28 seconds. If I use 2 processes, it takes 5.29 seconds The serial version converges in 4 iterations, taking 7.7 seconds. 【在 b***e 的大作中提到】 : Can you tell how large the sparse matrix is and how fast the solver runs?
m***t 发帖数: 254	4 hoho, interesting post, two things: for SMP, you want to use openMP, not MPI ; second, cache performance is the ultimate key, not the clock rate. 【在 s**i 的大作中提到】 : For example, one matrix is 2,170,159 X 2,170,159 : it has 58,400,311 nonzeros : using preconditioned conjugate gradient method : it converges in 7 iterations. : If I use 32 processes, it only takes 0.28 seconds. : If I use 2 processes, it takes 5.29 seconds : The serial version converges in 4 iterations, taking 7.7 seconds.

1

(共1页)

进入Computation版参与讨论

相关主题
● 买机器建议	● 请教1万个变量的线性方程组怎么解啊?
● Using OpenMP 3 zz	● 我们碰到的大麻烦——急寻计算机工作和博后机会 (转载)
● 大家为什么用beowulf?	● 想买一台计算功能强的机子 (转载)
● 大家用什么做平行计算	● 发现一个有趣的事情，关于fortran IMSL library
● 怎么拟合这样的曲线	● Matlab problem
● Newton's method最后收敛速度很慢，求解释	● MATLAB 并行计算问题
● who know what is the advantage of UMS to MPI or SMP?	● any body have some experience using 64bit Linux?
● High Performance Computing software engineer wanted @ Cary NC	● 有没有sun unix的高手呀？

相关话题的讨论汇总
话题: x86话题: memory话题: 机器话题: shared话题: opteron

未名新帖统计// 7月16日

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

* 这里只显示发帖超过25的版面，努力灌水吧:-)