t*****z Posts: 812 | 1
Suppose a sparse matrix is stored in CRS format. Why does my OpenMP parallelization perform so poorly?
#pragma omp parallel for private(i,j,t)
for(i=0; i<n; i++){
    t = 0.0;
    /* entries of row i occupy positions ptr[i] .. ptr[i+1]-1 */
    for(j=A.ptr[i]; j<A.ptr[i+1]; j++)
        t += A.value[j] * x[A.index[j]];
    y[i] = t;
}
n = 400,000. The run times with 2, 4, and 8 threads are about the same: faster than 1 thread with OpenMP, but about the same as 1 thread without OpenMP.
This is for an iterative solver. Any suggestions?
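For anyone who wants to reproduce this, here is a minimal self-contained sketch of the same kernel. The struct name `crs_matrix` and the function name `crs_spmv` are made up for illustration; only the field names `ptr`, `index`, and `value` come from the post, and the layout (with `ptr` holding n + 1 row offsets) is assumed to be the usual CRS convention.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical CRS container; field names follow the post. */
typedef struct {
    int n;          /* number of rows */
    int *ptr;       /* row start offsets, length n + 1 (assumed) */
    int *index;     /* column index of each stored entry */
    double *value;  /* stored nonzero entries */
} crs_matrix;

/* Computes y = A * x. Rows are independent, so the outer loop
 * is divided among threads; each thread writes distinct y[i]. */
void crs_spmv(const crs_matrix *A, const double *x, double *y)
{
    assert(A != NULL && x != NULL && y != NULL);

    #pragma omp parallel for
    for (int i = 0; i < A->n; i++) {
        /* Variables declared inside the loop body are private to
         * each iteration, so no private(...) clause is needed. */
        double t = 0.0;
        for (int j = A->ptr[i]; j < A->ptr[i + 1]; j++)
            t += A->value[j] * x[A->index[j]];
        y[i] = t;
    }
}
```

With GCC this would be built with `-fopenmp`; without that flag the pragma is ignored and the loop simply runs serially, which gives a convenient baseline for the "1 thread without OpenMP" timing.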