由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - which route in SAS is faster?
相关主题
求救:SAS programmer面试题!请教SAS ADV一道题!
问个效率问题 SQL vs data step,大数据量ask R code
SAS memory managementWhat's the wrong with this SAS code?
SAS dataset 中,怎么把数据往上移一行?[提问]怎样sort这个dataset?
How to Macro it in SAS?how to avoid changing the raw dataset when you work on proc sort?
为什么合并这两个数据不能用sas merge 而要用SAS SQLsas adv 63题 52
请教 2 道SAS Adv 真题求教一个简单的data step 牛肉包
急!SAS adv的一道题,谢谢!请教一个用SAS作DATA MERGE的问题
相关话题的讨论汇总
话题: sas话题: sort话题: sql话题: sorting话题: faster
进入Statistics版参与讨论
1 (共1页)
l*********s
发帖数: 5409
1
1. sort then match join within data step vs. SQL inner join
2. sorting whole big datasets vs. splitting it into smaller subsets,
sorting individually then rejoin?
L***4
发帖数: 13
2
It depends on many factors - the size of base table/lookup table(s), your
CPU/IO/Memory capability, and the programming needs. Typically if the size
of larger dataset is below 1G, whatever methods are fine - I choose the
method easy to write and read. Otherwise, I will try to avoid sorting the
large data (if you have to sort, split sort or hash sort will use less
resource), for example, if base table is huge and there are several small
lookup tables, I like SET KEY. if you have two huge data and

【在 l*********s 的大作中提到】
: 1. sort then match join within data step vs. SQL inner join
: 2. sorting whole big datasets vs. splitting it into smaller subsets,
: sorting individually then rejoin?

l*********s
发帖数: 5409
3
Thank you very much ^__^

【在 L***4 的大作中提到】
: It depends on many factors - the size of base table/lookup table(s), your
: CPU/IO/Memory capability, and the programming needs. Typically if the size
: of larger dataset is below 1G, whatever methods are fine - I choose the
: method easy to write and read. Otherwise, I will try to avoid sorting the
: large data (if you have to sort, split sort or hash sort will use less
: resource), for example, if base table is huge and there are several small
: lookup tables, I like SET KEY. if you have two huge data and

p********a
发帖数: 5352
4
我平时基本上用PROC SQL,灵活好用啊。另外,那些老家伙们不懂SQL,让那些家伙看
不懂好了。
不过很大很大的DATA,还是MERGE快点。我做过很多试验,SORT/MERGE确实比SQL快点。
或者用PROC FORMAT/INDEX之类的方法提高效率。
b*******r
发帖数: 152
5
there was a heated debate on sql vs merge on this board not long time ago.
search and get some sense out of it.
A*******s
发帖数: 3942
6
for 2.
proc sort has threaded option from version 9. you dont have to do it on your
own.

【在 l*********s 的大作中提到】
: 1. sort then match join within data step vs. SQL inner join
: 2. sorting whole big datasets vs. splitting it into smaller subsets,
: sorting individually then rejoin?

l*********s
发帖数: 5409
7
Good to learn, thank you.

your

【在 A*******s 的大作中提到】
: for 2.
: proc sort has threaded option from version 9. you dont have to do it on your
: own.

h*e
发帖数: 10233
8
我的经验是大的dataset proc sort merge快一些.我的数据动不动就是几百万个record
,应该算是比较大的。
1 (共1页)
进入Statistics版参与讨论
相关主题
请教一个用SAS作DATA MERGE的问题How to Macro it in SAS?
贡献SAS Programmer 面试问题并求答案为什么合并这两个数据不能用sas merge 而要用SAS SQL
SAS Code 求助,如何把在另一个dataset的id找出来请教 2 道SAS Adv 真题
在SAS中不用proc sql的情况下实现inequality join急!SAS adv的一道题,谢谢!
求救:SAS programmer面试题!请教SAS ADV一道题!
问个效率问题 SQL vs data step,大数据量ask R code
SAS memory managementWhat's the wrong with this SAS code?
SAS dataset 中,怎么把数据往上移一行?[提问]怎样sort这个dataset?
相关话题的讨论汇总
话题: sas话题: sort话题: sql话题: sorting话题: faster