boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - 真心请教: data cleaning
相关主题
帮朋友post一个SAS问题,求高人指点。多谢各位了!
How to get summary statistics from multiple imputed data sets
proc logistic遇到missing value怎么处理
新手求教:关于sas proc mianalyze
请教个multiple imputation的问题
Proc mianalyze 如何得到proc mixed repeated fixed effects P-value
请教一个sas问题
求解, 用SAS PROC MI 做 missing data imputation
missing values imputation
面试时关于如何处理missing data的回答
相关话题的讨论汇总
话题: data话题: sas话题: imputation话题: cleaning话题: missing
进入Statistics版参与讨论
1 (共1页)
l*u
发帖数: 114
1
最近在看版上的一些文章, 经常会有人提到data cleaning的重要性, 大家能不能说
到底怎么去clean data呢? 我能想到的就是missing value还有outlier, 具体也不知
道怎么去处理missing value... 谁给解释解释吧? 或者推荐一本这方面的书。 多谢
了!
c*******7
发帖数: 2506
2
还有一部分是data的logical error,简单的例子,一个logitudinal study里面用到学
生每个学期的cumulative credit hours,通常这个变量只能逐个学期增加,不能减少
。cleaning的时候就要check很多类似这个的点。
l*u
发帖数: 114
3
恩, 多谢!

【在 c*******7 的大作中提到】
: 还有一部分是data的logical error,简单的例子,一个logitudinal study里面用到学
: 生每个学期的cumulative credit hours,通常这个变量只能逐个学期增加,不能减少
: 。cleaning的时候就要check很多类似这个的点。

s*****9
发帖数: 285
4
SAS Programming in Pharmaceutical Industry, man man kan kan
g**a
发帖数: 2129
5
data provided by customer or PI won't always (actually never) suitable for
analysis. We need to do some extrating, computation,merging and combining to
generate the final dataset. Also to make sure the report and analysis
result are valid, we should check the internal validity of the data which
requires a lot reading of the project.
l*u
发帖数: 114
6
恩, 明白点了, 很感谢!

to

【在 g**a 的大作中提到】
: data provided by customer or PI won't always (actually never) suitable for
: analysis. We need to do some extrating, computation,merging and combining to
: generate the final dataset. Also to make sure the report and analysis
: result are valid, we should check the internal validity of the data which
: requires a lot reading of the project.

c*****a
发帖数: 808
7
在这看过几篇paper相关文章,希望对你有用
SAS缺失数据处理 Missing Data Imputation in SAS
Multiple Imputation for Missing Data: Concepts and New Development(Version 9
.0) (very good article)
An Introduction to Multiple Imputation Methods: HandlingMissing Data with
SAS V8.2
Imputation Techniques Using SAS Software for Incomplete Datain Diabetes
Clinical Trials
A SAS Macro for Single Imputation
quote:
"This paper reviews methods for analyzing missing data, including
basic concepts and applications of multiple imputation
techniques. The paper presents SASâprocedures,
PROC MI and PROC MIANALYZE, for creating multiple imputations
for incomplete multivariate data and for analyzing
results from multiply imputed data sets"
http://bbs.pinggu.org/forum.php?mod=viewthread&tid=1144107&high
l*u
发帖数: 114
8
很有用, 多谢!

9

【在 c*****a 的大作中提到】
: 在这看过几篇paper相关文章,希望对你有用
: SAS缺失数据处理 Missing Data Imputation in SAS
: Multiple Imputation for Missing Data: Concepts and New Development(Version 9
: .0) (very good article)
: An Introduction to Multiple Imputation Methods: HandlingMissing Data with
: SAS V8.2
: Imputation Techniques Using SAS Software for Incomplete Datain Diabetes
: Clinical Trials
: A SAS Macro for Single Imputation
: quote:

z**********i
发帖数: 12276
9
http://www.amazon.com/Codys-Cleaning-Techniques-Using-Second/dp
Ron Cody

【在 l*u 的大作中提到】
: 很有用, 多谢!
:
: 9

d**********o
发帖数: 1321
10
thx.

9

【在 c*****a 的大作中提到】
: 在这看过几篇paper相关文章,希望对你有用
: SAS缺失数据处理 Missing Data Imputation in SAS
: Multiple Imputation for Missing Data: Concepts and New Development(Version 9
: .0) (very good article)
: An Introduction to Multiple Imputation Methods: HandlingMissing Data with
: SAS V8.2
: Imputation Techniques Using SAS Software for Incomplete Datain Diabetes
: Clinical Trials
: A SAS Macro for Single Imputation
: quote:

G**7
发帖数: 391
11
lvu:
The book "Data Cleaning Techniques using SAS" can be downloaded free.
Search internet.
Also, is that you who posted an ad about job opening (statistician)?
l*u
发帖数: 114
12
非常感谢!
那个job opening不是我贴的, 是我转载的。。。 我没工作。。。

【在 G**7 的大作中提到】
: lvu:
: The book "Data Cleaning Techniques using SAS" can be downloaded free.
: Search internet.
: Also, is that you who posted an ad about job opening (statistician)?

1 (共1页)
进入Statistics版参与讨论
相关主题
面试时关于如何处理missing data的回答
大家平时怎么处理missing data?
[合集] 用SAS or SUDAAN处理人口统计数据的问题
如何计算imputed data set的mean的confidence interval
求 imputation 后 出来的iteration 的数据作用
SAS help needed, interpolating missing values
SAS question,thanks!
question about multiple imputation of not normally distributed variable
about Proc MI
about Proc MI
相关话题的讨论汇总
话题: data话题: sas话题: imputation话题: cleaning话题: missing