l*u 发帖数: 114 | 1 最近在看版上的一些文章, 经常会有人提到data cleaning的重要性, 大家能不能说
到底怎么去clean data呢? 我能想到的就是missing value还有outlier, 具体也不知
道怎么去处理missing value... 谁给解释解释吧? 或者推荐一本这方面的书。 多谢
了! |
c*******7 发帖数: 2506 | 2 还有一部分是data的logical error,简单的例子,一个logitudinal study里面用到学
生每个学期的cumulative credit hours,通常这个变量只能逐个学期增加,不能减少
。cleaning的时候就要check很多类似这个的点。 |
l*u 发帖数: 114 | 3 恩, 多谢!
【在 c*******7 的大作中提到】 : 还有一部分是data的logical error,简单的例子,一个logitudinal study里面用到学 : 生每个学期的cumulative credit hours,通常这个变量只能逐个学期增加,不能减少 : 。cleaning的时候就要check很多类似这个的点。
|
s*****9 发帖数: 285 | 4 SAS Programming in Pharmaceutical Industry, man man kan kan |
g**a 发帖数: 2129 | 5 data provided by customer or PI won't always (actually never) suitable for
analysis. We need to do some extrating, computation,merging and combining to
generate the final dataset. Also to make sure the report and analysis
result are valid, we should check the internal validity of the data which
requires a lot reading of the project. |
l*u 发帖数: 114 | 6 恩, 明白点了, 很感谢!
to
【在 g**a 的大作中提到】 : data provided by customer or PI won't always (actually never) suitable for : analysis. We need to do some extrating, computation,merging and combining to : generate the final dataset. Also to make sure the report and analysis : result are valid, we should check the internal validity of the data which : requires a lot reading of the project.
|
c*****a 发帖数: 808 | 7 在这看过几篇paper相关文章,希望对你有用
SAS缺失数据处理 Missing Data Imputation in SAS
Multiple Imputation for Missing Data: Concepts and New Development(Version 9
.0) (very good article)
An Introduction to Multiple Imputation Methods: HandlingMissing Data with
SAS V8.2
Imputation Techniques Using SAS Software for Incomplete Datain Diabetes
Clinical Trials
A SAS Macro for Single Imputation
quote:
"This paper reviews methods for analyzing missing data, including
basic concepts and applications of multiple imputation
techniques. The paper presents SASâprocedures,
PROC MI and PROC MIANALYZE, for creating multiple imputations
for incomplete multivariate data and for analyzing
results from multiply imputed data sets"
http://bbs.pinggu.org/forum.php?mod=viewthread&tid=1144107&high |
l*u 发帖数: 114 | 8 很有用, 多谢!
9
【在 c*****a 的大作中提到】 : 在这看过几篇paper相关文章,希望对你有用 : SAS缺失数据处理 Missing Data Imputation in SAS : Multiple Imputation for Missing Data: Concepts and New Development(Version 9 : .0) (very good article) : An Introduction to Multiple Imputation Methods: HandlingMissing Data with : SAS V8.2 : Imputation Techniques Using SAS Software for Incomplete Datain Diabetes : Clinical Trials : A SAS Macro for Single Imputation : quote:
|
|
z**********i 发帖数: 12276 | |
d**********o 发帖数: 1321 | 10 thx.
9
【在 c*****a 的大作中提到】 : 在这看过几篇paper相关文章,希望对你有用 : SAS缺失数据处理 Missing Data Imputation in SAS : Multiple Imputation for Missing Data: Concepts and New Development(Version 9 : .0) (very good article) : An Introduction to Multiple Imputation Methods: HandlingMissing Data with : SAS V8.2 : Imputation Techniques Using SAS Software for Incomplete Datain Diabetes : Clinical Trials : A SAS Macro for Single Imputation : quote:
|
G**7 发帖数: 391 | 11 lvu:
The book "Data Cleaning Techniques using SAS" can be downloaded free.
Search internet.
Also, is that you who posted an ad about job opening (statistician)? |
l*u 发帖数: 114 | 12 非常感谢!
那个job opening不是我贴的, 是我转载的。。。 我没工作。。。
【在 G**7 的大作中提到】 : lvu: : The book "Data Cleaning Techniques using SAS" can be downloaded free. : Search internet. : Also, is that you who posted an ad about job opening (statistician)?
|