由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - 问个SAS 基本问题,请大家帮忙。
相关主题
about outlier identification正态分布,请教!
outlier detections请教如何做normalization,找峰值
问个outlier 和 sample size 的问题哈time series的数据detect anomalies(outliers)?
bond price data clearn (转载)C1 电话面经 (转载)
大侠 Help! microarray Quality Control原来还有too much statistical power这么一说 (转载)
Fitting model, 头大,求建议【求帮助】能否帮我下这个paper: Multiple outlier detection in multivariate data using self-organizing maps title
请问你们在上交结果之前都做哪些检查?贡献SAS Programmer 面试问题并求答案
求助,今天某老板问这么个屁问题请问这个问题应该用什么方法解决
相关话题的讨论汇总
话题: outliers话题: 3sigma话题: plot话题: box话题: so
进入Statistics版参与讨论
1 (共1页)
s******o
发帖数: 283
1
用box plot的时候,为啥用h = 1.5IQR 来 detect outliers, 系数为啥是1.5?
One other relevant question is when we use 3sigma for outliers? why the
coefficient is 3 here?
Thanks so much in advance.
p***r
发帖数: 920
2
h=1.5 IQR, because this is the range for 25% (24.65%) on each side of the
center 50% under normal distribution.
chance to be outliers (.35%+.35%) on either side under normality assumption.

【在 s******o 的大作中提到】
: 用box plot的时候,为啥用h = 1.5IQR 来 detect outliers, 系数为啥是1.5?
: One other relevant question is when we use 3sigma for outliers? why the
: coefficient is 3 here?
: Thanks so much in advance.

p*******r
发帖数: 1951
3
这个看看正态分布曲线就明白了。至于是3 sigma,还是 2.95 sigma 完全是个约定俗
成的东西了。

【在 s******o 的大作中提到】
: 用box plot的时候,为啥用h = 1.5IQR 来 detect outliers, 系数为啥是1.5?
: One other relevant question is when we use 3sigma for outliers? why the
: coefficient is 3 here?
: Thanks so much in advance.

s******o
发帖数: 283
4
thanks a lot for all of your help.
So both the Box-plot 1.5IQR and 3sigma methods all have fixed portion of
data to be outliers, the difference between them is the value of the
percentage.
is that correct?
Also when the sample size increase, the number of outliers also increase ?
a****t
发帖数: 1007
5
还是不大懂,我记得书本上是说1.5IQR以外算是outlier,那大家在工作中,是用1.
5IQR还是3sigma?

【在 s******o 的大作中提到】
: thanks a lot for all of your help.
: So both the Box-plot 1.5IQR and 3sigma methods all have fixed portion of
: data to be outliers, the difference between them is the value of the
: percentage.
: is that correct?
: Also when the sample size increase, the number of outliers also increase ?

s******o
发帖数: 283
6
I want to know too.

【在 a****t 的大作中提到】
: 还是不大懂,我记得书本上是说1.5IQR以外算是outlier,那大家在工作中,是用1.
: 5IQR还是3sigma?

1 (共1页)
进入Statistics版参与讨论
相关主题
请问这个问题应该用什么方法解决大侠 Help! microarray Quality Control
求助简单SAS Code identify outlierFitting model, 头大,求建议
SAS 高手请帮忙请问你们在上交结果之前都做哪些检查?
SAS E-Miner regression model 问题求助,今天某老板问这么个屁问题
about outlier identification正态分布,请教!
outlier detections请教如何做normalization,找峰值
问个outlier 和 sample size 的问题哈time series的数据detect anomalies(outliers)?
bond price data clearn (转载)C1 电话面经 (转载)
相关话题的讨论汇总
话题: outliers话题: 3sigma话题: plot话题: box话题: so