第5页 - 关于pvalue的讨论汇总 - 话题女王

全部话题 - 话题: pvalue

n****t
发帖数: 182

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

http://stat.duke.edu/~berger/papers/99-13.html
Is much better written and understandable, this one seems unclear.

y***i
发帖数: 11639

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

文章不在了。能不能重新给个链接。多谢！

n****t
发帖数: 182

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

链接打错了，原贴改正了。

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

I
回答一点哈，文章中说的是“odds of hypothesis is true”，断句是这样的，odds
of (hypothesis is true)，就是说，“hypothesis是对的”这事的odds。odds是EPI里
面有定义的东西，就是P/(1-P)，和“赢的概率”差不多。你也可以理解为：
probability of hypothesis is true。你断句成（odds of hypothesis）is true，那
是肯定理解不了的。

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

u*h
发帖数: 397

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

1*0.05/(1*0.05+0.05*0.95) = 0.51

05/
P

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

我算错了

y***i
发帖数: 11639

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

多谢！

y***i
发帖数: 11639

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

上面网友给的文章里的公式：
type 1 error = e*p*ln(p)/(1 + e*p*ln(p))
这个公式算出来的数和文章中一致。我还不懂这个公式怎么来的。verdelite的公式
比较好理解，可惜看来不对 @@

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

其实那片文章里面有reference说明计算是怎么做的。就是后面reference的第5篇。我
下载了还没仔细看,是个3页的小短文。

n****t
发帖数: 182

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

http://stat.duke.edu/~berger/papers/99-13.html
Is much better written and understandable, this one seems unclear.

y***i
发帖数: 11639

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

文章不在了。能不能重新给个链接。多谢！

n****t
发帖数: 182

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

链接打错了，原贴改正了。

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

u*h
发帖数: 397

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

1*0.05/(1*0.05+0.05*0.95) = 0.51

05/
P

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

我算错了

y***i
发帖数: 11639

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

多谢！

y***i
发帖数: 11639

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

v*******e
发帖数: 11604

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

其实那片文章里面有reference说明计算是怎么做的。就是后面reference的第5篇。我
下载了还没仔细看,是个3页的小短文。

m******e
发帖数: 21

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

mark!

n****t
发帖数: 182

来自主题: Statistics版 - 请教Nature上一篇关于pValue的短文

Before the experiment, Bayesian statisticians think about the “prior odds”
, Pr(H1)/Pr(H0), where Pr(H1) is the probability of H1 is true, and Pr(H0)
is the probability of H0 is true.
Frequentist, on the other hand, does not think that way: there is only one
simple truth, H0 is true or H1 is true.
After experiment, Bayesians gain Pr(H1/Data)/Pr(H0/Data), the so called
posterior odds.
Bayesian factor (bf)=Posterior odds/Prior odds. Bayesians use this for
hypothesis testing. (Another interesting ... 阅读全帖

n*****3
发帖数: 1584

来自主题: Statistics版 - 为什么中心极限定律要求sample size >= 30

like pvalue .05, 当初牛人拍拍脑`带` 想的，没`森莫．．．．

w*****1
发帖数: 473

来自主题: Statistics版 - 用R画图如何设置X-Y轴的比例 (转载)

我在画图之前设置了图的宽度和高度，
下面是我的code:
mhtdata=read.table('mht-bsbp.txt',head=T)
> png("bsbp.png",width=4,height=3)
> data <- with(mhtdata,cbind(chr,pos,PVALUE))
> par(las=2, xpd=TRUE, cex.axis=1.4, cex=1.2)
> color <- rep(c("black","red"),11)
> ops <- mht.control(colors=color,yline=1.5,xline=3,srt=0)
> mhtplot(data,ops,pch=19)
Loading required package: grid
结果出现这样的提示：
Error in plot.new() : figure margins too large
该如何设置参数呢？谢谢

s***y
发帖数: 1130

来自主题: Statistics版 - 用R画图如何设置X-Y轴的比例 (转载)

png("bsbp.png",width=4,height=3,units="in")
默认单位是像素点pixel啊。。

我在画图之前设置了图的宽度和高度，
下面是我的code:
mhtdata=read.table('mht-bsbp.txt',head=T)
> png("bsbp.png",width=4,height=3)
> data <- with(mhtdata,cbind(chr,pos,PVALUE))
> par(las=2, xpd=TRUE, cex.axis=1.4, cex=1.2)
> color <- rep(c("black","red"),11)
> ops <- mht.control(colors=color,yline=1.5,xline=3,srt=0)
> mhtplot(data,ops,pch=19)
Loading required package: grid
结果出现这样的提示：
Error in plot.new() : figure margins too large
该如何设置参数呢？谢谢

x******n
发帖数: 173

来自主题: Statistics版 - glm中设定pdiff=controll和controlu的pvalue一样

这个是怎么回事？
谢谢

c****y
发帖数: 94

来自主题: Statistics版 - 请教一个统计问题 (转载)

It sounds like that you did a pairwise comparisons among those methods. You
need to apply adjustment. You actually should have 10 pvalue (5x4/2). If you
want to use Bonferroni Correction, it may be too conservative.In sas, you
can use proc multtest with hommel option (different way to control family
error rate). Hope it can work for you. If adjusted p-value is not
significant, it just means there are not significant difference between the
methods (2 methods compared). Actually, you should do ANO... 阅读全帖

r********n
发帖数: 6979

来自主题: Statistics版 - clinical trial的subgroup analysis必须是pre-specified的？

就算做了adjustment也没用
每个trial里面的变量实在太多了
就像我做过的一个分析里面
有一个变量的pvalue是10e-11(p-value小到一定程度就没有啥意义了,毕竟这个东西和
sample size的关系太大了)
不管怎么adjust都是significant的
最后在另外一个independent dataset里面发现几乎没有什么区别
这样的例子太多了
说的不好听一点
只要overfit
retrospective analysis想要什么好结果都能做出来
fda这么严格也是有道理的

K******c
发帖数: 619

来自主题: Statistics版 - 请问multi variate linear regression 选择risk factor 问题

pvalue

E**********e
发帖数: 1736

来自主题: Statistics版 - 做credit risk scorecard的朋友们，请进来，有问题求教

是的。现在已经用regulation了。问题是每次cross validation，进入的变量都变
。 AUC提高了点，问题是原先的变量是基于数据一和二选的。可能还是有bias，用到
新的数据会performancwe 不好。
现在的问题就是怎么unsupervised的预选重要的变量。不知大银行的modeler是基于
pvalue， IV，clustering 来预选变量吗？就像那几本modeling的书讲的一样。

a****g
发帖数: 8131

来自主题: Statistics版 - 求助 sas 高手一个程序问题

有两种方法
1. 如果下面这个表格是你所有的结果，你可以用个hard coding
2. 如果结果比较多，你可以用个format，pvalue to stars的format.

E**********e
发帖数: 1736

来自主题: Statistics版 - 有一个统计题请教大家，搞糊涂了

你这个hypothesis 是对的，从定义出发，pvalue的定义在H0下是P(X>x) ，但是由
于u小于50,最后P（(X-u)/sigma<2.677),答案就是D了。
我也学习了。

：
：我觉得如果定H0: u<=50, 那么 H1: u>50

t********6
发帖数: 43

来自主题: DataSciences版 - p value被摈弃了？如何算confidence interval之类的东西？

Confidence interval和pvalue都是基于假定的distribution来估算的。
interpretation就是基于假定的分布的variance，如果有<0.05的可能性你观察到的
mean的差别是chance，那你就可以reject null，然后认定两组有差别。
但具体的数据未必符合假设，尤其是variance。相对来讲bayesian bandit的ab test更
接近实际一些

d*********k
发帖数: 1239

来自主题: DataSciences版 - T家onsite面经

第四轮，国人小哥DS，问AB test，我说要bootstrap来做confidence interval，他说
如果不bootstrap呢，我有点愣住了，然后百般提示之下，说要不T test吧。然后他说T
家AB test两边sample都上百万，我又愣住了，说这样的话p value会inflate，然后总
是会reject hypothesis。然后聊了会儿bandit algorithm。国人小哥非常谦虚，没有
因为我指出他们的问题而不高兴。
这个问题怎么解决呢？如果我的ttest是直接用所有的数据点（不存在sampling），由
于size很大，我有时候时发现，经常会不符合常理的要拒绝null hypothesis（pvalue<
0.05）

correct

c***z
发帖数: 6348

来自主题: DataSciences版 - T家onsite面经

This is because p value increases as standard error decreases, and the
latter decreases as sample size increases
Check out the formula and the math is very simple
In other words, it is time to dump p values altogether

说T
pvalue<

d*********k
发帖数: 1239

来自主题: DataSciences版 - T家onsite面经

那怎么解决这个问题？
特别是在null hypothesis成立的前提下，pvalue还是个uniform的distribution
我现在工作中就遇到了这个问题

发帖数: 1

来自主题: DataSciences版 - 请教一个用R做cox regression的问题

cox.zph is used to test the ph assumption, i.e., if the effect of a
covariate changes along with time. The theory behind this test is explained
here:
http://psfaculty.ucdavis.edu/bsjjones/coxslides2.pdf
If the pvalue from cox.zph is too small to assume the ph assumption holds,
you can use stratified cox ph model, see https://stat.ethz.ch/education/
semesters/ss2011/seminar/contents/presentation_5.pdf
To select the best model, use Likelihood ratio test.
But in reality the problem is more complex.... 阅读全帖

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

topics

未名新帖统计// 7月16日

历史上的今天