由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - Order of Independent Variables in Linear Multiple Regression
相关主题
请教一个关于logistic regression参数的问题请问multi variate linear regression 选择risk factor 问题
how to interpret the interaction terms between two continuous variables in regression model?网上看到一道题
请问一个Regression的问题GBM in R
model里有multicollinearity,该如何处理呢?请教一个关于PCA的问题
model sample size重要吗?求教:Multivariable regression 中一个变量的coefficient t-statistics insignificant 怎么办?
R questionDell is hiring senior risk modeler/analyst at Austin
求助,怎么消除线性回归的multicollinearity请教, linear regression model问题
请教比较两个regression coefficient有人用SAS么 或者统计大拿帮忙看一个低级问题 (转载)
相关话题的讨论汇总
话题: x1话题: x2话题: regression话题: linear
进入Statistics版参与讨论
1 (共1页)
F****n
发帖数: 3271
1
理想状态下,IVs应该是 uncorrelated, 但实际上他们或多或少的会有一些相关性,当
相关性并不强或multicollinearity在可忍受范围内时,常常直接MODEL。但在弱相关的
情况下,IVs的Coeff显然受顺序的影响。比如
Y =b1X1 + b2X2 + b0 算出来的coefficients 和
Y =b2X2 + b1X1 + b0 不一样 (OLS)。
假设X1, X2 each accounts for 40% of Y variation when modeled separately, but
when modeled together only account for 70% (as indicated by R square). So:
1) Can we say there is 10% (80%-70%) of Y variation explained jointly by X1,
X2?
2) If so can we say b1 actually is calculated to reflect 40% variability of
F****n
发帖数: 3271
2
Moreover, for
Y = b1X1 + b2X2 + b0
is it possible to create
Y = b1X1 + b2X2' + b0
where X2' is a variable totally independent of X1, and only accounting for
the part of variance independently explained by X2? i.e. "strip away" the
part of X2 correlated with X1?

but
X1,
of

【在 F****n 的大作中提到】
: 理想状态下,IVs应该是 uncorrelated, 但实际上他们或多或少的会有一些相关性,当
: 相关性并不强或multicollinearity在可忍受范围内时,常常直接MODEL。但在弱相关的
: 情况下,IVs的Coeff显然受顺序的影响。比如
: Y =b1X1 + b2X2 + b0 算出来的coefficients 和
: Y =b2X2 + b1X1 + b0 不一样 (OLS)。
: 假设X1, X2 each accounts for 40% of Y variation when modeled separately, but
: when modeled together only account for 70% (as indicated by R square). So:
: 1) Can we say there is 10% (80%-70%) of Y variation explained jointly by X1,
: X2?
: 2) If so can we say b1 actually is calculated to reflect 40% variability of

l***a
发帖数: 12410
3
PCA

【在 F****n 的大作中提到】
: Moreover, for
: Y = b1X1 + b2X2 + b0
: is it possible to create
: Y = b1X1 + b2X2' + b0
: where X2' is a variable totally independent of X1, and only accounting for
: the part of variance independently explained by X2? i.e. "strip away" the
: part of X2 correlated with X1?
:
: but
: X1,

F****n
发帖数: 3271
4
Sorry, no PCA. I explained that multicollinearity was regarded as tolerable
(small correlation) and also PCA makes interpretation hard.
Basically I am looking for a method that can sequentially "strip away"
correlation from an IV to previous IVs. My questions actually are:
1) Whether such methods exist;
2) If no, can we interpret the coefficients as in my original post, e.g.
b1 is based on X1, b2 is based on X2 minus the correlated part with X1. My impression is probably yes, because OLS coeffs

【在 l***a 的大作中提到】
: PCA
s*r
发帖数: 2757
5
coefficient should be the same
type I ss will be different
type 3 ss will be the same
l*********s
发帖数: 5409
6
How can coef change at all? OLS estimate shall be unique.

but
X1,
of

【在 F****n 的大作中提到】
: 理想状态下,IVs应该是 uncorrelated, 但实际上他们或多或少的会有一些相关性,当
: 相关性并不强或multicollinearity在可忍受范围内时,常常直接MODEL。但在弱相关的
: 情况下,IVs的Coeff显然受顺序的影响。比如
: Y =b1X1 + b2X2 + b0 算出来的coefficients 和
: Y =b2X2 + b1X1 + b0 不一样 (OLS)。
: 假设X1, X2 each accounts for 40% of Y variation when modeled separately, but
: when modeled together only account for 70% (as indicated by R square). So:
: 1) Can we say there is 10% (80%-70%) of Y variation explained jointly by X1,
: X2?
: 2) If so can we say b1 actually is calculated to reflect 40% variability of

o****o
发帖数: 8077
7
regress your X2 on the residuals from Y=b0+b1*X1
use the projected X2 from that auxilary regression to your original model
besides, I don't think the ordering matters in OLS, check the linear algebra

tolerable
impression is probably yes, because OLS coeffs and confidence intervals are
computed sequentially, but I am not sure about it at all.

【在 F****n 的大作中提到】
: Sorry, no PCA. I explained that multicollinearity was regarded as tolerable
: (small correlation) and also PCA makes interpretation hard.
: Basically I am looking for a method that can sequentially "strip away"
: correlation from an IV to previous IVs. My questions actually are:
: 1) Whether such methods exist;
: 2) If no, can we interpret the coefficients as in my original post, e.g.
: b1 is based on X1, b2 is based on X2 minus the correlated part with X1. My impression is probably yes, because OLS coeffs

h****s
发帖数: 16779
8
coefficients should be unique: \hat{beta} = (X'X)^{-1} X'Y

but
X1,
of

【在 F****n 的大作中提到】
: 理想状态下,IVs应该是 uncorrelated, 但实际上他们或多或少的会有一些相关性,当
: 相关性并不强或multicollinearity在可忍受范围内时,常常直接MODEL。但在弱相关的
: 情况下,IVs的Coeff显然受顺序的影响。比如
: Y =b1X1 + b2X2 + b0 算出来的coefficients 和
: Y =b2X2 + b1X1 + b0 不一样 (OLS)。
: 假设X1, X2 each accounts for 40% of Y variation when modeled separately, but
: when modeled together only account for 70% (as indicated by R square). So:
: 1) Can we say there is 10% (80%-70%) of Y variation explained jointly by X1,
: X2?
: 2) If so can we say b1 actually is calculated to reflect 40% variability of

1 (共1页)
进入Statistics版参与讨论
相关主题
有人用SAS么 或者统计大拿帮忙看一个低级问题 (转载)model sample size重要吗?
model selection一般都用什么方法R question
请问model里面的model变量和control变量什么区别啊?求助,怎么消除线性回归的multicollinearity
请教simultaneous equation system请教比较两个regression coefficient
请教一个关于logistic regression参数的问题请问multi variate linear regression 选择risk factor 问题
how to interpret the interaction terms between two continuous variables in regression model?网上看到一道题
请问一个Regression的问题GBM in R
model里有multicollinearity,该如何处理呢?请教一个关于PCA的问题
相关话题的讨论汇总
话题: x1话题: x2话题: regression话题: linear