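Statistics board - R programming interview questions wrecked me; asking for solutions here. I don't have much forum money, but I will gladly hand out what I have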
x******0
Posts: 1490
1
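I interviewed for a software position and thought I had a good shot, but almost all of the questions were on R programming, which was not in the job description. I hope the experts here can help me work through them; I would be very grateful.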
1)
An analytical technique used in a molecular biology lab involves dispensing solutions of DNA in very low concentration into 384-well plates. Consider the perfectly random distribution of N=30 molecules onto this device.
a) What is the probability that two molecules fall in the same well? Derive the closed-form equation for this probability.
b) Plot the above expression for the probability as a function of the number of molecules dispensed.
c) Solve the same problem by means of a simulation. Include your R code and provide appropriate statements to ensure exact reproducibility of your simulation results.
d) Consider the case where the above-described device is affected by so-called "edge effects", that is, the probability of a molecule landing in the wells located on the edge of the plate is smaller than the probability of a molecule landing in any other well. Assuming that the probability ratio is 1/3, revise the simulation above to calculate the probability that half of the molecules are found in the center wells of the plate.
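A minimal sketch for Question 1, written in Python rather than the R the question asks for (in R, `set.seed()` and `sample()` would play the roles of the seeded generator here). It reads part a) as the birthday problem, i.e. the probability that at least two of the N molecules share a well, which gives P(N) = 1 - prod_{k=0}^{N-1} (384 - k)/384. For part d) it assumes the standard 16 x 24 layout of a 384-well plate and takes "probability ratio 1/3" to mean that each edge well is one third as likely as each center well; neither is stated in the question. Because "half of the molecules" is ambiguous, the sketch reports both "exactly half" and "at least half". The function names, seed, and trial counts are illustrative choices.

```python
import itertools
import random

WELLS = 384  # wells on the plate, as stated in the question


def p_shared_well(n, wells=WELLS):
    """Closed form for (a): 1 - prod_{k=0}^{n-1} (wells - k) / wells."""
    if n > wells:
        return 1.0  # more molecules than wells forces a shared well
    p_all_distinct = 1.0
    for k in range(n):
        p_all_distinct *= (wells - k) / wells
    return 1.0 - p_all_distinct


def simulate_shared_well(n, trials=20_000, wells=WELLS, seed=2024):
    """Monte Carlo estimate for (c); the fixed seed makes reruns identical."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        drawn = [rng.randrange(wells) for _ in range(n)]
        if len(set(drawn)) < n:  # some well holds two or more molecules
            hits += 1
    return hits / trials


def simulate_center_counts(n=30, trials=20_000, rows=16, cols=24, seed=2024):
    """Sketch for (d): returns (P(exactly half in center), P(at least half)).

    Assumes a rows x cols grid whose outer ring is the "edge" (an assumption).
    """
    rng = random.Random(seed)
    wells = [(r, c) for r in range(rows) for c in range(cols)]
    is_edge = [r in (0, rows - 1) or c in (0, cols - 1) for r, c in wells]
    # An edge well is one third as likely as a center well: weights 1 and 3.
    cum_weights = list(itertools.accumulate(1 if e else 3 for e in is_edge))
    indices = range(len(wells))
    exactly = at_least = 0
    for _ in range(trials):
        picks = rng.choices(indices, cum_weights=cum_weights, k=n)
        in_center = sum(1 for i in picks if not is_edge[i])
        exactly += (2 * in_center == n)
        at_least += (2 * in_center >= n)
    return exactly / trials, at_least / trials


if __name__ == "__main__":
    # (b) tabulate the closed form; a plot would use the same values.
    for n in (2, 10, 20, 30, 50, 100):
        print(n, round(p_shared_well(n), 4))
    print("simulated, N=30:", simulate_shared_well(30))
    print("center wells, N=30:", simulate_center_counts())
```

Under the 16 x 24 assumption there are 76 edge wells and 308 center wells, so each molecule lands in a center well with probability 924/1000 = 0.924. The number of molecules in the center is then Binomial(30, 0.924), which means "exactly half" is extremely rare and a plain simulation will usually report zero for it.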
2)
For a given SNP with alleles a and A, the minor allele frequency is [symbol missing in the archived copy]. Assume that this frequency is the same for both males and females, that there is no migration in or out of the population, and that there is no selective advantage for either allele. The proportions of these alleles are stable in the population over time. Denote the possible genotype states by aa=1, Aa=2, and AA=3. The evolution of a population over time considering this SNP alone can be described, e.g., as follows: in the first step, a female is of genotype aa, so it is in state 1 ([notation missing]). In the next step, a mate is selected at random and one or more daughters are produced, the eldest of whom has genotype [missing]. In the following step, this daughter selects a mate at random and produces an eldest daughter with genotype [missing], and so on.
a) Calculate the transition matrix for the above Markov chain.
b) Show that this chain is ergodic. What is the smallest number of iterations, N, for which the N-th power of the transition matrix is strictly positive?
c) According to the Hardy-Weinberg law, this chain is supposed to have a steady-state distribution σ = [entries missing]. Does this match the calculated steady state?
d) For [value missing], simulate this chain for n=100,000 iterations and compare the sampling distribution of the simulated states with the one from the Hardy-Weinberg vector. Show your code and include statements to ensure exact reproducibility of your simulation.
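A minimal sketch for Question 2, again in Python rather than R. The symbols were lost from the question as posted, so this assumes that p denotes the frequency of allele a and q = 1 - p the frequency of A, and it uses p = 0.3 purely as a placeholder value. Under random mating the mother passes one of her alleles and the mate contributes a with probability p, which gives the mother-to-daughter transition matrix over (aa, Aa, AA):

    P = [ p    q    0   ]
        [ p/2  1/2  q/2 ]
        [ 0    p    q   ]

P has two zero entries, but every entry of P^2 is positive when 0 < p < 1, so the chain is regular (hence ergodic) with N = 2, and the Hardy-Weinberg vector [p^2, 2pq, q^2] satisfies σP = σ. All function names below are illustrative.

```python
import random


def transition_matrix(p):
    """Mother-to-eldest-daughter transitions over states (aa, Aa, AA)."""
    q = 1.0 - p
    return [
        [p, q, 0.0],          # aa mother passes a; the mate supplies a or A
        [p / 2, 0.5, q / 2],  # Aa mother passes a or A with probability 1/2
        [0.0, p, q],          # AA mother passes A
    ]


def mat_mul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def smallest_positive_power(P, max_power=10):
    """Smallest N such that every entry of P^N is > 0, else None."""
    power = P
    for n in range(1, max_power + 1):
        if all(x > 0 for row in power for x in row):
            return n
        power = mat_mul(power, P)
    return None


def hardy_weinberg(p):
    """Hardy-Weinberg genotype proportions for (aa, Aa, AA)."""
    q = 1.0 - p
    return [p * p, 2 * p * q, q * q]


def simulate_chain(p, n=100_000, start=0, seed=12345):
    """Run the chain for n steps from state `start`; return state frequencies."""
    rng = random.Random(seed)  # fixed seed for exact reproducibility
    P = transition_matrix(p)
    states = (0, 1, 2)
    counts = [0, 0, 0]
    state = start
    for _ in range(n):
        state = rng.choices(states, weights=P[state])[0]
        counts[state] += 1
    return [c / n for c in counts]


if __name__ == "__main__":
    p = 0.3  # placeholder; the value in the original question is missing
    print("N =", smallest_positive_power(transition_matrix(p)))
    print("Hardy-Weinberg:", hardy_weinberg(p))
    print("simulated:     ", simulate_chain(p))
```

States are indexed 0, 1, 2 in the code, corresponding to states 1, 2, 3 in the question.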
3)
A certain molecular analyte, comprised of long quasi-linear macromolecules with approximately constant length l, is analyzed with the help of a specialized detector. This detector is assembled in the form of many parallel long strips, each strip of width L. If the macromolecules are randomly distributed across the surface of the detector, what is the probability that such a macromolecule would cross the boundary between two strips? (Assume that l is smaller than L and ignore any "edge effect", i.e., assume the detector has a large surface.)
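Question 3 has the same setup as Buffon's needle, provided each macromolecule is treated as a rigid straight segment of length l with a uniformly random position and orientation (an idealization of "quasi-linear"). Let x be the distance from the segment's midpoint to the nearest boundary, uniform on [0, L/2], and θ the acute angle between the segment and the strips, uniform on [0, π/2]. The segment crosses a boundary when x <= (l/2) sin θ, and integrating over θ gives P = 2l / (πL). The Python sketch below checks that formula by simulation; the function names, seed, and trial count are illustrative.

```python
import math
import random


def p_cross_exact(l, L):
    """Buffon's needle result for a segment of length l <= strip width L."""
    return 2.0 * l / (math.pi * L)


def p_cross_simulated(l, L, trials=200_000, seed=7):
    """Monte Carlo estimate of the crossing probability with a fixed seed."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        x = rng.uniform(0.0, L / 2)            # midpoint to nearest boundary
        theta = rng.uniform(0.0, math.pi / 2)  # acute angle to the strips
        if x <= (l / 2) * math.sin(theta):     # segment reaches the boundary
            hits += 1
    return hits / trials


if __name__ == "__main__":
    l, L = 1.0, 2.0  # example lengths with l < L, chosen for illustration
    print("exact:    ", p_cross_exact(l, L))
    print("simulated:", p_cross_simulated(l, L))
```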
x******0
Posts: 1490
2
Sorry to embarrass myself in front of everyone.
d*******1
Posts: 854
3
You are better off not taking this job anyway.

t******s
Posts: 55
4
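I have seen lazy, but never this lazy.
Posting a company's interview questions because you cannot solve them is one thing, but you could at least have reworded them; instead you copied them verbatim.
When you received the questions you also agreed not to share them. If someone at the company finds a group of Chinese people discussing them, do you think the company will keep hiring Chinese candidates? And there are quite a few Chinese employees at this company.
I suggest that you or the moderator delete this post.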