This page is an excerpt and archive of the corresponding thread on MITBBS (未名空间).
DataSciences board - A few basic questions about being a data scientist
y****a
Posts: 536
1
My master's degree is in statistics, and I'd like to start learning about data
science step by step. Could the experts here give me some basic ideas? Thanks.
In data science jobs, how big are the data sets you usually work on (how many
parameters/fields do they contain)?
Are there dirty data problems? Is there sampling bias? How do you deal with
those problems, and is there anything in particular I should learn?
How do you usually get your data? What languages or tools have you used for
extraction?
What do you use to clean and/or analyze the data? Which quantitative methods
are typically used?
d****n
Posts: 12461
2
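High-dimensional, sparse, with dimensions that keep changing, or with no simple definition of dimensions at all (for example, SN data).
Typically somewhere between a few thousand and a few M; anything beyond a few M is an engineering problem.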
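
To make "high-dimensional, sparse, with changing dimensions" concrete, here is a minimal Python sketch (an illustration only; the records and field names are hypothetical and do not come from the thread). Each record keeps only the fields it actually has, so a new field can show up at any time without a fixed schema, and the density figure shows how empty the equivalent full table would be.

```python
def to_sparse(record):
    """Keep only the fields of one record that carry a value."""
    return {k: v for k, v in record.items() if v not in (0, None, "")}


def density(rows, vocabulary):
    """Fraction of (row, feature) cells that are actually filled."""
    if not rows or not vocabulary:
        return 0.0
    filled = sum(len(r) for r in rows)
    return filled / (len(rows) * len(vocabulary))


# Hypothetical records; every field name here is made up for illustration.
raw = [
    {"clicks": 3, "country": "US", "age": 0},
    {"clicks": 0, "device": "ios"},
    {"friend_of:42": 1},  # a feature that only exists for this record
]

rows = [to_sparse(r) for r in raw]

# The set of dimensions is discovered from the data and grows as records arrive.
vocabulary = sorted({k for r in rows for k in r})

print(vocabulary)
print(round(density(rows, vocabulary), 3))
```

With real data the vocabulary can run to thousands or millions of features, which is where dictionary-style or sparse-matrix storage becomes necessary rather than merely convenient.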
m********a
Posts: 128
3
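Could you elaborate a bit, or give an example?

[d****n wrote:]
: High-dimensional, sparse, with dimensions that keep changing, or with no simple definition of dimensions at all (for example, SN data).
: Typically somewhere between a few thousand and a few M; anything beyond a few M is an engineering problem.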

E*******s
Posts: 994
4
Interviewing for Google?

[y****a wrote:]
: My master's degree is in statistics, and I'd like to start learning about data
: science step by step. Could the experts here give me some basic ideas? Thanks.
: In data science jobs, how big are the data sets you usually work on (how many
: parameters/fields do they contain)?
: Are there dirty data problems? Is there sampling bias? How do you deal with
: those problems, and is there anything in particular I should learn?
: How do you usually get your data? What languages or tools have you used for
: extraction?
: What do you use to clean and/or analyze the data? Which quantitative methods
: are typically used?
