y****a 发帖数: 536 | 1 本人master 是统计的,现在想慢慢学习一些关于data science 的东西,请大侠们给点
basic idea,谢谢
In data science related jobs, how big was the data set (how many parameters/
fields the dataset contained) you guys usually work on?
were there dirty data problems? was there sampling bias? how did you guys
solve those problems, anything need to learn particularly?
How are you used to getting your data ? What software languages have you
used for extraction?
What do you use to clean and/or analyze the data? what quantitative methods
are usually used? | d****n 发帖数: 12461 | 2 高维度,稀疏,维度不断变化,或者根本没有简单的维度定义(例如SN数据)。
一般在几千到几个M之间,超过几个M的都是工程问题。 | m********a 发帖数: 128 | 3 能elaborate一下,或者给个例子吗?
【在 d****n 的大作中提到】 : 高维度,稀疏,维度不断变化,或者根本没有简单的维度定义(例如SN数据)。 : 一般在几千到几个M之间,超过几个M的都是工程问题。
| E*******s 发帖数: 994 | 4 interviewing for google?
parameters/
methods
【在 y****a 的大作中提到】 : 本人master 是统计的,现在想慢慢学习一些关于data science 的东西,请大侠们给点 : basic idea,谢谢 : In data science related jobs, how big was the data set (how many parameters/ : fields the dataset contained) you guys usually work on? : were there dirty data problems? was there sampling bias? how did you guys : solve those problems, anything need to learn particularly? : How are you used to getting your data ? What software languages have you : used for extraction? : What do you use to clean and/or analyze the data? what quantitative methods : are usually used?
|
|