Ancestry is hiring Data Scientist/Senior Data Scientist - JobHunting board

L*********y
Posts: 1

https://www.smartrecruiters.com/Ancestry/88345635-senior-data-s
Company Description
Ancestry is the world's largest online resource for family history. We have
helped pioneer the market for online family history research, taking a
pursuit that was expensive and time-consuming and making it easy, affordable
and accessible to anyone with an interest in their family history. The
foundation of our service is an extensive collection of billions of
historical records that we have digitized, indexed and put online over the
past 17 years. These digital records and documents, combined with our
proprietary online search technologies, tools and collaboration features,
have enabled our more than two million subscribers to create over 13 billion
historical records, along with millions of DNA results to make meaningful
discoveries about the lives of their ancestors.
With over 1,400 employees around the world, we are known for our cutting-edge
technology and phenomenal innovation, and we offer a compelling and rewarding
workplace where you will thrive. We seek out passionate people to join our
mission of helping people discover, preserve and share their family history.
We invite you to explore and discover the many opportunities that await you
at Ancestry.
Job Description
The Data Mining Product team is looking for an experienced Data Scientist who
has a passion for building data products and data systems.
Key Responsibilities / Performance Requirements:
Understand existing business flows and website features, dive into the
underlying data, apply relevant Data Mining techniques and/or Machine
Learning algorithms, and propose data analytics products to improve the
website's intelligence.
Implement applicable Machine Learning or statistics-based algorithms for
prediction and optimization, and deliver the trained models to production.
Create and implement algorithms for relevant statistical inference, graph and
network analysis, and natural language processing with open source tools and
libraries.
Build and maintain code to populate HDFS/Hadoop with logs from Kafka or data
loaded from SQL production systems.
Design, build and support algorithms for data transformation, conversion, and
computation on Hadoop, Spark and other distributed Big Data systems.
Design and support effective storage and retrieval of Big Data.
Qualifications
Required Skills:
Experience with the Hadoop stack (Hive, Pig, Hadoop Streaming) and MapReduce.
Expertise in Data Mining, Machine Learning and related algorithms.
Experience building Machine Learning based data products in production.
Database experience with MySQL, MSSQL or equivalent.
Experience with HBase or a comparable NoSQL store.
Proficiency in two of the following languages: Java, Python, Scala, C++, on
Linux/Unix.
Ph.D. in Computer Science/Engineering or equivalent, plus a minimum of 2-5
years of relevant experience.
Desired:
Experience with Spark MLlib, Mahout
Familiarity with data formats and serialization: XML, JSON, Avro, Thrift,
ProtoBuf
Experience with graph frameworks, such as Giraph, Hama, GraphLab, GraphX
Experience with R and/or MatLab
Strong communication skills
Have read Tom White's "Hadoop: The Definitive Guide" and Jimmy Lin/Chris Dyer's
"Data-Intensive Text Processing with MapReduce"
To apply:
https://www.smartrecruiters.com/Ancestry/88345635-senior-data-s
or send email to
[email protected]
