L****n (posts: 3545)
PM me if interested. Posting for a friend (I'm not the hiring manager, sorry, but he can
get you an interview directly if qualified).
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Data Scientist job in Morrisville, NC
The ideal candidate is an unabashed data geek. You enjoy searching the
Internet for datasets that you can explore and mashup to tell interesting
stories. You view the skills you have at sourcing, collecting, and cleaning
data as a means to an end; you are equally interested in gleaning insights from
the data that have business application. You probably have developed these
skills through a combination of education, work experience and hobbies. If
this describes you, we are interested. You will be an integral part of a
small, cross-disciplinary team working on highly visible projects that
improve performance and grow our product suite.
Responsibilities
- Identify new data sources that will improve online targeting efforts
- Establish links across existing data sources and find new, interesting
mashups
- Work closely with statisticians to identify, design and build appropriate
datasets for complex experiments
- Coordinate data resource requirements between analytics team and
engineering team
- Develop tools and libraries that will help analytics team members more
efficiently interface with huge amounts of data
- Help develop algorithms and predictive models to solve critical business
problems
- Create informative visualizations that intuitively display large amounts
of data and/or complex relationships
- Work with product managers, engineers and analytics team members to
translate prototypes into production
Qualifications
- Highly motivated individual with degree(s) in CS or applied quantitative
field (math/statistics, economics, engineering).
- Minimum of 3-5 years working with relational databases
- Ability to write and execute complex SQL queries to extract/process data.
- Strong analytical skills
- Experience in programming with Java, Python, Perl.
- Experience with statistics software (prefer R, S-PLUS, MATLAB)
- Well versed in UNIX command line utilities (sed, awk, etc.) and experience
using these tools to clean/process very large and often very messy datasets
- Coursework or practical experience with data mining, machine learning,
building algorithms, applied math and statistics a plus
- Deep knowledge of various data sources (government, open source APIs,
point-of-sale, proprietary sources, etc.) and experience in linking them
- Experience with very large datasets a must. Knowledge of the MapReduce
framework (Hive, Pig, or other tools for accessing data in Hadoop/HBase cluster
systems) a plus