A leading startup company in the Big Data and e-commerce industry is looking for a Super Star. We are a very dynamic and fast-paced start-up with many interesting challenges and we are looking for champions to join our team.
The successful candidate will be responsible for solving highly challenging and complex problems in the data science, analysis and algorithms fields. You will help us discover the information hidden in vast amounts of data, and help us make smarter decisions to deliver even better products with major impact on the company success.
Discover the information hidden in vast amounts of data, and help us make smarter decisions to deliver even better products.
Applying data mining techniques, doing statistical analysis, and building high quality prediction systems integrated with our products.
Suggest and develop innovative analytic methods that result in a technically superior product and/or create a competitive advantage, as well as meet design requirements and project timeline.
Lead research, evaluation, and recommendation of internal and external data sources and coordinate with data resources.
Serve as the senior technical expert for leadership on data cleansing, variable creation, variable transformation, etc., as well as best-practices in the creation of analytic datasets.
Serve as the consummate technical expert in model development and validation analyses - from driving pragmatic practice of methods to devising novel solutions and diagnostic measures, to coaching and mentoring junior staff
Provide significant input to Product Management on implementation specifications and production testing
Review reports and make recommendations for needed model refits/enhancements.
Keep abreast of business trends/product needs.
1+ years of professional experience building predictive and descriptive statistical models.
Graduate degree (M.S. required, Ph.D. preferred) in a quantitative discipline.
Extensive experience working with large datasets.
Knowledge, experience, and expertise in diverse statistical and data mining techniques (e.g. - GLM/Regression, Boosting, Random Forest, Trees, Clustering, PCA, SVM, Neural Networks, Deep Learning, etc).
Demonstrated proficiency with statistical packages in Spark ML-Lib, Python, R, or SAS - a must.
Understanding of RDBMs and interactive SQL programming skills- a must.
Ability to program in Python, Scala, R, or SAS – a big advantage
Experience with Big Data technologies like Hadoop, Spark, Hive, NoSQL, etc., and Cloud technologies (AWS, Azure, etc.) - a big advantage.