This is a data science position on our internal data team, which is made up of data engineers, data analysts, business intelligence engineers, and data scientists. The role partners with our product teams to build new machine learning models that predict several Key Performance Indicators (KPIs) and drive revenue and growth across our business. It may also involve several Proof-of-Concept (POC) projects that apply Natural Language Processing (NLP) algorithms to improve productivity throughout our organization.
Responsibilities
Collaborate with teams across the organization to understand business objectives and translate them into requirements for innovative proof-of-concept projects
Work with complex data structures that require you to transform and cleanse data using SQL and Python
Develop and deliver machine learning models and predictive analyses
Work closely with engineering and QA to deploy models into production environments using sound DevOps practices
Develop test cases and hypothesis-validation frameworks (feature importance tests, cross-validation, A/B tests, etc.)
Perform statistical analysis and hypothesis testing (e.g., chi-square tests, ANOVA, z-tests, t-tests)
Support existing projects owned by the data science team
Work both independently and as part of a broader data team to achieve team objectives
Requirements
Mandatory Requirements
3+ years of proven experience in a Data Science, Machine Learning, or NLP role
Deep knowledge of math, probability, statistics, and algorithms
Strong programming experience in Python and SQL
Deep knowledge of data manipulation libraries such as pandas, NumPy, and Spark
Proficiency in machine learning techniques and libraries (e.g., scikit-learn, TensorFlow, PyTorch), as well as underlying concepts such as regression, classification, decision trees, random forests, and clustering
Understanding of NLP and text representation techniques, including common NLP models and methods such as BERT, n-grams, sentiment analysis, and semantic extraction
Experience using Natural Language Generation (NLG) models such as HMMs, LSTMs, Transformers, and autoencoders
Ability to translate data analysis into clear and concise actionable insights
Ability to communicate complex technical ideas effectively to audiences of varying technical backgrounds
Experience working with Git or other version control systems
Good to Have
Familiarity with cloud platforms (AWS, Azure)
Knowledge of Linux and shell scripting