We are looking for a Data Scientist who will support our product, sales, leadership and marketing teams with insights gained from analyzing company data.
The ideal candidate is adept at using large data sets to find opportunities for product, sales and process optimization and using models to test the effectiveness of different courses of action.
They must have strong experience using a variety of data analysis methods, building and implementing models and using / creating appropriate algorithms.
Requirements : Experience using statistical computer languages (R, Python, etc.) to manipulate data and draw insights from large data sets.
Process, cleanse, and verify the integrity of data used for analysis.Knowledge of a variety of machine learning techniques (clustering, decision tree learning, artificial neural networks) and their real-
world advantages / drawbacks.Experience with messy real-world data handling missing / incomplete / inaccurate data.Understanding of a broad set of Algorithms and Applied Math.
Good at problem solving, probability and statistics and knowledge of advanced statistical techniques and concepts (regression, properties of distributions, statistical tests and proper usage) and experience with applications.
Knowledge of data scraping is preferable.Comfortable manipulating and analyzing complex, high-volume high-dimensionality data from varying, heterogeneous sources.
At least 1+ years experience writing production-quality Python code.Machine learning frameworks : TensorFlow, Torch, Caffe.
Version control : Git, GitHub / Bitbucket.Good to have : Good verbal and written communication skills.Strong understanding of relational databases like PostgreSQL.
Experience with big data tools (Hadoop, Hive, MapReduce) a plus.