Expert in cleansing data, removing anomalies, selecting features, reducing dimensions, building / optimizing classifiers, correlating hard business / technology generated data with social media data (like tweets / blogs / posts / etc.
to discover hidden or unseen data or trend from vast amounts of data by applying various data mining techniques, statistical analysis and thus building high quality prediction systems.
Excellent understanding of machine and deep learning concepts / techniques like k-NN, SVM, Decision Forests, linear regression models, gradient descent, etc.
Experience in using data science tools like R, Python, Matlab / Octave (at least one of them)
Knowledge of NoSQL databases such as MongoDB, Cassandra, Hbase, etc.
Knowledge of Hadoop stack, Spark, Map-Reduce, Hive, Pig, Shell Scripting and familiarity Unix / Linux OS.
Good understanding of relational databases with ability to write / modify complex SQL queries to generate required datasets and tune query performance.
Understands dimensional data model, logical data model and physical data model for analytics and reporting. Understands data design that supports integration of data and information flow across various applications, systems and platforms.
Strong analytical and problem solving skills. Should provide solutions to complex problems without known solutions.
Excellent communication skills and capability to effectively work with both Business and Technology teams.
Qualification - Degree in applied math, statistics, engineering, computer science or other quantitative field required.
PhD or MS / MTech preferred.
BE / BTECH in MIS, CS or related field. 1+ years of technology experience as a Data Scientist