Responsibilities : Build real-time and batch analytics platform for analytics & machine-learning.Design, propose and develop solutions keeping the growing scale & business requirements in mind.
As an integral part of the Data Engineering team, be involved in the entire development lifecycle from conceptualisation to architecture to coding to unit testing.
Help us design the Data Model for our data warehouse and other data engineering solutions.Requirements : Deep understanding of real-
time as well as batch processing big data solutions (Spark, Storm, Kafka, KSql, Flink, MapReduce, Yarn, Hive, HDFS, Pig etc).
Extensive experience developing applications that work with NoSQL stores (e.g.,Elastic Search, HBase, Cassandra, MongoDB).
Understands Data very well and has fair Data Modelling experience.Proven programming experience in Java or Scala.Experience in gathering and processing raw data at scale including writing scripts, web scraping, calling APIs, writing SQL queries, etc.
Experience in cloud based data stores like Redshift and Big Query is an advantage.Previous experience in a high-growth tech startup would be an advantage.
Skills : - Hadoop, Apache Kafka, Spark, Web Scraping, NOSQL Databases, Java, Data modeling and MongoDB