Principal Data Engineer - Data Pipeline/Warehousing/Mining (8-12 Yrs) Bangalore (Systems/Product Software)
Talent Acceleration Corridor
Karnataka, India
2d ago
source : Findojobs

We are scouting for a Principal Data Engineer for one of our IT MNC clients.

Responsibilities :
- Lead large data initiatives from start to launch, from data acquisition to analysis
- Mentor the data engineering team on technology standards, design and domain

Architecture :
- Collaborate with architecture to develop technical / product architecture for the data platform
- Validate data architecture against business requirements
- Predict / map / leverage market trends to Blackhawk's data vision

Technology :
- Discover, construct and test data acquisition pipelines
- Employ a variety of data acquisition and processing tools to build data pipelines
- Leverage and process large volumes of data from internal and external sources to answer key business questions
- Identify ways to improve data reliability, efficiency, and quality

Business :
- Construct storyboards and visual dashboards for quantitative business analysis
- Be the technology ambassador for adoption of the data platform and tools
- Champion the voice of the customer within the data organization

Required :
- 5 years of experience in data processing tools (Hadoop, Spark, Python, Pig)
- 8 years of experience in data warehousing, data mining, data engineering, and modelling in large-scale data processing environments
- Prior experience in implementing a large-scale data lake
- 3-5 years of experience with Amazon or Google data processing technologies
- Experience with ETL, reporting and analytical tools
- 8 years of experience in software development, 5 years working with data processing
- Knowledge of Java 8 or higher
- Strong knowledge of and experience in most of the following AWS services : EC2, Lambda, SQS, Kinesis, S3, CloudFormation, CLI, CloudWatch
- Basic experience working with Amazon EMR & Spark
- Strong knowledge of SQL, experience in query performance optimization
- Experience working with column-based databases like Redshift
- Passion for proactively learning new technologies and sharing them with the team

Preferred :
- Build learning systems
- Implement and optimize machine learning models recommended by data scientists
- Prepare data sets for use in modelling
- Experience in building real-time ETL processes
- Previous experience in building a data lake solution
- Experience in administration of AWS services
- Experience developing in Scala, Node.js, Python
- DevOps experience : Linux, Jenkins, Docker, Kubernetes (ref :
