Data Engineer at Mirabel Technologies should be an avid programmer of Java, Python or R with expertise in implementing complex algorithms.
You will also be responsible for integrating them with the architecture used across the company in various products. Skillset : 1.
Proficient understanding of distributed computing principles. 2. Ability to build, run and manage data in large clusters.
3. Knowledge on Hadoop, MapReduce, HDFS 4. Strong proficiency in writing scalable algorithms using Java, Python 5. Strong experience in REST API back-
end services. 6. Hands on experience in Large Scale Web Crawling using frameworks like Scrapy, Nutch and custom crawling solutions 7.
Strong experience in search engine tools like Apache Solr, ELK Stack. 8. Hands on working experience in NoSQL databases such as MongoDB, GraphDB and Cassandra.
9. Knowledge on various ETL techniques and frameworks such as Flume. 10. Experience with NLP tools and systems for POS, NER and Information extraction.
11. Experience with ML libraries including scikit-learn, NLTK, NumPy and Pandas. 12. Experience with Machine Learning - Regression, Classification, Decision Trees.
13. Strong understanding of Linux, AWS and networking fundamentals. 14. Problem Solver - Able to work independently and be comfortable with deadlines and milestones.