Work on solving cutting edge problems in Deep Learning for Internal Nervana SW Stack
Opportunity to develop DL software stack for innovative DL accelerator - Intel Neural Network Processor
Contribute to optimizations in Graph compiler and Runtime System SW stack
Profile, analyse and optimizing performance of Deep Learning System SW stack.
Opportunity to participate in all phases of Software Development Life Cycle (SDLC) : Design, Develop, Debugging, Validation and Deployment.
MS or PhD in CS / ECE or related fields with 5+ years of relevant experience in AI / Deep Learning
Proven work experience in Modern C++ (C++11 / 14), minimum 5 years; exceptional coding skills
Experience in full stack development and performance engineering.
Hands on experience in multi-threaded applications and distributed programming
Familiarity with Python and C++ / Python interoperability
Experience with DL frameworks (eg, Tensorflow, PyTorch, Caffe)
Knowledge of GPGPU programming technologies
Knowledge of compiler optimization techniques and LLVM is a plus.