Principal Data Scientist-19000BCF
The Oracle CEGBU offers customers the industry’s most advanced solutions for planning, scheduling and delivering large-scale construction projects. We provide an end-to-end offering for project management and delivery that enables customers to effectively plan, build, and operate construction projects.
The Analytics & Data Science team at Oracle CEGBU is working to develop and deploy AI and Machine Learning based solutions at scale and throughout all of Oracle's CEGBU existing products and services. We are growing the team with brilliant and diverse individuals with exceptional technical ability. This is a challenging role that will stretch your knowledge and curiosity, while at the same time is a great opportunity to learn new skills and work within an unusually talented, global community at Oracle CEGBU.
This is a hands-on position where you will be expected to solve challenging problems and have the potential to directly impact Oracle’s CEGBU data strategy and business. The role requires that you have an extensive background in prototyping, modelling, model validation, production rollout at scale and post rollout improvements of machine learning based solutions. A proven track record in inventing and modifying advanced innovative algorithms and applying them to large data sets is essential.
This is a leadership role where you will lead a team of local and overseas data scientists. You will be a team player who is eager to both teach and learn daily, that is proactive and self-motivated and has excellent communication skills.
Responsibilities Works independently and with a small team to solve complex problems and create scalable models/algorithms that will be integrated into proprietary tools and products. Works directly with product managers & senior leadership to translate their vision into practical solutions Effectively communicate what is being worked on, problems being solved, customer impact, and progress of projects from time to time to various stakeholders Participates in industry forums to showcase the analytical depth of the organization Actively participate as a contributor and lead a team of peer data scientists, understanding the collaborative and transparent relationships with engineering and product teams and the ways of working of an agile environment. Proven track record in shipping successful data products Clearly communicate roadmap, backlog, and team updates across the organization Leads collaborative processes with cross-functional stakeholders to identify questions and complex business challenges and determine concrete plans of action in order to strategically define, design, and develop sophisticated machine learning models and algorithms to solve for each problem. Proactively identify and develop expertise in new technologies, methodologies, and techniques facilitating data science and systems engineering Identify predictive analytics opportunities to solve customer business problems and drive value Complete end-to-end execution of the data science process. This may be carried out in a collaborative environment with product and engineering teams, but ranges from understanding business requirements, data discovery and extraction, model development and evaluation, to production pipeline implementation.
Required Skills & Experience Masters, M.S or Ph.D. in a relevant technical field, or practical experience in a relevant discipline such as Computer Science, Physics, Engineering, Mathematics, or another relevant quantitative field. Overall 3+ years leading, building, mentoring data science teams Overall 6-8+ years of experience in data science Exceptionally proficient with Artificial Intelligence/Machine Learning/Data Mining/Natural Language Processing/Pattern Recognition/Computer Vision. Strong understanding of statistical modelling and its application to solving business problems Strong experience in building at scale, production grade machine learning solutions and data pipelines. Highly proficient in languages and tools used in ML modeling like R, Python (SciKit Learn, SciPy, Numpy, etc.), Apache Spark (Scala or Python), H2O, Weka, TensorFlow, Torch, Keras. Extensive experience in building and rolling out scoring models, response models, optimization, forecasting, segmentation etc. Experience with cloud infrastructure and deployments is a plus Experience with horizontally scalable data stores such as Hadoop and other NoSQL technologies such as Map Reduce, Spark, HBase, etc., and associated schemas. Strong skills in data management approaches such as relational databases, data schemas, object stores, column stores, triple stores, graph stores, and/or document stores Ability to deliver accurate work products in a cross-functional matrix environment with product teams, engineering teams and business stakeholders. Excellent technical design, problem solving, debugging and communication skills
Detailed Description and Job Requirements
Designs, develops and programs methods, processes, and systems to consolidate and analyze unstructured, diverse “big data” sources to generate actionable insights and solutions for client services and product enhancement.
Interacts with product and service teams to identify questions and issues for data analysis and experiments. Develops and codes software programs, algorithms and automated processes to cleanse, integrate and evaluate large datasets from multiple disparate sources. Identifies meaningful insights from large data and metadata sources; interprets and communicates insights and findings from analysis and experiments to product, service, and business managers.
Leading contributor individually and as a team member, providing direction and mentoring to others. Work is non-routine and very complex, involving the application of advanced technical/business skills in area of specialization. 8 years relevant work experience. BS/BA preferred.