Position : Lead Software Engineer
Department : Content Engineering
The Content Collection Services team specializes in the development and maintenance of financial data research applications used by internal research analysts globally for the collection of our proprietary content.
It is composed of highly talented and motivated individuals. The team is responsible for developing and enhancing tools that solves problems on the production floor with specific focus on Timeliness, Accuracy, Content, Coverage and usability, along with productivity of the operations.
The team also develops common products that are integrated with many other tools as well.
Design and implement data platform architecture providing access to large datasets, data ingestion pipelines and data infrastructure
Explore and evaluate new data technologies to build a scalable, cloud-oriented data platform
Make large and / or complex data more accessible, understandable and usable by implementing advanced APIs for storage and querying.
Create unified enterprise data models for analytics, mining and reporting
Interface with various teams to support their needs (cognitive computing, content, other engineering teams)
Collaborate with engineering, cloud infrastructure and security teams to understand the requirements and develop highly scalable system
As part of Agile development team contribute to architecture, tools and development process improvements.
12+ years of overall industry experience.
Bachelor or master’s degree in computer science, Math, or Engineering
5+ years of working experience in software development
3+ years of relevant experience in one of the following areas : Data engineering, business intelligence or business analytics
2+ years of experience in building data ingestion and ETL pipelines / supporting a large data
platform and data pipelining
Experience with modern data platforms such as Spark, Hadoop or other map / reduce big data systems and services
Experience with a variety of data stores such as ArangoDB, MongoDB, Cassandra, HBase, DynamoDB
Experience with AWS environment e.g., Amazon Kinesis, AWS Glue, Redshift, EC2, S3, EMR
Strong experience and proficiency with Python, Pandas, Numpy and AWS APIs
Experience with Github-based development processes
Experience with Agile / Scrum development
Expertise in database design and architectural principles and methodologies
Organized, self-directed, and resourceful with the ability to appropriately prioritize work in a fastpaced environment
HIGHLY DESIRED :
Knowledge of Unix-based operating systems