Who You'll Work With
You’ll work with the generative AI team focused on AI platforms or products. Our team is distributed across multiple locations, including offices in North America, Europe, and Latin America.
The generative AI team is dedicated to advancing the capabilities and applications of data retrieval and Generative AI within our firm, driving innovation, and delivering impactful AI solutions.
Your Impact
We are looking for a data engineer with expertise in Python development, who is passionate about cloud- based data engineering using AWS services and loves to build data solutions as part of multi-disciplinary team.
You'll be working closely with digital product professionals, data scientists, cloud engineers and others.
You’ll be a member of a global team working on generative AI initiatives that is part of our McKinsey’s Tech Ecosystem function responsible for developing and delivering all technology solutions for the firm’s internal use. You will work in a platform or product team with engineers, designers and product managers doing application development with an with the purpose delivering impactful AI solutions leveraging the best of the data within the firm.
You will work in a team of data engineers to develop data ingestion pipelines, create and mature data processing capabilities that ingest data into a data system used by generative AI applications.
Your work would include but won't be limited to creation of the python code, tests, creation and modification of GitHub Action CICD pipelines, working with AWS-based infrastructure and docker containers.
Your Qualifications and Skills
- 3+ years of professional experience as a data engineer, with a strong focus on cloud- based data engineering using AWS services
- Expertise with Python development
- Practicing high coding standards with clean code, modularity, error handling, testing automation and more
- Strong experience with relational databases
- Very driven, super strong on execution and output orientation, likes to get job done attitude and ability to figure things out independently; able to work in complex and very fast paced environment
- Hands-on experience with Docker
- Solid and demonstrable background in data pipeline performance and diagnostics
- Interest in generative AI and other ML topics
- Kedro framework experience as a plus
- Holds their ground, opinionated, not afraid to speak up at any level
- Familiarity with agile principles and product development
- Excellent problem-solving skills and the ability to analyze and resolve complex data engineering challenges
- Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment