Mindteck

Data Engineer (Python, Spark)

Columbus, OH, US

8 months ago

Save Job

Summary

Duties and responsibilities

Collaborate with the team to build out features for the data platform and consolidate data

assets

Build, maintain and optimize data pipelines built using Spark
Advise, consult, and coach other data professionals on standards and practices
Work with the team to define company data assets
Migrate CMS' data platform into Chase's environment
Partner with business analysts and solutions architects to develop technical

architectures for strategic enterprise projects and initiatives

Build libraries to standardize how we process data
Loves to teach and learn, and knows that continuous learning is the cornerstone of every

successful engineer

Has a solid understanding of AWS tools such as EMR or Glue, their pros and cons and

is able to intelligently convey such knowledge

Implement automation on applicable processes

Mandatory Skills

5
years of experience in a data engineering position
Proficiency is Python (or similar) and SQL
Strong experience building data pipelines with Spark
Strong verbal & written communication
Strong analytical and problem solving skills
Experience with relational datastores, NoSQL datastores and cloud object stores
Experience building data processing infrastructure in AWS
Bonus: Experience with infrastructure as code solutions, preferably Terraform
Bonus: Cloud certification
Bonus: Production experience with ACID compliant formats such as Hudi, Iceberg or

Delta Lake

Bonus: Familiar with data observability solutions, data governance frameworks

Requirements

Bachelor's Degree in Computer Science/Programming or similar is preferred

Right to work

Must have legal right to work in the USA

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

MORE JOBS LIKE THIS

People also searched: