hireio

Senior Data Engineer (Python/Pyspark)

Mountain View, CA, US

about 1 month ago
Save Job

Summary

Responsibilities: Participate in all aspects data platform Writing production level Python and Pyspark for ETL pipelines Data processing, validation, cleaning, and debugging Using AWS services and technologies for application deployments to data APIs Qualifications: Experience creating an automated ETL to process data using python, pyspark, SQL Demonstrable deep understanding of healthcare data. Experience with building a production grade ETL pipeline Experience with AWS stacks Experience setting up and executing jobs through AWS API with EMR, Glue Experience with processing large data sets (over 1 GB) Experience with data validation process Experience with shell scripting Experience processing EHR and healthcare claims data Software Engineering

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job