senior AWS Cloud Developer –
Onsite (Denver, CO) – 1 Position
Role Summary:
This role supports the onsite architect in developing PySpark-based jobs, managing EMR clusters, and coordinating with the offshore team to ensure technical consistency across the board.
Key Responsibilities:
Develop and validate PySpark jobs on AWS EMR with Iceberg table formats.
Implement transformation pipelines based on legacy Cloudera HiveQL logic.
Troubleshoot performance issues in EMR and optimize job configurations.
Create test frameworks and scripts to compare outputs from AWS and Cloudera during parallel runs.
Review offshore deliverables and support technical onboarding of cloud services.
Required Skills:
6+ years in big data development with 3+ years on AWS.
Strong knowledge of EMR, S3, Iceberg, PySpark, SQL, MWAA.
Hands-on with migration scenarios from on-prem HDFS to S3.
Experience with performance tuning in EMR and S3 I/O optimization.