Collaborate with engineers and business customers to understand data needs, capture requirements and is able to Analyze and organize raw data
Able to Conduct data analysis, create data models Design and implement data pipelines using SQL, Airflow and DBT
Is able to Develop pipelines and after deployment perform pipeline validations and monitoring
On-call support. Flexibly react to OpsGenie alerts, find root cause of an issue, determine the most appropriate course of action and proceed with implementation.
Own all documentation including all work being done and end users instructions
Skills Required:
AWS/Glue - intermediate to advanced
Job Orchestration – AWS Step Functions
Python/Pyspark - basic ability to implement a simple algorithm
Snowflake – Snow SQL- understanding concepts, hands-on implementation of pipelines
Cloud computing - Understanding concepts, hands-on experience with at least one major provider
Analytical skills - ability to analyze a problem, come up with an approach to solve the problem, discuss potential trade-offs in using different approaches