We are seeking a skilled Lead Site Reliability Engineer to join our team. As a Lead SRE, you will work to ensure that our systems, services, and applications running on Google Cloud Platform (GCP) are reliable, performant, and scalable. The ideal candidate will possess strong technical skills, have a passion for automation and infrastructure-as-code, and thrive in a collaborative team environment.
Responsibilities
Participate in on-call rotations and provide 24/7 support for critical systems
Respond to alerts of running services and applications, conducting RCA
Deploy microservices according to release cadence
Design, implement, and maintain scalable and reliable systems and applications on Google Cloud Platform (GCP)
Develop and maintain infrastructure as code using Terraform
Collaborate with engineering teams to identify and prioritize reliability, performance improvements, and right-sizing of dedicated cloud resources
Participate in incident management and response using ServiceNow
Manage and resolve technical issues and tickets using Jira
Develop a knowledge base for maintaining existing infrastructure and monitoring services
Requirements
8+ years of experience in an SRE, DevOps, or system administration role
Deep knowledge of Google Cloud Platform (GCP)
Experience with incident management and response using ServiceNow or similar tools
Strong problem-solving skills and experience in debugging complex technical issues
Understanding of monitoring, logging, and alerting systems (preferably Cloud Monitoring)
Familiarity with version control using GitHub
Experience with infrastructure-as-code
Excellent communication and collaboration skills
Experience with Kubernetes and containerization technologies
Experience with Terraform for infrastructure-as-code
Strong understanding of the Software Development Life Cycle (SDLC) and CI/CD pipelines, and experience with CI/CD tools
Experience with monitoring and logging tools like Prometheus, Grafana, Catchpoint, and ELK
How strong is your resume?
Upload your resume and get feedback from our expert to help land this job
How strong is your resume?
Upload your resume and get feedback from our expert to help land this job