Experience with cloud platforms (AWS, GCP). Use Infrastructure as Code (IaC) tools such as Terraform or CloudFormation to manage cloud infrastructure. Hands-on experience with at least one of these cloud platforms is mandatory.
Cloud (AWS/GCP), DevOps, and CI/CD tools. JavaScript or TypeScript.
Common Soft Skills:
Experience in independently executing customer-facing roles, understanding SRE requirements, assisting in team building, and driving implementation.
Hands-on experience working on RFPs and proposals.
Excellent communication and business presentation skills.
Must-Have Skills:
Monitoring, observability, OpenTelemetry: Using tools like Splunk, AppDynamics, Prometheus, Fluentd, ELK (Elasticsearch, Logstash, Kibana), TIG (Telegraf, InfluxDB, Grafana), Datadog, New Relic.
Concepts of SLI, SLO, SLA, error budgets, and toil: Define SLIs (Service Level Indicators) and SLOs (Service Level Objectives), and set error budgets.
Write complex PromQL or similar queries for dashboards. Prepare SLA compliance monitoring dashboards.
Software Engineering and Development skills: .NET, Go, Python, C++, Ruby or Java, or software delivery platforms such as Puppet, Chef, Ansible, and/or Spinnaker. Ability to instrument services and write exporters, collectors, etc. (60% or above exposure to any one coding language is a must).
Experience with building and running microservices at scale, REST API integration, detailed solution design.
Optional Skills:
Incident Management Framework, L1/L2/L3: Facilitate blameless post-mortems to identify root causes of incidents and implement preventative measures.
Experience with Kubernetes. Good experience with automation across applications/services and infrastructure management.