Talent Worx

Site Reliability Engineer

Hyderabad, TS, IN

4 days ago
Save Job

Summary

Site Reliability Engineer (SRE)

At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between development and operations, contributing to the design, scalability, and performance optimization of our infrastructure.

Requirements

Key Responsibilities:

  • Ensure the reliability, availability, and performance of production systems
  • Develop and maintain monitoring, alerting, and incident response systems
  • Automate routine tasks and improve system performance using scripting and programming
  • Create and promote best practices for operational efficiency and reliability
  • Collaborate closely with development teams to enhance system designs for reliability and monitoring
  • Perform root cause analysis to resolve production incidents effectively
  • Contribute to capacity planning and performance analysis activities
  • Document processes, architectures, and troubleshooting steps for team knowledge-sharing

Required Skills and Qualifications:

  • 5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role
  • Strong experience with cloud service providers (AWS, Azure, Google Cloud)
  • Proficiency in programming/scripting languages such as Python, Go, or Ruby
  • Experience with container orchestration platforms (e.g., Kubernetes, Docker Swarm)
  • Familiarity with configuration management tools (e.g., Ansible, Puppet, Chef)
  • Experience with monitoring tools like Prometheus, Grafana, Datadog, or similar
  • Strong problem-solving skills with a focus on automation and reliability
  • Excellent communication skills and the ability to work collaboratively in a team environment

Preferred Skills:

  • Knowledge of microservices architecture and RESTful APIs
  • Familiarity with Agile methodologies and CI/CD practices
  • Experience in disaster recovery planning and execution
  • Certification in cloud technologies or site reliability practices

Education:

  • Bachelor's degree in Computer Science, Information Technology, or equivalent experience


Benefits

Talworx is an emerging recruitment consulting and services firm, we are hiring for our Product based health care client which is a leading precision medicine company focused on guarding wellness and giving every person more time free from cancer. Founded in 2012, we're transforming patient care by providing critical insights into what drives disease through its advanced blood and tissue tests, real-world data and AI analytics.

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

People also searched: