L&T Technology Services

Site Reliability Engineer

Chennai, TN, IN

14 days ago
Summary

Responsibilities of the Site Reliability Engineer (SRE)

SREs monitor performance, collaborate with developers, and implement system improvements to prevent failures. They also enhance uptime and balance development speed with system stability.

  • Design and Implement Systems: SREs design and implement robust systems that ensure high availability and reliability. This involves creating architectures that are resilient to failures and can handle large traffic volumes.
  • Automate Operational Tasks: A key responsibility is automating repetitive operational tasks, which improves efficiency and reduces the risk of human error. This includes creating and maintaining automation scripts and tools.
  • Monitor and Maintain System Health: SREs continuously monitor system performance using various tools and dashboards. They analyze metrics, logs, and alerts to ensure systems are running smoothly and address any issues that arise.
  • Manage Incidents and Troubleshoot Issues: When incidents occur, SREs are responsible for troubleshooting and resolving issues quickly. They perform root cause analysis to prevent future occurrences and improve system resilience.
  • Meet Service Level Objectives (SLOs) and Service Level Agreements (SLAs): SREs work to meet or exceed defined SLOs and SLAs. They measure system performance against these objectives and take corrective action when performance deviates from expected levels.
  • Collaborate with Development Teams: SREs collaborate with development teams to integrate reliability best practices into the software development lifecycle. They ensure that new features and services meet reliability standards before deployment.
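
As an illustration of the SLO work described above, here is a minimal Python sketch of measuring performance against an availability objective. It assumes availability is measured as the fraction of successful requests over a reporting window. The names `SloReport` and `evaluate_slo`, their parameters, and the sample numbers are hypothetical and do not refer to any specific tool used in this role.

```python
from dataclasses import dataclass


@dataclass
class SloReport:
    availability: float      # fraction of requests that succeeded
    budget_total: float      # failed requests the SLO allows in this window
    budget_remaining: float  # allowed failures not yet consumed (negative if overspent)
    slo_met: bool            # True when measured availability meets the target


def evaluate_slo(total_requests: int, failed_requests: int, slo_target: float) -> SloReport:
    """Compare measured availability against an SLO target (hypothetical helper)."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if not 0 <= failed_requests <= total_requests:
        raise ValueError("failed_requests must be between 0 and total_requests")
    if not 0.0 < slo_target <= 1.0:
        raise ValueError("slo_target must be in the range (0, 1]")

    availability = (total_requests - failed_requests) / total_requests
    # The error budget is the share of requests the SLO permits to fail.
    budget_total = (1.0 - slo_target) * total_requests
    budget_remaining = budget_total - failed_requests
    return SloReport(
        availability=availability,
        budget_total=budget_total,
        budget_remaining=budget_remaining,
        slo_met=availability >= slo_target,
    )


if __name__ == "__main__":
    # Assumed sample window: 1,000,000 requests, 500 failures, 99.9% target.
    report = evaluate_slo(total_requests=1_000_000, failed_requests=500, slo_target=0.999)
    print(report)
```

With the assumed sample window, a 99.9% target allows about 1,000 failed requests, so 500 failures consume roughly half of the error budget. A negative `budget_remaining` would be the signal to take corrective action.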

Required Skills and Qualifications

Coding, system architecture, and proficiency with incident management systems are essential SRE competencies. Success in this position typically requires a background in computer science or a related discipline, along with experience in software development or operations.

Proficiency in Programming Languages

SREs should be proficient in programming languages such as:

  • Python
  • Bash/Shell Scripting
  • Java
  • Perl
  • C/C++
  • JavaScript
  • PowerShell (for Windows environments)
  • SQL (for database management)
