At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between development and operations, contributing to the design, scalability, and performance optimization of our infrastructure.
Requirements
Key Responsibilities:
Ensure the reliability, availability, and performance of production systems
Develop and maintain monitoring, alerting, and incident response systems
Automate routine tasks and improve system performance using scripting and programming
Create and promote best practices for operational efficiency and reliability
Collaborate closely with development teams to enhance system designs for reliability and monitoring
Perform root cause analysis to resolve production incidents effectively
Contribute to capacity planning and performance analysis activities
Document processes, architectures, and troubleshooting steps for team knowledge-sharing
Required Skills and Qualifications:
5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or in a similar role
Strong experience with cloud service providers (AWS, Azure, Google Cloud)
Proficiency in programming/scripting languages such as Python, Go, or Ruby
Experience with container orchestration platforms (e.g., Kubernetes, Docker Swarm)
Familiarity with configuration management tools (e.g., Ansible, Puppet, Chef)
Experience with monitoring tools like Prometheus, Grafana, Datadog, or similar
Strong problem-solving skills with a focus on automation and reliability
Excellent communication skills and the ability to work collaboratively in a team environment
Preferred Skills:
Knowledge of microservices architecture and RESTful APIs
Familiarity with Agile methodologies and CI/CD practices
Experience in disaster recovery planning and execution
Certification in cloud technologies or site reliability practices
Education:
Bachelor's degree in Computer Science, Information Technology, or equivalent experience
Benefits
Talworx is an emerging recruitment consulting and services firm. We are hiring for our product-based healthcare client, a leading precision medicine company focused on guarding wellness and giving every person more time free from cancer. Founded in 2012, the company is transforming patient care by providing critical insights into what drives disease through its advanced blood and tissue tests, real-world data, and AI analytics.