about the company
A fast-growing tech company based in Selangor, Malaysia
Focused on delivering scalable digital infrastructure and cloud solutions
Emphasizes innovation, collaboration, and continuous learning
Supports a modern, flexible, and growth-oriented work culture
Works with diverse clients across industries on high-impact technology projects
about the job
SRE role focused on building and maintaining reliable, scalable systems
Involves automation, incident response, and system performance tuning
Works with cloud platforms (AWS, Azure, GCP), Linux, and scripting (Python, Golang, Java)
Key responsibilities include managing SLOs/SLIs, reducing operational toil, and post-incident reviews
Requires strong collaboration across DevOps, engineering, and operations teams
Requirements (Skills)
Proficiency in scripting/programming (Python, Golang, or Java)
Strong understanding of SRE principles (SLOs, SLIs, incident management, toil reduction)
Hands-on experience with cloud platforms (AWS, Azure, or GCP)
Solid Linux system administration and troubleshooting skills
Familiarity with Kubernetes, CI/CD, and infrastructure as code practices