Tenth Revolution Group

Lead Site Reliability Engineer

West Flanders, Flanders, BE

7 days ago
Save Job

Summary

The Company:

Headquatered in belgium, this company is a global leader in manufacturing of equipment that supports sectors like logistics and construction. Their services include, tehcnical services, electronics repair, and training programs all backed by a commitment to innovation and operational excellence.


About the Role:

I am looking for a Site Reliability Engineering (SRE) Lead to take ownership of the reliability, performance, and scalability of my clients systems. You’ll play a key role in designing and implementing infrastructure solutions that enable their engineering teams to deploy faster and more confidently—without compromising stability or uptime.


As the SRE Lead, you’ll mentor a growing team of SREs, drive best practices in observability, automation, and incident management, and collaborate cross-functionally to ensure a seamless experience for both our internal teams and customers.


What You’ll Be Doing:

Leadership & Strategy

-Lead and grow a high-performing SRE team.

-Define and drive the SRE roadmap aligned with business goals.

-Advocate for a culture of reliability, automation, and continuous improvement.

System Reliability & Performance

-Own SLAs, SLOs, and error budgets for critical systems.

-Monitor system performance, diagnose issues, and implement long-term fixes.

Incident Response & Prevention

-Coordinate high-impact incident response efforts and postmortems.

-Drive root cause analysis and long-term improvements.

Tooling & Automation

-Build and enhance internal tooling to improve deployment, monitoring, and reliability.

-Implement infrastructure as code and CI/CD best practices.

Collaboration

-Work closely with engineering, security, and product teams to ensure reliability is factored into planning and development.

-Promote DevOps principles and empower teams with self-service infrastructure.


What We’re Looking For:

-Proven experience in an SRE or DevOps leadership role.

-Deep understanding of networking, containers (Docker, Kubernetes), and --cloud infrastructure (AWS/GCP/Azure).

-Strong skills in monitoring, observability, and alerting systems (Prometheus, Grafana, Datadog, etc.).

-Proficiency with infrastructure-as-code tools like Terraform or Pulumi.

-Experience with CI/CD pipelines and GitOps practices.

-Excellent communication and incident management skills.

-Passion for automation, documentation, and mentoring others.

-English - Dutch considered a plus

-Visa sponsorship not possible


Nice to Have:

-Experience with high-scale, customer-facing applications.

-Familiarity with service meshes, distributed tracing, or chaos engineering.

-Certifications in cloud or Kubernetes.


Interested to learn more, lets chat!

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

People also searched: