We are seeking a highly skilled Lead Site Reliability Engineer to join our team.
The ideal candidate will have a strong background in software engineering and systems engineering, with a focus on reliability and scalability in cloud environments, specifically Azure.
Responsibilities
Design, implement, and maintain highly available and scalable systems across multi-region Azure cloud architectures
Ensure disaster recovery plans are in place and tested regularly
Configure and enhance monitoring and alerting processes using Prometheus, Grafana, Alertmanager, and OpsGenie
Develop dashboards to visualize system performance and reliability metrics
Utilize Terraform for infrastructure provisioning and management
Implement best practices for continuous deployment and infrastructure changes
Work closely with the development team to support ongoing development efforts
Communicate with the customer’s DevOps team to elaborate on requirements and collaborate on implementations
Enhance release management and CI/CD processes using Jenkins
Improve system security based on recommendations from the security team
Write and test runbooks to streamline operational tasks and incident response
Manage and optimize services running in Kubernetes and Docker/Linux environments
Handle data persistence using Cosmos DB (Mongo API & SQL API) and MS SQL Server
Work with messaging systems like RabbitMQ, Kafka, and EventHub
Utilize Azure Networking for secure and efficient communication
Requirements
5+ years of experience as a DevOps engineer or SRE
Proven experience with multi-region Azure cloud architectures
Proficiency in Kubernetes and containerization technologies
Strong knowledge of Cosmos DB (both Mongo API & SQL API) and MS SQL Server
Familiarity with monitoring tools like Prometheus, Grafana, Alertmanager, OpsGenie
Experience with .NET Core and ASP.NET Core applications
Competency in Docker and Linux environments
Expertise in Terraform for infrastructure as code
Experience with CI/CD tools
Solid understanding of Azure Networking concepts
Excellent communication skills, both verbal and written
Strong self-motivation and ability to self-manage tasks and projects
Nice to have
Experience with Azure IoT Hub and EventHub
We offer
Engineering Heritage: Best-in-class experts sharing a culture of engineering excellence and tackling complex engineering challenges for over 30 years.
Advanced Tech Stack: Innovative projects where you can apply or enhance your expertise in Cloud, Data, AI, and other emerging technologies
World-Class Clients: Work closely with 295+ of the Forbes Global 2000 on creating disruptive solutions that make a global impact
Professional Growth: Exceptional support for career development with comprehensive resources for upskilling or reskilling in pioneering practices
GenAI Community: Strong AI competencies with 600+ experts across 55+ locations driving GenAI-enabled transformation journeys
Entrepreneurial Culture: If you're passionate and dedicated to improving business transformation, we provide the support you need to bring your ideas to life
Hybrid Setup: The flexibility to work from any location in Lithuania, whether it's your home or our dynamic offices in Vilnius and Kaunas
Other Benefits: Additional vacation and trust days, private health insurance, Employee Stock Purchase Plan and more
Salary range €4.8K-€6.7K gross, based on your experience and interview results.
Join our team in our cozy offices in Vilnius or Kaunas.
About EPAM
EPAM is a leading global provider of digital platform engineering and development services. For over 30 years, our team has helped leading brands navigate the waves of digital transformation, building solutions that help them stay competitive through constant market disruption.
With offices in 55+ countries, EPAM has grown in Lithuania to more than 1,200 talented innovators in just 4 years. We foster creativity and unconventional ways of doing things, welcoming like-minded professionals to join us.
Feel free to work remotely from anywhere across Lithuania or connect with colleagues at our Vilnius and Kaunas offices.