The Technology Services Group (TSG) is seeking an experienced Systems Engineer to join our global Cloud Operations team to manage, support and deploy cloud infrastructure solutions across Microsoft Azure, Amazon Web Services (AWS) and Google Cloud Platform (GCP).
You will work closely with internal business partners, architects, and the network, cybersecurity, finance and other groups to understand project requirements and deliver scalable, reliable solutions.
This role offers an exciting opportunity to work on cutting-edge cloud technologies, drive innovation, and shape the future of our cloud infrastructure to support the organization's growth and success. If you are passionate about cloud computing, possess strong technical acumen, and thrive in a collaborative environment, we encourage you to apply and join our team.
Skills and Competencies
- 5+ years’ experience managing production solutions in Azure/AWS/GCP.
- Strong knowledge of cloud services and their best practices: account/subscription management, compute, storage, databases, serverless, IAM, etc.
- Strong knowledge of cloud networking concepts and services: virtual networks, network segmentation, VPNs, load balancers, firewalls, DNS, etc.
- Strong experience and understanding of Infrastructure-as-Code tools such as Terraform, Azure CLI and AWS CLI.
- Experience with network monitoring and troubleshooting tools, such as Azure Network Watcher, AWS VPC Flow Logs, or GCP Network Intelligence Center.
- Experience investigating complex issues and performing corrective actions. Strong problem-solving skills and a track record of leading service recovery.
- Experience working with ITSM systems such as ServiceNow or Jira.
- Experience with Linux and Windows operating systems.
- Experience with Git, GitHub, Azure DevOps or other version control systems.
- Experience with monitoring tools such as Nagios, Zenoss, LogicMonitor or similar.
- Client-focused and solutions-driven.
- Exceptional written and verbal communication skills.
- Desired - Exposure to container-based workloads.
- Desired - Industry knowledge and business acumen.
Education
- Desired - Professional certification, e.g. AZ-104, SAA-C03, Google ACE or similar.
Responsibilities
- Independently manage, support and deploy cloud infrastructure solutions across Amazon Web Services (AWS), Microsoft Azure and Google Cloud Platform (GCP).
- Implement best practices for cloud resource provisioning, configuration management, monitoring and security.
- Develop and maintain infrastructure as code (IaC) using tools such as Terraform to automate deployment and configuration tasks.
- Lead service recovery from major incidents and facilitate mitigation within SLO targets.
- Work collaboratively with business and technology stakeholders in achieving full ITIL process compliance, including incident, change, problem, configuration and major incident processes.
- Work closely with the cybersecurity team to ensure applications and infrastructure meet key operational security metrics.
- Drive continuous improvement through the adoption of automation and orchestration.
- Identify opportunities to optimize cloud infrastructure performance, reliability, and cost.
- Give expert technical advice and solve complex problems.
- Interact across TSG towers and other business support areas for problem escalation, resolution, reporting and coordination.