The Cloud Operations Engineer is responsible for ensuring the operational integrity, stability, and reliability of cloud-based systems and applications hosted on AWS. This role involves managing, monitoring, and automating AWS infrastructure and services, implementing configuration management practices, and leveraging Infrastructure as Code (IaC) to streamline operations. It also involves collaborating with stakeholders and adhering to security and compliance standards.
Duties & Responsibilities
AWS Infrastructure Management
Design, install, configure, and maintain AWS cloud infrastructure, including server and serverless architectures.
Architect and design improvements using new native solutions in AWS and/or alternative cloud environments.
Apply good knowledge of core cloud services such as VMs, Containers, App Services, Virtual Networks, ASGs, Application Gateways, Load Balancers, and S3 accounts.
Regularly evaluate cloud applications, designs, and best practices.
Implement secure networking solutions such as VPCs, subnets, routing, and security groups.
Manage security groups, IAM roles, and policies for secure access control.
Automation and Optimization
Create and maintain automated solutions for repetitive tasks, including infrastructure provisioning, monitoring, and patching.
Optimize cloud resources for cost management and performance.
Implement auto-scaling and elasticity solutions to ensure infrastructure reliability and efficiency.
Configuration Management and Infrastructure as Code (IaC)
Use Infrastructure as Code tools to provision, manage, and version AWS resources.
Implement and maintain configuration management tools to standardize and automate configurations.
Monitoring and Alerting
Configure and maintain monitoring and alerting systems to ensure the health and performance of the infrastructure.
Application Deployment and Support
Deploy, configure, and support applications and services on AWS infrastructure.
Collaborate with DevOps teams to optimize and automate CI/CD pipelines.
Change and Incident Management
Prepare and document Change Requests and Methods of Procedures (MOPs) for infrastructure changes.
Troubleshoot and resolve incidents affecting cloud services in accordance with predefined Service Level Agreements (SLAs).
Escalate issues to internal teams or third-party vendors as required.
Incident Response and Post-Incident Reporting
Participate in incident response processes and post-incident reporting (PIR).
Identify and deploy fixes to prevent recurrence of issues.
Security and Compliance
Work with IT Security to ensure cloud infrastructure aligns with security best practices and compliance requirements.
Work with internal architecture, Solution Delivery (PMO), governance, and security teams to ensure that all security, governance, and business continuity requirements and best practices are integrated and implemented.
Support cloud governance: Cost Optimization, Data Management, Asset Management, Performance Management, and Deployment Acceleration.
Implement encryption, backup, and disaster recovery solutions.
Safety and Compliance
Actively engage in the company’s Safety Management System (SMS) by reporting hazards and incidents encountered during daily operations.
Collaboration and Documentation
Coordinate with internal stakeholders, DevOps, and third-party vendors for deployments and fixes.
Work within a cross-functional team of System Ops Engineers, App Admins, Data Engineers, DevOps, and DBAs to specify, design, develop, test, and implement AWS cloud services and solutions.
Provide guidance and knowledge to other team members, and promote efficiency, productivity, innovation, and knowledge-sharing across multi-functional teams.
Work closely with internal business partners to gather requirements, design and implement solutions, manage technical operations, and triage and resolve operational issues.
Maintain detailed and accurate system, application, and infrastructure documentation.
On-Call Support
Participate in an after-hours on-call rotation for critical incident resolution.
Other Responsibilities
Perform additional related duties as assigned by management.
Behavioural Competencies
Concern for Safety: Identifying hazardous or potentially hazardous situations and taking appropriate action to maintain a safe environment for self and others.
Teamwork: Working collaboratively with others to achieve organizational goals.
Passenger/Customer Service: Providing service excellence to internal and/or external customers (passengers).
Initiative: Dealing with situations and issues proactively and persistently, seizing opportunities that arise.
Results Focus: Focusing efforts on achieving high quality results consistent with the organization’s standards.
Fostering Communication: Listening and communicating openly, honestly, and respectfully with different audiences, promoting dialogue and building consensus.
Qualifications
Strong experience with AWS services such as EC2, S3, RDS, Lambda, CloudFormation, and VPC.
5+ years of experience in IT Infrastructure Operations with a minimum of 3 years of experience in AWS.
Proficiency in automation tools (e.g., AWS CLI, Terraform, or CloudFormation).
Familiarity with monitoring tools like CloudWatch and Dynatrace.
Experience with scripting languages such as Python, PowerShell, or Bash.
Experience with implementing, supporting, and monitoring servers and applications in both Windows and Linux environments.
Experience integrating applications and systems.
Understanding of ITIL practices and change management processes.
Strong problem-solving skills and ability to perform under pressure.
Excellent communication and documentation skills.
Experience with DevOps, CI/CD pipelines, and configuration management is an asset.
Willingness to work flexible hours, including after-hours and on-call rotations.
Location
Toronto Downtown Office (250 Yonge Street)
Company Description
Since 2006, Porter Airlines has been elevating the experience of economy air travel for every passenger, providing genuine hospitality with style, care and charm. Porter’s fleet of Embraer E195-E2 and De Havilland Dash 8-400 aircraft serves a North American network from Eastern Canada. Headquartered in Toronto, Porter is an Official 4 Star Airline® in the World Airline Star Rating®. Visit www.flyporter.com or follow @porterairlines on Instagram, Facebook and Twitter.