Job Title: Linux Systems Engineer
Position Summary:
We're on the hunt for a seasoned AWS Cloud Linux Systems Engineer to strengthen our IT/DevOps team. This role is ideal for someone who thrives in a cloud-first environment and brings deep experience in managing Linux systems, automating infrastructure, and maintaining high-performing, secure cloud environments. You'll be responsible for architecting, deploying, and supporting critical AWS services while ensuring system reliability and performance.
Key Responsibilities
Cloud Infrastructure & System Management:
- Deploy and manage scalable, secure Linux environments across AWS and, occasionally, Azure or GCP.
- Handle core AWS services such as EC2, S3, VPCs, Load Balancers, and RDS; bonus if you're comfortable with GCP equivalents like Compute Engine and Cloud Functions.
- Ensure the infrastructure is robust, cost-effective, and designed for high availability.
- Collaborate with cross-functional teams to design and integrate cloud-native solutions into CI/CD pipelines.
Automation & Infrastructure as Code:
- Build and maintain Infrastructure as Code (IaC) using tools like Terraform and AWS CloudFormation.
- Streamline operations through automation with Ansible, Chef, or Puppet.
- Write and maintain scripts that help manage and scale cloud environments efficiently.
Cloud Security & Compliance:
- Apply best practices in cloud security-controlling access, managing IAM roles, securing endpoints, and encrypting data.
- Conduct regular audits and vulnerability assessments.
- Work alongside security teams to address compliance, access controls, and infrastructure hardening.
Monitoring, Performance & Optimization:
- Implement and manage tools such as Prometheus, Grafana, AWS CloudWatch, Datadog, or similar.
- Identify and resolve performance bottlenecks, manage resource usage, and optimize costs.
- Configure alerts and manage logs using solutions like ELK Stack, Splunk, or native cloud logging services.
Linux Systems & Application Management:
- Administer Linux-based environments including updates, patching, and troubleshooting.
- Configure and manage essential services like Apache, NGINX, MySQL, and PostgreSQL.
- Manage storage solutions such as EBS volumes and S3 buckets.
Collaboration & Documentation:
- Partner with Dev, QA, and Ops teams to ensure reliable application deployment and system stability.
- Maintain comprehensive documentation covering infrastructure, processes, and policies.
- Support and mentor junior team members through knowledge sharing and technical guidance.
Incident Response & Root Cause Analysis:
- Respond promptly to critical incidents affecting cloud infrastructure.
- Conduct root cause analysis and implement long-term solutions to recurring problems.
Qualifications & Skills
Required Education & Experience:
- Bachelor's degree in Computer Science, IT, or equivalent professional experience.
- 7-9 years of experience in cloud infrastructure, specifically AWS.
- 5-7 years of deep Linux systems engineering experience (Ubuntu, CentOS, or RedHat).
Core Technical Skills:
- Expertise in AWS cloud technologies and infrastructure management.
- Strong Linux administration background.
- Hands-on experience with IaC tools such as Terraform and CloudFormation.
- Proficiency in automation tools like Ansible, Puppet, or Chef.
- Solid grasp of cloud networking: DNS, VPNs, CIDR, subnets, and VPCs.
- Knowledge of cloud security principles and IAM management.
- Experience with monitoring/logging tools like CloudWatch, Datadog, LogicMonitor, or similar.
Preferred Qualifications:
- Relevant certifications (AWS, ITIL, CAMP, etc.).
- Familiarity with asset management and compliance in cloud environments.
- Experience with cloud-native monitoring and alerting systems.
Soft Skills:
- Strong communication skills and a proactive mindset.
- Exceptional troubleshooting ability and a customer-service-first attitude.
- Ability to work independently and as part of a team in a fast-paced, evolving environment.
- Flexibility for after-hours or weekend work during updates, outages, or project rollouts.