CXC

Senior Cloud Applications Engineer

Mandaluyong, National Capital Region, Philippines

25 days ago

Summary

You will act as a subject matter expert for distributed application systems, leveraging your expertise to drive operational improvements, manage incidents, and maintain enterprise applications in a secure, scalable environment. Your efforts will help ensure the optimal performance and availability of systems used by both internal teams and external partners.


Your Qualifications

  • Bachelor’s degree in Information Technology, Engineering, or a related field
  • At least 5 years of experience in Linux system administration (RHEL, CentOS, or similar)
  • Strong background in supporting critical production systems and driving operational improvements through automation
  • Deep understanding of distributed systems, microservices architecture, and Platform as a Service (PaaS) support
  • Hands-on experience with infrastructure troubleshooting, including hardware and OS-level issues
  • Proficient in containerization and orchestration tools such as Docker and Kubernetes
  • Skilled in using monitoring and logging tools like Splunk, Grafana, and Prometheus
  • Experienced in incident management using tools such as PagerDuty and ServiceNow
  • Familiar with APIs and structured data formats such as JSON and YAML
  • At least 3 years of experience managing CI/CD pipelines with Spinnaker and version control with Git
  • Proficient in infrastructure configuration tools like Puppet, Ansible, or Salt
  • Practical knowledge of hosted services including content delivery networks, messaging systems, API gateways, and proxies
  • Strong verbal and written communication skills, with the ability to explain complex topics clearly


Nice to Have

  • Relevant industry certifications such as CKA (Certified Kubernetes Administrator) or CKAD (Certified Kubernetes Application Developer)


Responsibilities

  • Act as the subject matter expert for distributed applications running on hybrid cloud platforms.
  • Lead incident response efforts, including post-incident reviews and long-term resolution planning.
  • Drive continuous improvement through operational metrics, customer feedback, and root cause analysis.
  • Collaborate with development teams to investigate complex issues and develop scalable solutions.
  • Monitor, maintain, and optimize the performance and availability of enterprise applications and platforms.
  • Automate repetitive tasks and streamline operational processes to reduce manual effort.
  • Maintain detailed documentation for system setup, operations, and troubleshooting procedures.
