Compunnel Inc.

Site Reliability Engineer

Toronto, ON, CA

6 days ago
Save Job

Summary

  • SRE: In depth knowledge and experience in Observability, Toil Management, Monitoring tools (Dynatrace, CW, Azure Monitor), Resilient Arch, IaC, CaC, JSON, Typescript, API and Webhook development using Python, Node.js, Ruby, PowerShell, and Shell Scripting languages.
  • Cloud Experience: In depth knowledge in Cloud Native tools / services: CDK, Cloud Watch, EKS, EC2, ELB, S3, Lambda, & SSM.
  • In depth understanding of Dynatrace advanced features (DT Guardian, RUM, Synthetic testing and monitoring, AI event correlation)
  • Experience in Logs ingestion (AWS Firehose, DT Open Pipeline), Reporting and Dashboard tools, Operational Metrics and analytics
  • Automation: Leverage Ansible Tower, AWS SSM, BitBucket / GitHub to build automated workflow that eliminate Toil, improve response time and streamline deployment pipeline.
  • Cloud Orchestration tools (AWS Step functions, Containers, Apache Airflow) with special focus on Data Batch Processing and Pipelines
  • Deep knowledge in Data Management, Data Warehouse, Data lakes, & Database reliability (RedShift, RDS, Aurora), PostgreSQL, SQL Server, Oracle with DevOps experience.
  • Exceptional Problem-Solving skills, Knowledge Management and effective communicator that can speak the language of people, process and technology.
  • Decisive, energetic, focused team player who builds and leads high-performing teams / CoP and foster a culture of diversity, inclusion, recognition and growth.

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

People also searched: