Meta IT North America

Site Reliability Engineer - Python (Hybrid)

São Paulo, SP, BR

about 1 month ago

Summary

Our partners are seeking Site Reliability Engineers for our team who thrive on pushing the limits of technology to produce state-of-the-art solutions. The SRE team is challenged with creating scalable solutions for monitoring live trading infrastructure, building command frameworks, and generating actionable alerts for on-call operations members. The team also plays a vital role in providing proactive support by responding to alerts, diagnosing issues, and ensuring the continuous availability of the trading platforms.

What will you be involved with?

  • Code, script and automate using Python and Go
  • Implement new product features, as well as enhance and maintain existing functionality by monitoring solutions and performance characteristics
  • Create/enhance tools to make operational workflows more automated and less error-prone
  • Provide troubleshooting and support for trading system issues across the software, hardware, and network stacks to ensure that services are restored immediately
  • Participate in design discussions, review sessions and prototyping
  • Ensure the scalability and quality of all code
  • Assist with product documentation, unit testing, monitoring and ensuring overall product quality
  • Work with application teams to ensure they provide proper monitoring and tools before their applications move into the production environment

What Will You Bring to the Table?

  • Minimum AWS Certification (Associate Level)
  • Minimum Red Hat Certification (RHCSA or higher)
  • Minimum 3 years of experience with Python
  • Familiarity with Terraform
  • Experience with Ruby and Go a plus
  • Experience with observability and monitoring tools like Grafana or ELK a plus
  • Ability to write Chef recipes and cookbooks
  • Understanding of network protocols, load balancing, and HAProxy
  • Solid understanding of functional programming, object-oriented programming, and computer science foundations
  • Good understanding of low-latency backend and server-side components
  • Strong, proven communication skills
  • Proven experience working within Agile/Scrum development methodologies, participating in sprint planning, daily stand-ups, and retrospectives

Location: São Paulo - Hybrid (2x a week)
