Our partners are seeking Site Reliability Engineers for our team who thrive on pushing the limits of technology to produce state of the art solutions. The SRE team is challenged with creating scalable solutions for monitoring live trading infrastructures, building command frameworks, and generating actionable alerts for on call operations members as well as playing a vital role in providing
proactive support by responding to alerts, diagnosing issues, and ensuring the continuous availability of their trading platforms.
What will you be involved with?
Code, script and automate using Python and Go Lang
Implement new product features, as well as enhance and maintain existing functionality by monitoring solutions and performance characteristics
Create/enhance tools to make operational workflows more automated and less error-prone
Provide troubleshooting and support for trading system issues across the software, hardware, and network stacks to ensure that services are restored immediately
Participate in design discussions, review sessions and prototyping
Ensure the scalability and quality of all code
Assist with product documentation, unit testing, monitoring and ensuring overall product quality
Work with application teams to ensure they provide proper monitoring and tools before their application moves into prod environment
What Will You Bring to the Table?
Minimum AWS Certification (Associate Level)
Minimum RedHat Certification ( RHCSA or higher )
Minimum 3 years of experience with Python
Familiarity with Terraform
Experience with Ruby and Golang a plus
Experience with observability and monitoring tools like Grafana or ELK a plus
Ability to write Chef Manifests
Understanding of network protocols, load balancing, and HA Proxy
Solid understanding of functional programming, object-oriented programming, and computer science foundations
Good understanding of low-latency backend and server-side components
Proven and strong communication skills
Proven experience working within Agile/Scrum development methodologies, participating in sprint planning, daily stand-ups, and retrospectives.
Location São Paulo - Hybrid (2x a week)
How strong is your resume?
Upload your resume and get feedback from our expert to help land this job
How strong is your resume?
Upload your resume and get feedback from our expert to help land this job