Upstart Inc.

Senior Software Engineer, Site Reliability Tooling

94401, San Mateo, CA, United States

Remote
Full-time/Part-time
24 days ago
Save Job

Summary

The Team Upstart's Site Reliability Engineering (SRE) team owns the reliability, resiliency, and observability of Upstart's production systems. The SRE team builds tooling and automation to monitor the health of our infrastructure and create a fast, reliable, and productive environment for other engineers and a world-class experience for our customers. SRE defines Upstart's strategy for technology operations risk mitigation, which includes disaster planning and on-call procedures. We use data-driven approaches to drive our decisions, and provide reports and insights to the business to improve visibility into the system and customer experience. As a Senior Software Engineer focused on Site Reliability Tooling your work will directly impact the success of the SRE team and all of Upstart. Your expertise will inform the team's direction, and your work with other SREs and Upstart engineers will make Upstart's systems as effective as possible for our customers. SRE at Upstart is ever-changing, and you will be a primary contributor in shaping our future path. How you'll make an impact: * Embody and share SRE principles at Upstart * Exercise state-of-the-art SRE practices throughout the company. * Uphold a culture of visibility, ownership, and responsibility around service reliability. * Implement standards for monitoring microservices, web apps, mobile apps, databases, Kubernetes clusters, and machine learning platforms, in a fast-paced environment. * Improve incident response practices, both within SRE and throughout the company. * Automate away toil that make sense to be automated. What we're looking for: * Minimum requirements: * Minimum of 6 years combined experience between Software Engineering, Site Reliability, and/or DevOps Engineering including CI/CD, TDD, internal tooling, observability, and other agile development practices. * Proficiency coding Python, Go, JavaScript/TypeScript * Proficiency with Terraform and Infrastructure as Code * Software engineering background with experience building internal tooling from scratch, and other agile development techniques * Experience with on-call and incident management environments. * Experience with observability, monitoring, and reporting tools (e.g., Datadog, Prometheus, etc.) * Experience supporting SaaS software in a microservice-oriented cloud environment * Ability to work with multiple teams for enterprise-wide deliverables * Preferred qualifications: * Experience with IaC technologies like CDK or Pulumi * Full Stack development skills * Experience building tooling for an observability platform Position Location - This role is available in the following locations: Remote, San Mateo, Columbus, Austin Time Zone Requirements - This team operates across all U.S. time zones. Travel Requirements - This team has regular on-site collaboration sessions. These occur 3 days per quarter at an Upstart office. If you need to travel to make these meetups, Upstart will cover all travel related expenses. What you'll love: * Competitive Compensation (base + bonus & equity) * Comprehensive medical, dental, and vision coverage with Health Savings Account contributions from Upstart * 401(k) with 100% company match up to $4,500 and immediate vesting and after-tax savings * Employee Stock Purchase Plan (ESPP) * Life and disability insurance * Generous holiday, vacation, sick and safety leave * Supportive parental, family care, and military leave programs * Annual wellness, technology & ergonomic reimbursement programs * Social activities including team events and onsites, all-company updates, employee resource groups (ERGs), and other interest groups such as book clubs, fitness, investing, and volunteering * Catered lunches + snacks & drinks when working in offices #LI-REMOTE #LI-MidSenior

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job