Tesla

Staff Site Reliability Engineer, Fleetnet

94306, Palo Alto, CA, United States

Onsite
Full-time
2 days ago
Save Job

Summary

We are a product focused global team creating the next-generation of server-side infrastructure and code to support the growing suite of Tesla products and services. We are looking for seasoned SREs with domain expertise in areas related to developing infrastructure as a service, Kubernetes, Gitops, K8s Operator development, and platform security. The Fleetnet SRE team is part of the Vehicle Software division and is embedded with our backend application, data platform and navigation development teams. You will be part of a high-impact team at Tesla and play a key role in shaping the future of automotive and energy technology. Your work will have direct customer-facing impact by enabling customers to summon their Teslas safely and securely with just their phones, make it possible for customers to securely share access to their cars with friends and family, and deliver personalized experiences, so when drivers hop into a new Tesla, all their preferences follow them. In addition, the backend systems you will support enables seamless, over-the-air updates that keep Tesla vehicles and devices ahead of the curve. One day, when an autonomous vehicle arrives to drive you to your destination, you'll have played an integral part of making that vision a reality. Join us and you will work alongside world-class software and data engineers on some of the newest and most challenging IoT and service engineering problems in the world today. The platform you help us build and automate will be used daily by millions of Tesla owners (and tens of thousands of Tesla employees) to improve and enhance the functionality of our cars, chargers, and batteries worldwide. The Fleenet SRE position is a diverse job with a wide range of responsibilities and impact. If you're a highly self-motivated software engineer with a passion for driving infrastructure, security and reliability, Fleetnet SRE is a good fit for you. * Design and write software that enables rapid prototyping by development teams, while ensuring the highest levels of reliability and availability * Drive the migration of large-scale, distributed fleet applications towards cloud-native microservices * Influence architectural decisions with focus on security, scalability and high-performance * Automate the build and deployment of infrastructure using Docker, Kubernetes & other orchestration technologies in a hybrid-cloud environment * Setup and maintain monitoring, metrics & reporting systems for fine-grained observability and actionable alerting * Experience building and maintaining SaaS infrastructure * Expert skills with Linux, networking, storage and virtualization automation with tools like Kubernetes, Terraform, Ansible, Chef et al * Setting up and supporting CI/CD * Proficiency in a high-level language like Python, Go, Ruby and/or Java * Scaling through data-driven capacity planning, within both physical data centers and Cloud infrastructure (AWS, GCP or Azure) * Troubleshooting and full-cycle incident response (mitigation, correction, prevention) * Strong belief in spreading (& acquiring) knowledge through mentorship and acting like an owner * Smart but humble, with a bias for action and for enabling others' success

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job