Rivian and Volkswagen Group Technologies

Staff Site Reliability Engineer

Palo Alto, CA, US

$232.5k
5 days ago
Save Job

Summary

About Us

Rivian and Volkswagen Group Technologies is a joint venture between two industry leaders with a clear vision for automotive’s next chapter. From operating systems to zonal controllers to cloud and connectivity solutions, we’re addressing the challenges of electric vehicles through technology that will set the standards for software-defined vehicles around the world.

The road to the future is uncharted. By combining our expertise across connectivity, AI, security and more, we’ll map a new way forward. Working together, we’ll create a future that’s more connected, more intelligent, more sustainable for everyone.

Role Summary

We are seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to join our team. In this critical role, you will be instrumental in ensuring the reliability, performance, and scalability of our complex distributed systems. You will act as a central point of expertise, focusing on incident coordination, providing a crucial backstop for service owners, and proactively engaging with development teams to enhance service maturity. A strong understanding of our distributed architecture, data flows, and the ability to design innovative technology patterns to address challenges at scale are essential for success in this role.

Responsibilities

  • Incident Coordination: Lead and manage the end-to-end process for major incidents, ensuring effective communication, clear roles and responsibilities, and timely resolution. Drive post-incident reviews to identify root causes and implement preventative measures.
  • On-Call Backstop: Serve as an escalation point and provide expert support during on-call rotations for service owners, particularly for complex or systemic issues.
  • Service Maturity Engagement: Proactively engage with development teams on short-term projects to improve the reliability, observability, and operational excellence of their services. This includes guidance on SLO/SLI definition, error budgeting, monitoring strategies, and automation.
  • Distributed System Architecture Expertise: Maintain a comprehensive and up-to-date mental model of RVT's distributed system architecture, including inter-service dependencies, data flows, and critical infrastructure components.
  • Technology Pattern Design: Identify recurring challenges and design new, scalable technology patterns and best practices to improve the reliability, efficiency, and resilience of our systems. This may involve exploring new technologies and advocating for their adoption.
  • Performance Optimization: Analyze system performance, identify bottlenecks, and collaborate with development teams to implement optimizations for latency, throughput, and resource utilization.
  • Platform Conditioning: Contribute to capacity planning efforts by analyzing trends, defining saturation points of the system, and recommending scaling strategies.
  • Mentorship: Mentor and guide junior SRE team members, fostering a culture of learning and knowledge sharing.

Qualifications

  • Deep understanding of modern distributed systems principles, including microservices, Kubernetes, and cloud-native architectures.
  • Experience working with cell-based architectures and managing IoT environments at scale, including understanding the unique challenges and considerations of these systems.
  • Software development experience in modern programming languages such as Rust and Golang, with a strong understanding of software development lifecycle and best practices.
  • Extensive experience in designing and coordinating incident response plans and processes at scale in large cloud environments (e.g., AWS, Azure, GCP).
  • Strong knowledge of data platforms, including relational and NoSQL databases, data warehousing concepts, and data governance.
  • Experience with real-time streaming technologies (e.g., Kafka, Flink) and data lake architectures (e.g., S3, ADLS, Data Lake Storage).
  • Familiarity with global traffic management techniques (e.g., DNS-based routing, load balancing strategies, CDN).
  • Proficiency with observability tools and practices, including monitoring, logging, tracing, and alerting (e.g., Prometheus, Grafana, ELK stack, Datadog).
  • Excellent troubleshooting and analytical skills, with the ability to diagnose complex issues in distributed environments.
  • Strong communication and collaboration skills, with the ability to effectively communicate technical concepts to both technical and non-technical audiences.
  • Experience with infrastructure-as-code (IaC) tools like Crossplane or Terraform.
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

Pay Disclosure

Salary Range/Hourly Rate for California Based Applicants: $186,000 - $232,500 USD

Actual Compensation will be determined based on experience, location, and other factors permitted by law.

Benefits Summary: Rivian and Volkswagen Group Technologies provides robust medical, prescription, dental and vision insurance packages for full-time employees, their spouse or domestic partner, and their children up to age 26. Coverage is effective on the first day of employment.

Equal Opportunity

Rivian and Volkswagen Group Technologies is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, sex, sexual orientation, gender, gender expression, gender identity, genetic information or characteristics, physical or mental disability, marital/domestic partner status, age, military/veteran status, medical condition, or any other characteristic protected by law. We are also committed to ensuring compliance with all applicable fair employment practice laws regarding citizenship and immigration status.

Rivian and Volkswagen Group Technologies is committed to ensuring that our hiring process is accessible for persons with disabilities. If you have a disability or limitation, such as those covered by the Americans with Disabilities Act, that requires accommodations to assist you in the search and application process, please email us at [email protected].

Candidate Data Privacy

Rivian and VW Group Technologies (“Rivian and Volkswagen Group Technologies”) may collect, use and disclose your personal information or personal data (within the meaning of the applicable data protection laws) when you apply for employment and/or participate in our recruitment processes (“Candidate Personal Data”). This data includes contact, demographic, communications, educational, professional, employment, social media/website, network/device, recruiting system usage/interaction, security and preference information. Rivian and Volkswagen Group Technologies may use your Candidate Personal Data for the purposes of (i) tracking interactions with our recruiting system; (ii) carrying out, analyzing and improving our application and recruitment process, including assessing you and your application and conducting employment, background and reference checks; (iii) establishing an employment relationship or entering into an employment contract with you; (iv) complying with our legal, regulatory and corporate governance obligations; (v) recordkeeping; (vi) ensuring network and information security and preventing fraud; and (vii) as otherwise required or permitted by applicable law.

Rivian and Volkswagen Group Technologies may share your Candidate Personal Data with (i) internal personnel who have a need to know such information in order to perform their duties, including individuals on our People Team, Finance, Legal, and the team(s) with the position(s) for which you are applying; (ii) Rivian and Volkswagen Group Technologies affiliates; and (iii) Rivian and Volkswagen Group Technologies’ service providers, including providers of background checks, staffing services, and cloud services.

Rivian and Volkswagen Group Technologies may transfer or store internationally your Candidate Personal Data, including to or in the United States, Canada, and the European Union and in the cloud, and this data may be subject to the laws and accessible to the courts, law enforcement and national security authorities of such jurisdictions.

Please see our Candidate Data Privacy Notice (English) and Candidate Data Privacy Notice (Serbian) for more information.

Please note that we are currently not accepting applications from third party application services.

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job

People also searched: