Are you dedicated to helping those around you become more effective? Do you see technology as a means to enhance efficiency and transform business operations? We are seeking candidates committed to building and supporting teams, tackling complexity head-on, and developing scalable software platforms.
AI Infrastructure's mission is to build foundational infrastructure software systems, tools, and services that empower researchers and engineers across Meta to develop industry-leading large language models, multimodal generative foundation models, and ranking models.
As a Technical Program Manager (TPM), you will play a critical role within the AI Infrastructure team, leading large-scale, highly technical, cross-functional projects that span Meta. Our team includes individuals with diverse experience and backgrounds. While relevant experience is important, demonstrated capabilities and a problem-solving approach are ultimately the most valuable qualities. Meta's infrastructure continually pushes the boundaries of what is possible, and we need Technical Program Managers who can do the same.8+ years of experience in software engineering, systems engineering, or technical product/program management. Experience managing the delivery of technical programs or products from inception through to successful delivery. Experience in implementing monitoring systems and instrumentation solutions to track performance and identify issues. Familiarity with technical program management methodologies, including Agile, Waterfall, or hybrid approaches, to drive project success and ensure efficient execution. Proven track record of working independently, taking initiative while proactively seeking feedback and input when necessary to ensure alignment and improve outcomes. Proven track record of leading by influencing others rather than relying on direct authority, fostering collaboration and alignment across teams. Demonstrated experience in communicating and collaborating with technical management teams to design, develop, and implement systems, solutions, and products. Proven experience in analytical thinking and problem-solving, particularly with large-scale systems. Experience building and maintaining work relationships across multidisciplinary teams and with partners in different time zones. Experience supporting large-scale infrastructure software programs, especially those focused on AI/ML. Experience in a start-up-like environment, emphasizing technical infrastructure management, where agility and rapid execution are essential for scaling systems and tackling growth-related challenges. Experience working with large GPU clusters.