Meta Platforms, Inc.

Software Engineer Systems Machine Learning - Frameworks

Menlo Park, CA, US

Onsite
Full-time
2 months ago
Save Job

Summary

You will be a member of the Inference Enablement Team and part of the bigger industry-leading PyTorch AI framework organization. We build the foundational technology and frameworks that enable and optimize state-of-the-art model architectures on new hardware. We work with many different types of model architectures across Ads, Reels, Feed, Marketplace, IG, and GenAI. We support the enablement of models on all kinds of hardware as well, including CPU, GPU, and custom Silicon. We build the model publishing frameworks and platforms to accomplish this at scale for a large, broad set of models that are pushing the state of the art in Inference. Our work involves a blend of software systems and ML, and you'll have the opportunity to learn about how PyTorch works at a more in-depth level as well. We play an integral role in helping Meta ship the most important, state-of-the-art models for inference, including models with significant Ads Revenue gain, Reels Watch Time gain, all while saving multiple MegaWatts of power through our efficiency work. Come join us!Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Python and C/C++ programming skills. Experience in AI framework development or accelerating deep learning models on hardware architectures Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment. Knowledge of GPU, CPU, or AI hardware accelerator architectures. Experience working with kernel frameworks like Triton, or CUDA. OR AI frameworks: Experience in developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis and associated tooling development. OR AI Compiler: Experience with compiler optimizations such as loop optimizations, vectorization, parallelization, hardware specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, Halide is a plus.

How strong is your resume?

Upload your resume and get feedback from our expert to help land this job