Core Responsibilities
Technical Development & Innovation
• Develop and optimize state-of-the-art computer vision and multimodal models
• Design and implement advanced model architectures for visual understanding tasks
• Create robust evaluation frameworks and testing methodologies
• Drive technical improvements in model performance and efficiency
• Research and evaluate new approaches in computer vision and multimodal learning
• Implement novel computer vision algorithms and techniques
Project Leadership
• Lead technical implementation of ML projects
• Drive architectural decisions for model development
• Review code and model architectures
• Contribute to technical planning and decision-making
• Support cross-functional collaboration with product and infrastructure teams
• Help maintain technical standards and best practices
Infrastructure & Scale
• Implement scalable training and evaluation pipelines for computer vision models
• Build efficient visual data processing workflows
• Develop monitoring solutions for production ML systems
• Optimize model performance and resource utilization
• Support deployment and production maintenance
• Design efficient inference systems for visual processing
Required Qualifications
Education & Experience
• MS or PhD in Computer Science, Engineering, Mathematics, or related field
• 5+ years of experience in machine learning/AI
• Strong track record of shipping computer vision models to production
• Deep expertise in computer vision and deep learning
Technical Expertise
• Expert knowledge of computer vision techniques and architectures
• Deep expertise in modern deep learning architectures for visual and multimodal tasks
• Strong programming skills in Python and proficiency with PyTorch or equivalent frameworks
• Experience with cloud platforms and MLOps tools
• Demonstrated ability to build and optimize training pipelines
• Strong background in image processing and visual computing
Leadership & Communication
• Experience leading technical projects
• Strong problem-solving and analytical skills
• Excellence in cross-functional collaboration and technical communication
• Ability to explain complex technical concepts to both technical and non-technical audiences
This role offers the opportunity to work on cutting-edge computer vision and multimodal ML systems while solving complex technical challenges. The ideal candidate will combine deep computer vision expertise with strong implementation skills to deliver innovative solutions.