Collabera

Machine Learning Engineer - Gen AI

Charlotte, NC, US

28 days ago

Summary

About Collabera:

Collabera is a leading global technology services and solutions provider committed to delivering high-quality, innovative solutions to our clients. Our diverse, global talent helps clients transform every aspect of their business and achieve exceptional results. We achieve success through collaboration and the use of our digital platforms. With AI, our extensive talent network and in-depth learning solutions on the newest technologies, we provide the best Talentforce for today, tomorrow, and the next ERA.


Location: Charlotte, NC (hybrid: 3 days a week onsite, 2 days remote)


About the Role:

Join our innovative team as an AI Machine Learning/LLM Engineer. You will be responsible for designing, developing, and deploying scalable AI solutions. You will work with a team of skilled professionals to create and optimize AI models, leveraging state-of-the-art technologies in cloud-native architectures and generative AI frameworks. Your expertise in API development, real-time data streaming, and distributed computing will be crucial in delivering high-performance AI applications.

Key Responsibilities:

  • Develop and implement APIs using FastAPI, Uvicorn, and Swagger to support AI and machine learning applications.
  • Design cloud-native architectures for scalable and efficient deployment of AI models.
  • Utilize generative AI models such as LLaMA and Mistral to create innovative solutions.
  • Optimize and deploy AI models on GPU clusters, leveraging parallel processing for deep learning and generative AI applications.
  • Implement multi-GPU training and distributed computing frameworks like TensorFlow Distributed, PyTorch Distributed, and Horovod to enhance AI/ML workloads.
  • Configure and manage NVIDIA GPUs and Google Cloud Platform (GCP) resources, including TPUs and GPU instances.
  • Utilize Apache Spark (PySpark) and Kubernetes for distributed data processing and orchestration.
  • Implement real-time data streaming solutions using Apache Kafka.
  • Collaborate with cross-functional teams to integrate AI solutions into existing systems and workflows.

Qualifications:

  • Proven experience in AI/ML engineering, with a focus on large language models (LLMs) and generative AI.
  • Strong proficiency in Python and experience with frameworks like Django.
  • Hands-on experience with distributed computing frameworks and multi-GPU training.
  • Knowledge of cloud platforms, especially GCP, and experience configuring GPU and TPU instances.
  • Familiarity with Apache Kafka for real-time data streaming.
  • Experience in optimizing AI models for performance and scalability.
  • Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment.
  • Excellent communication skills and the ability to convey complex technical concepts to non-technical stakeholders.

Preferred Qualifications:

  • Experience with Apache Spark (PySpark) and Kubernetes.
  • Familiarity with Uvicorn and Swagger for API development.
  • Previous experience with LLaMA, Mistral, or similar generative AI models.


Why Join Us?

  • Be part of a high-impact team at the forefront of technology and finance transformation.
  • Opportunity to work on cutting-edge engineering projects with top-tier tools and frameworks.
  • Collaborate with a team of experts passionate about driving innovation in financial technology.


How to Apply:

If you're passionate about machine learning and generative AI, and you're looking to make a significant impact in a forward-thinking financial institution, we'd love to hear from you. Please submit your resume and a cover letter detailing your relevant experience and interest in the role.
