Project description
Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team.
We are seeking an experienced individual proficient in HIP / ROCm applications to join our team. The primary responsibility of this role will be to lead the effort in porting CUDA kernels to HIP. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.
Responsibilities
The main task will be to help port CUDA kernels on HIP
Collaborate with development teams to optimize and enhance GPU-accelerated applications.
Debug, profile, and fine-tune code for performance improvements.
Stay updated with the latest advancements in GPU architectures and programming models.
Skills
Must have
CUDA or HIP
GPGPU
C/C++
Python
One of AI/ML/DL/NN/NLP/Computer Vision
Mandatory Skills Description:
Proficiency with C++ and GPU Assembler
Proficiency in CUDA or HIP / ROCm programming
Solid understanding of GPU architectures, parallel programming models, and optimization techniques
Strong problem-solving skills and the ability to work in a collaborative environment
Other
Languages
English: B2 Upper Intermediate
Seniority
Senior
We offer numerous benefits such as:
⏰ Flexible work schedule
🧑 Great company culture and friendly environment
🚀 Work within a fast-moving, exciting, and challenging environment
🎓 Talent development ecosystem
👨🏻💻 Luxoft Training Center services with ad-hoc leadership and technical programs
📚 Knowledge sharing in professional communities
🧠 Meetings for knowledge sharing, celebrations, and brainstorming: your ideas count!
🏆 Regular team-building activities
💸 Variety of discounts for our employees