We are looking for a mid-level Data Engineer with a keen interest in the Data Science field to join our team. The ideal candidate will have a background in data engineering and software development, complemented by a strong curiosity for AI/ML, natural language processing (NLP), and agent-based systems.
In this role, you will focus on designing and maintaining scalable data pipelines and supporting the development of intelligent systems. You will be dedicated to a team working on AI agents and the infrastructure that powers them, contributing to the development of enterprise-grade cloud solutions using the latest AI technologies.
This is an opportunity to gain experience with real-world business cases, actively build the company's knowledge base in the field, and grow your expertise at the intersection of data engineering and artificial intelligence.
Challenges you'll tackle:
Develop and maintain ETL/ELT pipelines using PySpark in Azure Databricks, with SQL for data transformations and Python/Pandas for data manipulation, where applicable
Design and implement data models for structured and unstructured data
Work on NLP, AI/ML, and agentic networks to build intelligent solutions
Develop and optimise machine learning models and integrate them into data pipelines
Collaborate with Data Scientists and Engineers to implement data-driven solutions
Work with Git and version control to manage code and data pipelines effectively
Research and experiment with new AI/ML techniques and apply them to real-world business problems
Requirements
Skills for success:
2+ years of experience in Data Engineering and/or Data Science
Strong programming skills in Python
Basic proficiency in PySpark and SQL
Basic proficiency with Azure Databricks and cloud-based data engineering
Conceptual understanding of NLP, AI/ML, and agentic networks
Experience in data and process modeling for large-scale systems
Understanding of Git and software engineering best practices
Basic proficiency with data wrangling, transformation, and feature engineering
Problem-solving skills and the ability to work independently
Nice to Have:
Experience with MLOps and model deployment in production environments
Experience implementing CI/CD pipelines for automated data workflows and model deployment, and with containerization technologies such as Docker
Basic proficiency in Hugging Face, LangChain, and generative AI technologies for agentic networks
Understanding of data streaming (e.g., Kafka, Azure Event Hubs)
Knowledge of machine learning frameworks such as TensorFlow, PyTorch, or scikit-learn
Benefits
Competitive Compensation & Growth Opportunities
Dedicated training budget for conferences, online courses, and books to support continuous learning
Access to English and Lithuanian language lessons
Professional development through workshops, coaching sessions, and tech events
Work-Life Balance & Flexibility
Flexible working hours to suit your schedule
Unlimited work-from-home option for greater autonomy
A 300€ Personal Perks Pack to support your work-life balance needs
Community & Team Connection
Employee referral program with rewards up to 2000€ net
Clients & External Ambassadors program with rewards up to 5000€ net
Social events, including Summer/Winter parties and a Dev Day celebration
Team-building activities and annual live meet-ups with clients for enhanced collaboration
For this position, we offer a gross salary of 2975 € - 3636 € per month.
The final offer will depend on your experience and competencies.