Job Title: Data Science Intern
Job ID: 0418
Work Mode: Remote
Experience: Fresher
Stipend: ₹10,000 – ₹12,000 per month
Role Overview
We are seeking a motivated Data Science Intern to join our AI/ML team. In this role, you will work on analyzing large-scale structured and unstructured datasets, building data pipelines, and leveraging state-of-the-art models such as OpenAI’s GPT-4 APIs and Google’s Gemini. This internship offers a unique opportunity to contribute to data-driven solutions in the health, wellness, and fitness domains.
Key Responsibilities
- Analyze structured and unstructured datasets, including conversational data, fitness records, and health reports
- Design and implement data pipelines for cleaning, transformation, and feature extraction
- Fine-tune and evaluate prompt strategies using OpenAI and Gemini APIs
- Perform model evaluation, benchmarking, and result explainability
- Build visualization dashboards and analytics tools for health and wellness data
- Support the development of personalized recommendation systems for fitness, nutrition, and mental health
- Explore and implement LLM + RAG (Retrieval-Augmented Generation) pipelines using domain-specific datasets
What We’re Looking For
- Final-year students or recent graduates in Computer Science, Data Science, AI/ML, or related fields
- Strong proficiency in Python, Pandas, NumPy, Scikit-learn, and data visualization libraries like Matplotlib or Seaborn
- Familiarity with LLMs (OpenAI, Gemini) and prompt engineering concepts
- Knowledge of tools like LangChain, Pinecone, Weaviate, or other Vector DBs is a plus
- Good understanding of statistics, model evaluation metrics, and real-world data challenges
- Bonus: Exposure to healthcare/wellness datasets, time-series analysis, or bioinformatics
What You’ll Gain
- Hands-on experience with real-world health and wellness datasets
- Exposure to cutting-edge AI/ML technologies and multimodal data analysis
- Mentorship from professionals associated with IITs, AIIMS, and global AI communities
- Potential opportunity for a pre-placement offer (PPO) based on performance
Skills: langchain,scikit-learn,seaborn,openai,python,data visualization,numpy,llms,pandas,data science,pinecone,weaviate,gemini,matplotlib