Khushi Baby

Lead Data Engineer

Jaipur, RJ, IN

about 2 months ago

Summary

Khushi Baby, a nonprofit organization in India, serves as a technical partner to health departments. Founded in 2016 out of a Yale University classroom, it has grown into a team of more than 90 members with offices in Jaipur, Udaipur, Delhi, and Bengaluru.

Khushi Baby focuses on digital health solutions, health program strengthening, and R&D. Its flagship platform, the Community Health Integrated Platform (CHIP), supports over 70,000 community health workers across 40,000 villages, reaching 45 million beneficiaries. The platform has identified and monitored 5+ million high-risk individuals, with the Ministry of Health allocating ₹160 crore ($20M) for its scale-up.

CHIP has enabled initiatives like Rajasthan's digital health census, TB case finding, vector-borne disease surveillance, labor room monitoring, and immunization drives, co-designed with extensive field input.

In R&D, Khushi Baby advances community-level geospatial analysis and individual health diagnostics, including smartphone-based tools and low-literacy models. Programmatically, it focuses on maternal health, child malnutrition, and zero-dose children.

Backed by donors like GAVI, Skoll Foundation, and CSR funding, Khushi Baby partners with IITs, AIIMS Jodhpur, JPAL South Asia, MIT, Microsoft Research, WHO, and multiple state governments.

Khushi Baby seeks skilled, creative, and driven candidates eager to make a large-scale public health impact by joining its interdisciplinary team in policy, design, development, implementation, and data science.

Job Overview

We are looking for a Lead Data Engineer to design, build, and optimize scalable data systems for public health analytics. You will define data workflows, layer architecture, and pipelines, ensuring data quality, security, and efficiency while leading a team of engineers.

Key Responsibilities

  • Plan and define data architecture, modelling & workflows for efficient data processing.
  • Build and optimize ETL/ELT pipelines for structured & unstructured health data.
  • Ensure data quality, security, and compliance (FHIR, HL7, etc.).
  • Develop real-time & batch processing systems (Kafka, Flink, RisingWave, etc.).
  • Monitor & optimize performance, scalability, and cloud costs (AWS, GCP, Azure).
  • Manage & mentor data engineers, fostering technical growth.
  • Tune the performance of databases, queries, and pipelines.
  • Handle large-scale data efficiently while ensuring cost-effectiveness.
  • Implement access control, encryption, and compliance measures.
  • Work closely with data scientists, analysts, and business teams to define data requirements.
  • Maintain strong documentation of pipeline workflows and architecture.
  • Translate technical concepts into business-friendly insights.
  • Stay updated with the latest trends in data engineering and cloud technologies.
  • Set up new tools and frameworks, and adapt to evolving business needs.

Required Qualifications

  • Master’s in Computer Science, Data Engineering, or related field.
  • 7+ years in data engineering, 2+ years in leadership.
  • Expertise in SQL, Python, data modeling, and pipeline orchestration (Mage AI, Airflow, etc.).
  • Knowledge of partitioning, indexing, caching, and compression techniques, as well as data lineage, cataloging, and metadata management.
  • Experience with cloud data services.
  • Proficiency in big data processing (Iceberg/Delta).
  • Knowledge of real-time streaming & CDC tools (Kafka, Debezium, Redpanda, etc.).
  • Familiarity with public health standards (FHIR, HL7, ICD-10) is a plus.
  • Strong problem-solving, logical thinking, communication, and leadership skills.

Good to have

  • Experience in public health data projects and key health metrics.
  • Knowledge of data lakehouse architecture and federated learning.

Remuneration

The remuneration offered will depend on the candidate’s experience, skill set, and an evaluation based on our internal parameters.

  • Medical Insurance
  • Paid sick leave, paid parental leave and menstrual leave
  • Learning stipend policy
  • A flexible, enabling workplace environment with the opportunity to grow into leadership roles.
  • Opportunities to attend and actively participate in prestigious international conferences and workshops

Note: The candidate will be on probation for the first 90 days of the contract.
