Job Title: Data Engineer
Location: Hybrid Schedule for San Francisco Office location (Work Remotely Monday-Friday)
About Us:
Flashii App is a premier boutique Staffing Firm dedicated to helping clients find exceptional candidates who make a difference. We provide comprehensive staffing solutions, including project-based consulting and direct hire services.
About the Company:
The company provides a powerful suite of tools for businesses across the U.S, including identity verification (Know Your Business), enhanced due diligence, fraud prevention, risk profiling, lien filing, and portfolio monitoring. Their mission is to help companies make informed decisions with confidence, powered by proprietary private databases.
Job Overview:
As a Data Engineer you will be responsible with data collection across a variety of fragmented data sources available from both government, public, and private databases.
Requirements:
- 1–3 years of experience in data engineering, working with Python, SQL, and cloud-native data platforms
- Security & Governance: Partner with security and compliance teams to ensure data pipelines adhere to regulatory standards (e.g., SOC 2, GDPR, KYC/KYB)
- Experience with ETL (extract, transform, load)
- Experience with Database design (Primary key, foreign key, Indexing, Partitioning, Access patterns, and Migrations)
- Experience with Data pipelining/ building data pipelines end-to-end.
- Migrating production tables Indexing
- Working with large scale data (i.e. terabytes Access patterns)
- Maintaining operational efficiency of databases
- Normalizing disparate schemas to a single unified schema
- Abstracting reusable components
- Pipelines are rinse and repeat, minimize the amount of new code
- Core data concepts: ACID transactions, Idempotency, Orchestration
Experience with Technologies:
- Airflow
- Google Cloud Platform (GCP)
- GCP Dataflow (aka Apache Beam)
- PostgreSQL
- Python
- Pydantic (a python library)
- Distributed systems