📍 Remote (US-Based) | Full-Time
We're seeking a Senior Data Engineer to lead the design and development of a new, high-impact dataset from the ground up. This is a foundational role on a fast-growing team where you’ll own architecture decisions, shape our approach to data modeling, and build pipelines that power the next generation of our platform.
We’re tackling deep data normalization and identity resolution problems across a complex, multi-source domain — and we need someone who thrives on untangling messy data and building clean, scalable systems.
🔧 What You'll Do
- Architect and implement a new, large-scale data model to power customer-facing insights.
- Design and build ETL pipelines to ingest, clean, normalize, and enrich messy, real-world data.
- Build systems that can deduplicate entities across partial, conflicting, or inconsistent inputs.
- Evaluate and integrate tools for address standardization, fuzzy matching, and data linking.
- Select and implement appropriate data storage solutions (graph, relational, or hybrid).
- Collaborate with product and engineering leadership to define system requirements and data integrity standards.
- Mentor junior engineers and establish best practices for data quality, documentation, and monitoring.
📦 What You Bring
- 5+ years of experience in data engineering or backend systems focused on data processing.
- Experience with graph databases (e.g., Neo4j, GraphDb, Cloud Spanner Graph, Amazon Neptune)
- Experience with relational databases (PostgreSQL MySQL, etc.).
- Proven experience architecting data models and designing data pipelines from scratch.
- Strong command of Javascript/Typescript / Node.js and SQL.
- Deep understanding of data normalization, identity resolution, and deduplication techniques.
- Knowledge of record linkage, entity matching, or probabilistic modeling
- Thoughtful approach to data governance, integrity, and scale.
- Clear communicator who thrives in ambiguous, fast-moving environments.
✨ Bonus Points
- Experience working with geospatial data or address resolution
- Familiarity with address tools like libpostal, Google Maps API, or custom fuzzy matching libraries.
- Experience in early-stage or high-growth tech environments
🧭 Why Join Us
We’re building something ambitious from the ground up — with a small, smart, and highly collaborative team. You’ll get to shape core architecture, make big technical decisions, and see your work directly impact how our customers interact with our platform.