Lead Data Engineer, Behavioral AI Infrastructure
Compensation: $250,000 + equity
Location: US Remote or Hybrid (East Coast preferred)
Join a pioneering machine learning company that transforms messy, hard-to-use behavioral data into powerful, privacy-safe insights for some of the world’s most data-driven enterprises.
This stealth-mode scale-up is reshaping how third-party data fuels predictive models by converting fragmented, high-liability consumer data into compact, ML-ready features without exposing any raw information. With over $100M in non-dilutive funding and strategic partnerships in place, the company is approaching unicorn status and scaling fast.
The Opportunity
This is a high-impact leadership role for a systems-minded data engineer who wants to shape the technical backbone of privacy-first behavioral AI. You'll architect and evolve critical data workflows—balancing scale, performance, cost, and reliability—while enabling ML teams to move fast with confidence.
What You’ll Do
- Own end-to-end data infrastructure powering behavioral ML systems
- Design and optimize pipelines for batch and streaming ingestion, transformation, and feature generation
- Define schemas, data contracts, lineage, and orchestration best practices
- Collaborate across engineering and ML teams to build scalable abstractions
- Lead by example in coding, architecture, and a reliability-first mindset
- Balance pragmatic delivery with long-term platform quality
What You’ll Bring
- Proven experience building robust, modular data workflows at scale
- Strong command of architectural tradeoffs: batch vs stream, push vs pull, eventual consistency, etc.
- Ability to influence engineering standards and data conventions in evolving environments
- Deep expertise in tools like Delta Lake, Spark (Scala and/or Python), and Airflow
- Pragmatic mindset: knowing when to polish and when to ship
- Clear systems thinking and comfort navigating ambiguity
Tech Stack
- Delta Lake
- Apache Spark (Scala & Python)
- Airflow
Why Join?
- Work with massive, real-world behavioral datasets
- Help redefine how machine learning leverages third-party data—ethically and effectively
- Collaborate with top-tier talent in data engineering, AI research, and product
- Be part of a mission-driven team using data for positive, privacy-respecting impact
- Shape foundational infrastructure as the company scales 10x
About People In AI
We’re a specialized recruitment partner helping exceptional technical talent connect with ambitious AI-driven companies. Our focus is on meaningful work, ethical innovation, and long-term fit.